Write a Java program to solve the questions

Reference no: EM131420096

Consider the following maze environment. Write a Java program to solve the following questions. 

[Figure: maze environment (2418_Figure.png)]

The transition model is as follows: the intended outcome occurs with probability 0.8, and with probability 0.1 each the agent moves at one of the two right angles to the intended direction. If the move would make the agent walk into a wall, the agent stays in the same place as before. The reward is -0.04 for each white square, +1 for each green square, and -1 for each brown square. Note that there are no terminal states; the agent's state sequence is infinite.
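The transition model above can be sketched in Java as follows. This is only an illustration under stated assumptions: the 3x4 grid, the class name `MazeTransitionSketch`, and the helper names are hypothetical and do not reproduce the maze in the figure. Squares are encoded as `row * width + col`.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** Sketch of the stochastic transition model on a hypothetical grid (NOT the assignment's maze). */
class MazeTransitionSketch {
    // Hypothetical layout: '#' is a wall, every other character is a walkable square.
    static final String[] GRID = {
        "G#..",
        "..B.",
        "...."
    };
    // Actions indexed 0..3 in clockwise order: UP, RIGHT, DOWN, LEFT.
    static final int[] DR = {-1, 0, 1, 0};
    static final int[] DC = {0, 1, 0, -1};

    /** Squares outside the grid are treated as walls. */
    static boolean isWall(int r, int c) {
        return r < 0 || r >= GRID.length || c < 0 || c >= GRID[0].length()
                || GRID[r].charAt(c) == '#';
    }

    /** Deterministic move: returns the destination index, or the current square if blocked. */
    static int move(int r, int c, int dir) {
        int nr = r + DR[dir], nc = c + DC[dir];
        if (isWall(nr, nc)) { nr = r; nc = c; }
        return nr * GRID[0].length() + nc;
    }

    /** Probability distribution over successor squares for one action. */
    static Map<Integer, Double> transitions(int r, int c, int action) {
        Map<Integer, Double> dist = new LinkedHashMap<>();
        dist.merge(move(r, c, action), 0.8, Double::sum);            // intended direction
        dist.merge(move(r, c, (action + 3) % 4), 0.1, Double::sum);  // right angle, counter-clockwise
        dist.merge(move(r, c, (action + 1) % 4), 0.1, Double::sum);  // right angle, clockwise
        return dist;
    }

    public static void main(String[] args) {
        // Bottom-left square, action RIGHT.
        System.out.println(transitions(2, 0, 1));
    }
}
```

Using `merge` with `Double::sum` means that when two outcomes land on the same square (for example, both sideways moves are blocked), their probabilities are added rather than overwritten, so each distribution still sums to 1.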

Part 1: Assuming the known transition model and reward function listed above, find the optimal policy and the utilities of all the (non-wall) states using both value iteration and policy iteration. Display the optimal policy and the utilities of all the states, and plot utility estimates as a function of the number of iterations as in Figure 17.5(a) of the reference book, "Artificial Intelligence: A Modern Approach" by S. Russell and P. Norvig, third edition (for value iteration, you should need no more than 50 iterations to get convergence). In this question, use a discount factor of 0.99. Below are some reference utility values (computed with a different discount factor) to help you get an idea of whether the trend of your answers is correct.

Part 2: Design a more complicated maze environment of your own and re-run the algorithms designed for Part 1 on it. How do the number of states and the complexity of the environment affect convergence? How complex can you make the environment and still be able to learn the right policy?

Using the method of value iteration for Part 1

  • Descriptions of implemented solutions
  • Plot of optimal policy
  • Utilities of all states
  • Plot of utility estimates as a function of the number of iterations
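As a starting point for the value iteration deliverables, here is a minimal sketch of the Bellman update U(s) = R(s) + gamma * max over actions of the expected utility of the successor. It assumes a hypothetical 3x4 grid rather than the maze in the figure; the class name `ValueIterationSketch`, the stopping threshold `epsilon`, and the cap `maxIterations` are illustrative choices, not part of the assignment. Recording the utilities after each sweep, which the plot requires, is left out for brevity.

```java
import java.util.Arrays;

/** Value-iteration sketch. The grid below is hypothetical, NOT the assignment's maze. */
class ValueIterationSketch {
    // Hypothetical layout: '#' wall, 'G' green (+1), 'B' brown (-1), '.' white (-0.04).
    static final String[] GRID = {"G#..", "..B.", "...."};
    static final int ROWS = GRID.length, COLS = GRID[0].length();
    static final int[] DR = {-1, 0, 1, 0};   // actions 0..3: UP, RIGHT, DOWN, LEFT
    static final int[] DC = {0, 1, 0, -1};
    static final double GAMMA = 0.99;        // discount factor from Part 1

    static boolean isWall(int r, int c) {
        return r < 0 || r >= ROWS || c < 0 || c >= COLS || GRID[r].charAt(c) == '#';
    }

    static double reward(int r, int c) {
        char ch = GRID[r].charAt(c);
        return ch == 'G' ? 1.0 : ch == 'B' ? -1.0 : -0.04;
    }

    /** Utility of the square reached by heading in dir; the agent stays put if blocked. */
    static double utilityAfterMove(double[][] u, int r, int c, int dir) {
        int nr = r + DR[dir], nc = c + DC[dir];
        return isWall(nr, nc) ? u[r][c] : u[nr][nc];
    }

    /** Expected utility of an action: 0.8 intended, 0.1 for each right angle. */
    static double expectedUtility(double[][] u, int r, int c, int a) {
        return 0.8 * utilityAfterMove(u, r, c, a)
             + 0.1 * utilityAfterMove(u, r, c, (a + 3) % 4)
             + 0.1 * utilityAfterMove(u, r, c, (a + 1) % 4);
    }

    /** Synchronous Bellman sweeps until the largest change is below epsilon or the cap is hit. */
    static double[][] valueIteration(double epsilon, int maxIterations) {
        double[][] u = new double[ROWS][COLS];
        for (int it = 0; it < maxIterations; it++) {
            double[][] next = new double[ROWS][COLS];
            double delta = 0.0;
            for (int r = 0; r < ROWS; r++) {
                for (int c = 0; c < COLS; c++) {
                    if (isWall(r, c)) continue;   // wall squares keep utility 0
                    double best = Double.NEGATIVE_INFINITY;
                    for (int a = 0; a < 4; a++) {
                        best = Math.max(best, expectedUtility(u, r, c, a));
                    }
                    next[r][c] = reward(r, c) + GAMMA * best;
                    delta = Math.max(delta, Math.abs(next[r][c] - u[r][c]));
                }
            }
            u = next;
            if (delta < epsilon) break;
        }
        return u;
    }

    /** Greedy action (0..3) with respect to the given utilities. */
    static int greedyAction(double[][] u, int r, int c) {
        int best = 0;
        for (int a = 1; a < 4; a++) {
            if (expectedUtility(u, r, c, a) > expectedUtility(u, r, c, best)) best = a;
        }
        return best;
    }

    public static void main(String[] args) {
        double[][] u = valueIteration(1e-6, 10_000);
        for (double[] row : u) System.out.println(Arrays.toString(row));
    }
}
```

The policy to display can be read off by calling `greedyAction` on every non-wall square once the utilities have settled. To produce the utility plot, one option is to store a copy of `u` at the end of each sweep and plot selected squares against the sweep index.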

Using the method of policy iteration for Part 1

  • Descriptions of implemented solutions
  • Plot of optimal policy
  • Utilities of all states
  • Plot of utility estimates as a function of the number of iterations
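For the policy iteration deliverables, the following sketch alternates policy evaluation and greedy policy improvement until the policy stops changing. It is written under several assumptions: the 3x4 grid is hypothetical (not the maze in the figure), evaluation is done by repeated in-place sweeps rather than by solving the linear system exactly, and the tolerances `1e-10` and `1e-6` and the cap of 1000 rounds are arbitrary illustrative values.

```java
import java.util.Arrays;

/** Policy-iteration sketch. The grid below is hypothetical, NOT the assignment's maze. */
class PolicyIterationSketch {
    // Hypothetical layout: '#' wall, 'G' green (+1), 'B' brown (-1), '.' white (-0.04).
    static final String[] GRID = {"G#..", "..B.", "...."};
    static final int ROWS = GRID.length, COLS = GRID[0].length();
    static final int[] DR = {-1, 0, 1, 0};   // actions 0..3: UP, RIGHT, DOWN, LEFT
    static final int[] DC = {0, 1, 0, -1};
    static final double GAMMA = 0.99;        // discount factor from Part 1

    static boolean isWall(int r, int c) {
        return r < 0 || r >= ROWS || c < 0 || c >= COLS || GRID[r].charAt(c) == '#';
    }

    static double reward(int r, int c) {
        char ch = GRID[r].charAt(c);
        return ch == 'G' ? 1.0 : ch == 'B' ? -1.0 : -0.04;
    }

    static double utilityAfterMove(double[][] u, int r, int c, int dir) {
        int nr = r + DR[dir], nc = c + DC[dir];
        return isWall(nr, nc) ? u[r][c] : u[nr][nc];   // blocked moves leave the agent in place
    }

    static double expectedUtility(double[][] u, int r, int c, int a) {
        return 0.8 * utilityAfterMove(u, r, c, a)
             + 0.1 * utilityAfterMove(u, r, c, (a + 3) % 4)
             + 0.1 * utilityAfterMove(u, r, c, (a + 1) % 4);
    }

    /** Iterative policy evaluation: in-place sweeps with the action fixed by the policy. */
    static void evaluate(int[][] policy, double[][] u, double tol) {
        double delta;
        do {
            delta = 0.0;
            for (int r = 0; r < ROWS; r++) {
                for (int c = 0; c < COLS; c++) {
                    if (isWall(r, c)) continue;
                    double v = reward(r, c) + GAMMA * expectedUtility(u, r, c, policy[r][c]);
                    delta = Math.max(delta, Math.abs(v - u[r][c]));
                    u[r][c] = v;
                }
            }
        } while (delta > tol);
    }

    /** Returns the final policy; the utilities of that policy are written into u. */
    static int[][] policyIteration(double[][] u) {
        int[][] policy = new int[ROWS][COLS];   // initial policy: UP everywhere
        boolean changed = true;
        for (int round = 0; changed && round < 1000; round++) {
            evaluate(policy, u, 1e-10);
            changed = false;
            for (int r = 0; r < ROWS; r++) {
                for (int c = 0; c < COLS; c++) {
                    if (isWall(r, c)) continue;
                    int best = policy[r][c];
                    double bestValue = expectedUtility(u, r, c, best);
                    for (int a = 0; a < 4; a++) {
                        double v = expectedUtility(u, r, c, a);
                        // Switch only on a clear improvement so ties do not cause flip-flopping.
                        if (v > bestValue + 1e-6) { best = a; bestValue = v; }
                    }
                    if (best != policy[r][c]) { policy[r][c] = best; changed = true; }
                }
            }
        }
        return policy;
    }

    public static void main(String[] args) {
        double[][] u = new double[ROWS][COLS];
        int[][] policy = policyIteration(u);
        for (int[] row : policy) System.out.println(Arrays.toString(row));
        for (double[] row : u) System.out.println(Arrays.toString(row));
    }
}
```

Warm-starting each evaluation from the previous utilities (the same `u` array is reused across rounds) keeps later evaluations short. Replacing `evaluate` with a fixed number of sweeps would give the modified policy iteration variant, at the cost of less precise utilities per round.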

Source code for Part 1.

Part 2 bonus questions

  • Answers to the questions in the report
  • Source code

Attachment: Assignment Files.rar


Reviews

len1420096

3/8/2017 11:52:30 PM

The Chapter 16-17 file is just reference material for the assignment. The assignment itself is what needs to be done. It covers topics in Module 3: Agent Decision Making. Refer to Chapters 16 and 17 of the reference book "Artificial Intelligence: A Modern Approach" by S. Russell and P. Norvig, Prentice-Hall, third edition, 2010. For ease of access, a soft copy of these two chapters can be found in the NTULearn system.




All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd