Implement the value iteration algorithm for mdp

Assignment Help C/C++ Programming
Reference no: EM131196911

Question 1

You are facing the following problem: You are given a robot and your task is to "guide" the robot through a 2-dimensional maze such that the robot can reach a desired goal state. Assume that the maze is defined as a regular 2-dimensional grid with discrete grid points. Some of the grid points can be visited by the robot while other grid points (such as those denoting walls and other obstacles) are off limit. Lets call the set of grid points that a robot can visit "states".

Assume that the following is given:

- A list of valid states (spaces that the robot can visit). It can be assumed that the list is complete and hence, the list defines the maze.

- A starting state. This defines the starting position of the robot.

- A goal state which defines the target state for the robot.

Your task is to write a rule called roby in Prolog. The rule is to find the shortest path from the given start state to the given goal state. The robot can take one step at a time and permitted are the moves up, left, right, and down. Diagonal movements are not allowed. This means that the robot can only move to states which are directly adjacent to a current state.

Your Prolog program is to read the list of states, the goal state, and the target state from a database containing "facts" called DB.pl.
You are given the files DB.pl and main_template.pl. The file DB.pl contains a description of the maze which is shown on the next page. Your task is to:

- Extend the content of main_template.pl file such that is uses the facts as defined in DB.pl to compute the shortest path from the given start state to a given goal state. Your program should work correctly for any other maze, too.

- Use SWI Prolog to solve the task. Do not make use of additional libraries (your code must be stand-alone, without further dependencies).

- Ensure that your solution will work for other mazes, too. Your code will be tested on different versions of DB.pl. Some of these mazes may not have a solution or may have more than one shortest paths (i.e. several solutions of the same length). Your program should produce a correct response (i.e. list all shortest paths, or should "fail" if there is no solution.

For example, for the maze defined in the provided DB.pl your code should produce the following sequence as output: s(7,3),s(7,4),s(7,5),s(7,6),s(6,6),s(5,6),s(4,6),s(4,5),s(4,4),s(5,4)

Note that some of the states in this maze are out of reach for this robot or, if the robot had started from s(8,1) then there would have been no solution.

952_Figure.png

Question 2

Implement the value iteration algorithm for MDP which computes the solution to the situation shown below. You may write your code in either C or C++. Your code must be implemented as a self-contained single source code file which does not require any additional libraries during compilation, does not require any additional data files during run-time, and does not expect any user inputs. For each value of k, your program is to print (to the screen) the reward vector J. Your program is to terminate when convergence is observed (use epsilon=0.0001). For each time step k print the optimal policy.

Your name and student number should be in the comment header of the source code file.

875_Figure1.jpg

Attachment:- maintemplate.rar

Reference no: EM131196911

Questions Cloud

Trade deficit saving is less than investment : Why is it that when you have a trade deficit saving is less than investment. S-I=NX. A trade deficit just means you bought more products from foreign sellers than you sold to them. So is a trade deficit even a bad thing? And what does it have to do w..
The amount of over or underapplied overhead for 20x5 : The amount of over or underapplied overhead for 20X5. Indicate whether overhead was overapplied or underapplied.
What factors lead you to the conclusion : What factors lead you to this conclusion? You may want to do additional research of sources to reach a conclusion. If so, please identify the sources that added to your analysis.
Risk taking-favor of low-risk managerial strategies : One of the reasons Joseph Schumpeter argued that capital was doomed was because he predicted that big corporations would naturally shift away from risk-taking entrepreneurship in favor of low-risk managerial strategies. Has this happened? Have major ..
Implement the value iteration algorithm for mdp : Implement the value iteration algorithm for MDP which computes the solution to the situation shown below. You may write your code in either C or C++. Your code must be implemented as a self-contained single source code file which does not require..
Describe why is monetary amount of each fair share different : Why is the monetary amount of each fair share different? How much money is owed to each of the two people who do not "win" the collection of frogs? In your opinion how "Fair" is the process described above?
Perspectives in marketing planning : What are the major changes and perspectives in marketing planning? Please discuss
List business areas and processes used in umuc pizza shops : List three business areas and/or processes used in the UMUC Pizza shops that could be supported by an IT solution. Explain how each IT project listed above specifically improves and/or supports Bill's UMUC Pizza business.
Primary means of market segmentation : TRUE OR FALSE: Fine Image Stores sell arts & crafts supplies to consumers who are highly creative, intelligent, and imaginative. They enjoy activities like painting and writing. Fine Image should use demographics as their primary means of market s..

Reviews

Write a Review

C/C++ Programming Questions & Answers

  Function that accepts a pointer to a string as an argument

Must actual count the number of words. User must be able to input a stringand then pass the string to the function. The function must also display the average number of letters in each word.

  Write the function - void shuffle

Write the function: void shuffle(int ar[], int size); This function "shuffles" the elements in the array pointed by 'ar' (and whose length is 'size').

  Develop a global function customerinformation

Use a parameterized constructor to initialize data members - Develop a global function "CustomerInformation" that prints the customers information (number of dept each customer can handel). ( Use copy constructor).

  One the same set of axes

One the same set of axes, plot the monopolist's demand and marginal revenue curves. Indicate where theprice elasticity of demand is elastic, inelastic and unit elastic.

  Payroll and uses the selection construct

This problem involves payroll and uses the selection construct. A possible restatement: An hourly employee's regular payRate is $16.78/hour for hoursWorked

  Program to compute the weekly wages repeatedly

It contains the C++ wages program using a repeat loop in order to enable the user to compute several wages. The loop ends when the user enters -1 for either the hours_worked or the pay_rate. C++ uses the "do" keyword instead of "repeat".

  Calculate and return the average of one of numerical values

Write a method that uses the array to output to the console the list of entries in reverse order. Each entry is displayed on a new line.

  Programs written with inheritance

Many programs written with inheritance could be written with composition instead, and vice versa. Rewrite the classes Point3D, Sphere and Cylinder using composition rather than inheritance

  Implement a simple calculator program

Implement a simple calculator

  Problem related to c programming

write a Grade Book program for his class to help him compute final grades. Design a program that asks for the student's name and four test grades. You are to display the student's name, four test grades, the average of the four test grades and the..

  Write a c program that converts celsius temperatures to

write a c program that converts celsius temperatures to fahrenheit temperatures.the formula isf95c 32.f is fahrenheit

  Define a class that consists of three objects

Define a class that consists of three objects: day, month, and year. Within this class define two member function (constructor ) to initialize the objects to today?s date and one that display the date as follows: 05/7/2002

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd