Implement the value iteration algorithm for mdp

Assignment Help C/C++ Programming
Reference no: EM131196911

Question 1

You are facing the following problem: You are given a robot and your task is to "guide" the robot through a 2-dimensional maze such that the robot can reach a desired goal state. Assume that the maze is defined as a regular 2-dimensional grid with discrete grid points. Some of the grid points can be visited by the robot while other grid points (such as those denoting walls and other obstacles) are off limit. Lets call the set of grid points that a robot can visit "states".

Assume that the following is given:

- A list of valid states (spaces that the robot can visit). It can be assumed that the list is complete and hence, the list defines the maze.

- A starting state. This defines the starting position of the robot.

- A goal state which defines the target state for the robot.

Your task is to write a rule called roby in Prolog. The rule is to find the shortest path from the given start state to the given goal state. The robot can take one step at a time and permitted are the moves up, left, right, and down. Diagonal movements are not allowed. This means that the robot can only move to states which are directly adjacent to a current state.

Your Prolog program is to read the list of states, the goal state, and the target state from a database containing "facts" called DB.pl.
You are given the files DB.pl and main_template.pl. The file DB.pl contains a description of the maze which is shown on the next page. Your task is to:

- Extend the content of main_template.pl file such that is uses the facts as defined in DB.pl to compute the shortest path from the given start state to a given goal state. Your program should work correctly for any other maze, too.

- Use SWI Prolog to solve the task. Do not make use of additional libraries (your code must be stand-alone, without further dependencies).

- Ensure that your solution will work for other mazes, too. Your code will be tested on different versions of DB.pl. Some of these mazes may not have a solution or may have more than one shortest paths (i.e. several solutions of the same length). Your program should produce a correct response (i.e. list all shortest paths, or should "fail" if there is no solution.

For example, for the maze defined in the provided DB.pl your code should produce the following sequence as output: s(7,3),s(7,4),s(7,5),s(7,6),s(6,6),s(5,6),s(4,6),s(4,5),s(4,4),s(5,4)

Note that some of the states in this maze are out of reach for this robot or, if the robot had started from s(8,1) then there would have been no solution.

952_Figure.png

Question 2

Implement the value iteration algorithm for MDP which computes the solution to the situation shown below. You may write your code in either C or C++. Your code must be implemented as a self-contained single source code file which does not require any additional libraries during compilation, does not require any additional data files during run-time, and does not expect any user inputs. For each value of k, your program is to print (to the screen) the reward vector J. Your program is to terminate when convergence is observed (use epsilon=0.0001). For each time step k print the optimal policy.

Your name and student number should be in the comment header of the source code file.

875_Figure1.jpg

Attachment:- maintemplate.rar

Reference no: EM131196911

Questions Cloud

Trade deficit saving is less than investment : Why is it that when you have a trade deficit saving is less than investment. S-I=NX. A trade deficit just means you bought more products from foreign sellers than you sold to them. So is a trade deficit even a bad thing? And what does it have to do w..
The amount of over or underapplied overhead for 20x5 : The amount of over or underapplied overhead for 20X5. Indicate whether overhead was overapplied or underapplied.
What factors lead you to the conclusion : What factors lead you to this conclusion? You may want to do additional research of sources to reach a conclusion. If so, please identify the sources that added to your analysis.
Risk taking-favor of low-risk managerial strategies : One of the reasons Joseph Schumpeter argued that capital was doomed was because he predicted that big corporations would naturally shift away from risk-taking entrepreneurship in favor of low-risk managerial strategies. Has this happened? Have major ..
Implement the value iteration algorithm for mdp : Implement the value iteration algorithm for MDP which computes the solution to the situation shown below. You may write your code in either C or C++. Your code must be implemented as a self-contained single source code file which does not require..
Describe why is monetary amount of each fair share different : Why is the monetary amount of each fair share different? How much money is owed to each of the two people who do not "win" the collection of frogs? In your opinion how "Fair" is the process described above?
Perspectives in marketing planning : What are the major changes and perspectives in marketing planning? Please discuss
List business areas and processes used in umuc pizza shops : List three business areas and/or processes used in the UMUC Pizza shops that could be supported by an IT solution. Explain how each IT project listed above specifically improves and/or supports Bill's UMUC Pizza business.
Primary means of market segmentation : TRUE OR FALSE: Fine Image Stores sell arts & crafts supplies to consumers who are highly creative, intelligent, and imaginative. They enjoy activities like painting and writing. Fine Image should use demographics as their primary means of market s..

Reviews

Write a Review

C/C++ Programming Questions & Answers

  Writing a program that creates a linked list

Writing a program that creates a linked list from 100 randomly generated numbers and sorts the using a heap. One heap in ascending order and one heap in descending order. Then displays the original linked list and both sorted lists.

  Write a program to implement a simple batch operating system

Write a program C/C++ to implement a simple batch operating system. Develop a simple loader, scheduler, memory manager, and shell.

  Write a program to track hourly employee arrival

A company hires you to write a program to track hourly employee arrival and departure times from work. In essence, you are tasked to make an online time clock - If the user enters an incorrect value more than 3 times, display a prompt that the prog..

  The value of minimal positive vector element

I need function which will evaluate. The value of minimal positive vector element - if none exists, return -1;

  What are its volume and surface area

A toy ball has a diameter of 8.1inches. What are its volume and surface area? Round to one decimal place.

  A file (complex.txt) which has a number of complex numbers

Read in a file (complex.txt) which has a number of complex numbers in the form of a+bi (e.g. 3+5i  2-3i etc).

  Object oriented programming paradigm is better choice

Explain why object oriented programming paradigm is considered better choice than structured programming paradigm. Does it reflect our "natural" way of thinking or do you find it artificial?

  Write c program to find the distance traveled

Write C program to find the distance traveled at regular intervals of time given the values of ‘u' and ‘a'. The program should provide the flexibility to the user to select his own time intervals and repeat the calculations for different values of..

  Basics of code development on linux and cryptography

Prepare a C program that gives simple mono-alphabetic substitution between plaintext, and Enhance your code to use "-e" to encrypt a string argument and "-d" to decrypt it using argv and argc

  You will create a linked list module

You will create a linked list module that exactly meets the specifications given in the supplied header (.h) file. The playlist program must accept a filename on the command line (argv).

  Accomplish the preceding task

However, the customer details maintenance application crashed as Dwayne mistakenly entered the wrong file path. Therefore, he asks Elina to modify the application so that it does not crash due to similar errors. Modify the code that Elina needs to ..

  Associate the variable contains

Given the string, s , and the list, lst , associate the variable contains with True if every string in lst appears in s(and False otherwise). Thus, given the string Hello world and the list ["H", "wor", "o w"] , contains would be associated with True..

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd