Implement the value iteration algorithm for mdp

Assignment Help C/C++ Programming
Reference no: EM131194343

Question 1

You are facing the following problem: You are given a robot and your task is to "guide" the robot through a 2-dimensional maze such that the robot can reach a desired goal state. Assume that the maze is defined as a regular 2-dimensional grid with discrete grid points. Some of the grid points can be visited by the robot while other grid points (such as those denoting walls and other obstacles) are off limit. Lets call the set of grid points that a robot can visit "states".

Assume that the following is given:

- A list of valid states (spaces that the robot can visit). It can be assumed that the list is complete and hence, the list defines the maze.

- A starting state. This defines the starting position of the robot.

- A goal state which defines the target state for the robot.

Your task is to write a rule called roby in Prolog. The rule is to find the shortest path from the given start state to the given goal state. The robot can take one step at a time and permitted are the moves up, left, right, and down. Diagonal movements are not allowed. This means that the robot can only move to states which are directly adjacent to a current state.

Your Prolog program is to read the list of states, the goal state, and the target state from a database containing "facts" called DB.pl.
You are given the files DB.pl and main_template.pl. The file DB.pl contains a description of the maze which is shown on the next page. Your task is to:

- Extend the content of main_template.pl file such that is uses the facts as defined in DB.pl to compute the shortest path from the given start state to a given goal state. Your program should work correctly for any other maze, too.

- Use SWI Prolog to solve the task. Do not make use of additional libraries (your code must be stand-alone, without further dependencies).

- Ensure that your solution will work for other mazes, too. Your code will be tested on different versions of DB.pl. Some of these mazes may not have a solution or may have more than one shortest paths (i.e. several solutions of the same length). Your program should produce a correct response (i.e. list all shortest paths, or should "fail" if there is no solution.

For example, for the maze defined in the provided DB.pl your code should produce the following sequence as output: s(7,3),s(7,4),s(7,5),s(7,6),s(6,6),s(5,6),s(4,6),s(4,5),s(4,4),s(5,4)

1096_Fig.jpg

Note that some of the states in this maze are out of reach for this robot or, if the robot had started from s(8,1) then there would have been no solution.

Question 2

Implement the value iteration algorithm for MDP which computes the solution to the situation shown below. You may write your code in either C or C++. Your code must be implemented as a self-contained single source code file which does not require any additional libraries during compilation, does not require any additional data files during run-time, and does not expect any user inputs. For each value of k, your program is to print (to the screen) the reward vector J. Your program is to terminate when convergence is observed (use epsilon=0.0001). For each time step k print the optimal policy. Use the discount factor.

Your name and student number should be in the comment header of the source code file.

1909_Fig1.jpg

Reference no: EM131194343

Questions Cloud

Capacity per day for the worker : Defining one flow unit as one minute of work, what is the capacity per day for the worker at "Toast" step? Round your answer to the nearest integer. Your answer is.
Write a letter on given topic : Assume that you are a CPA and a tax specialist. - Write a letter to Gustav, Heidi, and the other shareholders offering your advice.
Environment initiatives by eni : Write a detailed report in approximately 750 to 800 words document on the topic: Environment Initiatives by ENI. Introduction:  ENI is an Italian Multinational Oil and gas company headquartered in Rome, Italy with a market capitalization of 87 billio..
Determine q and w for the process : Four kilograms of steam in a pistonlcylinder device at 400 kPa and 448.15 K (175°C) undergoes a mechanically reversible, isothermal compression to a final pressure such that the steam is just saturated. Determine Q and W for the process.
Implement the value iteration algorithm for mdp : CSCI924/CSCI424 - Reasoning and Learning - Implement the value iteration algorithm for MDP which computes the solution to the situation. You may write your code in either C or C++. Your code must be implemented as a self-contained single source co..
How much heat is transferred in the process : Heat is transferred to the vessel until one phase just disappears, and a single phase remains. Which phase (liquid or vapor) remains, and what are its temperature and pressure? How much heat is transferred in the process?
Markets into meaningful customer groups : Customer driven marketing strategy decisions require dividing up the markets into meaningful customer groups (segmentation), choosing who to serve (targeting), creating market offerings that offer value to those groups (differentiation), and creat..
Describe how an organization can adapt its direction : Describe a minimum of five challenges from the list below that are defining the future strategic direction of health care. Proposed health care reform and legislation.
What is its final temperature and how much work is done : If it undergoes a reversible, adiabatic expansion to 150 kPa, what is its final temperature and how much work is done?

Reviews

len1194343

9/5/2016 8:45:10 AM

I need this assignment solution but I got average sol last time I got just 12 marks out of 15 so we pay money so please assign this task to your best expert, looking for 100% marks.

Write a Review

C/C++ Programming Questions & Answers

  Calculates average of three exams for each student

The program reads the grades for students, calculates average of three exams for each student, and puts out to another file (called Summarygrades.txt) Student ID average and letter grade: example below,

  Interactive and user friendly

Discuss your experiences with respect to the features you think make these technologies more interactive and user friendly. Provide rationale to support your answer.

  Find the optimal solution for the problem

Develop a linear program and hence find the optimal solution for the problem.

  Write a c program to search a list of integers

Write a C program to search a list of integers for a particular value. The user should be prompted to enter the value to be searched for. Your program will then print out whether the value was found or not.

  Visitor bureau of las vegas

Past surveys reveal that 30% of tourists going to Las Vegas to gamble spend more than $1,000. The Visitor's Bureau of Las Vegas wants to update this percentage.

  Determines whether to accept an object.

Modify the implementation of the DataSet class in Section to use both a Measurer and a Filter object. Only objects that the filter accepts should be processed.

  Explain characteristics unique to object-oriented programing

The machine language treats a message as bits while object-oriented programming language treats every message as an object. Do you agree with this statement? Why or why not? Explain the characteristics unique to object-oriented programming and how..

  8-bit or 16-bit mode, and the prescaling ratio

8-bit or 16-bit mode, and the prescaling ratio. You can assume XTAL = 10M Hz or at the frequency you specify.

  Write a c++ code for how to learn oop

Write a C++ code for How to learn OOP (Object oriented programming) by CPP(C Plus Plus) - general-purpose programming language

  Write a c++ statement that prints the value

How many parameters does the function test have? What is the type of the function test?

  Details of the stock of a shop

David needs to create a program to maintain the details of the stock of a shop. He needs to ensure that item category is Women, Men, Girls, Boys, and Babies. He needs to accept item category from the user.

  What are inverse least squares

The data for these calculations are:knowns.xyz contains the spectra of twenty known samples, concentrations.xyz contains the concentrations of a target chemical T in each of the twenty samples, and unknowns.xyz gives the spectra of fifteen more sa..

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd