Define the states of mdp for the dice game

Assignment Help Basic Computer Science
Reference no: EM133248083

Assignment:

1. Consider a dice game that is similar to the card game blackjack. The goal of the game is to obtain numbers that sum to as near as possible to 13 without going over. The game starts with rolling a dice by the dealer and player, and each of them has an independent random number from 1 to 6. The player can request additional rolling (hit) until he/she decides to stop (stand) or exceed 13 (bust). After the player stands, the dealer can roll additional dice according to its policy. If neither player nor dealer busts, the outcome (win, lose, draw) is decided by whose sum is closer to 13. The reward for winning is 1 dollar, drawing is 0, and losing is -1. This game can be modeled by the Markov decision process (MDP). Please answer the following questions.

(a) Define the states of MDP for the dice game, and answer the number of states.

(b) Define the actions of MDP for the dice game, and answer the number of actions.

(c) Suppose the player adopts a policy that rolls additional dice until the sum is 10 or greater. Answer the state transition probability matrix.

This game can be modeled by Markov decision process (MDP). Please answer the following questions.

(a) Define the states of MDP for the dice game, and answer the number of states.

(b) Define the actions of MDP for the dice game, and answer the number of actions.

(c) Suppose the player adopts a policy that rolls additional dice until the sum is 10 or greater.

Answer the state transition probability matrix.

Reference no: EM133248083

Questions Cloud

Is the metoo movement a marketing issue : Is the #MeToo movement a marketing issue? How are customers likely to react to allegations of workplace harassment? How should companies deal with the issue
Different types of marketing channels : HRT 3020 California Polytechnic State University, different types of marketing channels (distribution channels) of the brand by each category
How to change my character and habits : BUS 1038 George Brown College how to change my character and habits at the right place and time is challenging. Demonstrating good work ethic to employers
Compare the trait approach and psychoanalytic approach : Compare the trait approach, psychoanalytic approach, and humanistic approach. Evaluate each in terms of at least one strength and one weakness
Define the states of mdp for the dice game : Consider a dice game that is similar to the card game blackjack. The goal of game is to obtain numbers that sum to as near as possible to 13 without going over.
How the experts influenced people jugement : BUSINESS 345 Simon Fraser University How the experts influenced people jugement and they decision making? How not relevant and not reliable experts can be?
Identify antecedents to the behavior and outcomes : Identify antecedents to the behavior and outcomes of engaging in that behavior. Describe the behavior you selected
Describe the roles that a salesperson and the sales force : MAR 2011 St. Petersburg College What are blogs, and how are marketers using them to market their products and services? What advantages and disadvantages
Define racism and give a brief explanation : Define racism and give a brief explanation of how racism have an emotional impact on individuals. How the government is influence by racism

Reviews

Write a Review

Basic Computer Science Questions & Answers

  Identifies the cost of computer

identifies the cost of computer components to configure a computer system (including all peripheral devices where needed) for use in one of the following four situations:

  Input devices

Compare how the gestures data is generated and represented for interpretation in each of the following input devices. In your comparison, consider the data formats (radio waves, electrical signal, sound, etc.), device drivers, operating systems suppo..

  Cores on computer systems

Assignment : Cores on Computer Systems:  Differentiate between multiprocessor systems and many-core systems in terms of power efficiency, cost benefit analysis, instructions processing efficiency, and packaging form factors.

  Prepare an annual budget in an excel spreadsheet

Prepare working solutions in Excel that will manage the annual budget

  Write a research paper in relation to a software design

Research paper in relation to a Software Design related topic

  Describe the forest, domain, ou, and trust configuration

Describe the forest, domain, OU, and trust configuration for Bluesky. Include a chart or diagram of the current configuration. Currently Bluesky has a single domain and default OU structure.

  Construct a truth table for the boolean expression

Construct a truth table for the Boolean expressions ABC + A'B'C' ABC + AB'C' + A'B'C' A(BC' + B'C)

  Evaluate the cost of materials

Evaluate the cost of materials

  The marie simulator

Depending on how comfortable you are with using the MARIE simulator after reading

  What is the main advantage of using master pages

What is the main advantage of using master pages. Explain the purpose and advantage of using styles.

  Describe the three fundamental models of distributed systems

Explain the two approaches to packet delivery by the network layer in Distributed Systems. Describe the three fundamental models of Distributed Systems

  Distinguish between caching and buffering

Distinguish between caching and buffering The failure model defines the ways in which failure may occur in order to provide an understanding of the effects of failure. Give one type of failure with a brief description of the failure

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd