Fill This Form To Receive Instant Help

Help in Homework
trustpilot ratings
google ratings


Homework answers / question archive / Artificial Intelligence   Total Points: 50   Write a program to define an MDP (Markov’s Decision Process) problem for a generic grid world (e

Artificial Intelligence   Total Points: 50   Write a program to define an MDP (Markov’s Decision Process) problem for a generic grid world (e

Computer Science

Artificial Intelligence

 

Total Points: 50

 

Write a program to define an MDP (Markov’s Decision Process) problem for a generic grid world (e.g. figure 17.1 of AIMA Textbook 4th edition page 563) and use value iteration (figure 17.6) to print the values of states in each iteration. After termination of the value iteration, the final policy should be printed. Next, implement a modified policy iteration (policy iteration algorithm is provided in figure 17.9 and modified policy iteration is discussed on page 578) to calculate and print the optimal policy. The description of MDP (T, R, gamma, and epsilon) should be loaded from the text file as input. You can use the sample input format included with the assignment or use your own. Also, write-up describing the format of the input file and how to run the program is not necessary.

 

Programming Language

Python

 

Grading Rubric

Parse input text into MDP class => 5

Value Iteration => 15

Print values of states in each iteration=> 5

Policy Iteration => 15

Print final policy => 5

Writeup => 5

Option 1

Low Cost Option
Download this past answer in few clicks

16.99 USD

PURCHASE SOLUTION

Already member?


Option 2

Custom new solution created by our subject matter experts

GET A QUOTE

Related Questions