Trusted by Students Everywhere
Why Choose Us?
0% AI Guarantee

Human-written only.

24/7 Support

Anytime, anywhere.

Plagiarism Free

100% Original.

Expert Tutors

Masters & PhDs.

100% Confidential

Your privacy matters.

On-Time Delivery

Never miss a deadline.

Artificial Intelligence   Total Points: 50   Write a program to define an MDP (Markov’s Decision Process) problem for a generic grid world (e

Computer Science Nov 02, 2021

Artificial Intelligence

 

Total Points: 50

 

Write a program to define an MDP (Markov’s Decision Process) problem for a generic grid world (e.g. figure 17.1 of AIMA Textbook 4th edition page 563) and use value iteration (figure 17.6) to print the values of states in each iteration. After termination of the value iteration, the final policy should be printed. Next, implement a modified policy iteration (policy iteration algorithm is provided in figure 17.9 and modified policy iteration is discussed on page 578) to calculate and print the optimal policy. The description of MDP (T, R, gamma, and epsilon) should be loaded from the text file as input. You can use the sample input format included with the assignment or use your own. Also, write-up describing the format of the input file and how to run the program is not necessary.

 

Programming Language

Python

 

Grading Rubric

Parse input text into MDP class => 5

Value Iteration => 15

Print values of states in each iteration=> 5

Policy Iteration => 15

Print final policy => 5

Writeup => 5

Archived Solution
Unlocked Solution

You have full access to this solution. To save a copy with all formatting and attachments, use the button below.

Already a member? Sign In
Important Note: This solution is from our archive and has been purchased by others. Submitting it as-is may trigger plagiarism detection. Use it for reference only.

For ready-to-submit work, please order a fresh solution below.

Or get 100% fresh solution
Get Custom Quote
Secure Payment