Skip to content

Latest commit

 

History

History
2 lines (2 loc) · 241 Bytes

File metadata and controls

2 lines (2 loc) · 241 Bytes

MDP with Value Iteration and Policy Iteration

Solving MDP is a first step towards Deep Reinforcement Learning. This notebook show you how to implement Value Iteration and Policy Iteration to solve OPENAI GYM FrozenLake Enviorment.