Reinforcement Learning: Value Iteration and Policy Iteration




This post covers Lecture 3 of David Silver's reinforcement learning course [1] and Chapter 4 of Shiyu Zhao's Mathematical Foundations of Reinforcement Learning [2].
[1] https://www.davidsilver.uk/teaching/
[2] https://github.com/MathFoundationRL/Book-Mathmatical-Foundation-of-Reinforcement-Learning