Reference
[1] https://github.com/MathFoundationRL/Book-Mathmatical-Foundation-of-Reinforcement-Learning