Introduction to Reinforcement Learning
Exploitation and Exploration
Markov Decision Processes and Dynamic Programming
Theoretical Fund. of Dynamic Programming
Model-free Prediction