# Book_Reinforcement Learning: An Introduction ###### tags: `Reinforcement Learning` `book` Part I --- - [Reinforcement Learning: An Introduction_Chapter 2 Multi-armed Bandits](https://hackmd.io/@shaoeChen/Sk7frzq4u) - [Reinforcement Learning: An Introduction_Chapter 3 Finite Markov Decision Processes](https://hackmd.io/@shaoeChen/Hkm3mMjL_) - [Reinforcement Learning: An Introduction_Chapter 4 Dynamic Programming](https://hackmd.io/@shaoeChen/SydNt5G3_) - [Reinforcement Learning: An Introduction_Chapter 5 Monte Carlo Methods](https://hackmd.io/@shaoeChen/rJ46Bsqu_) - [Reinforcement Learning: An Introduction_Chapter 6 Temporal-Difference Learning](https://hackmd.io/@shaoeChen/Byd6HbFa_) - [Reinforcement Learning: An Introduction_Chapter 7 $n$-step Bootstrapping](https://hackmd.io/@shaoeChen/By2QkpgkY) - [Reinforcement Learning: An Introduction_Chapter 8 Planning and Learning with Tabular Methods](https://hackmd.io/@shaoeChen/rkeER4JxK) - [Reinforcement Learning: An Introduction_Chapter Part II: Approximate Solution Methods](https://hackmd.io/@shaoeChen/B1Tp3dHzK) Part II --- - [Reinforcement Learning: An Introduction_Chapter 9 On-policy Prediction with Approximation](https://hackmd.io/@shaoeChen/rJGNlRUGY) - [Reinforcement Learning: An Introduction_Chapter 10 On-policy Control with Approximation](https://hackmd.io/@shaoeChen/BybdEqI8F) - [Reinforcement Learning: An Introduction_Chapter 11 Off-policy Methods with Approximation] - [Reinforcement Learning: An Introduction_Chapter 12 Eligibility Traces] - [Reinforcement Learning: An Introduction_Chapter 13 Policy Gradient Methods](https://hackmd.io/@shaoeChen/SyxRNt0LF)
{"metaMigratedAt":"2023-06-16T00:19:02.662Z","metaMigratedFrom":"Content","title":"Book_Reinforcement Learning: An Introduction","breaks":true,"contributors":"[{\"id\":\"e57c4a1a-a8a4-452b-b10c-1b448f321365\",\"add\":1653,\"del\":8}]"}
Expand menu