# Book_Prof. Hung-yi Lee (李宏毅) Deep Reinforcement Learning 2018 Course Notes
###### tags: `book`
Notes on the Deep Reinforcement Learning 2018 course taught by Prof. Hung-yi Lee (李宏毅) at National Taiwan University.
[Course playlist](https://www.youtube.com/playlist?list=PLJV_el3uVTsODxQFgzMzPLa16h6B8kWM_)
Course Notes
---
- [Hung-yi Lee_DRL Lecture 1: Policy Gradient (Review)](https://hackmd.io/@shaoeChen/HkH2hSKuS)
- [Hung-yi Lee_DRL Lecture 2: Proximal Policy Optimization (PPO)](https://hackmd.io/@shaoeChen/Syez2AmFr)
- [Hung-yi Lee_DRL Lecture 3: Q-learning (Basic Idea)](https://hackmd.io/@shaoeChen/SyqVopoYr)
- [Hung-yi Lee_DRL Lecture 4: Q-learning (Advanced Tips)](https://hackmd.io/@shaoeChen/HyyXreFcB)
- [Hung-yi Lee_DRL Lecture 5: Q-learning (Continuous Action)](https://hackmd.io/@shaoeChen/By-fACxor)
- [Hung-yi Lee_DRL Lecture 6: Actor-Critic](https://hackmd.io/@shaoeChen/rkiOH4MoS)
- [Hung-yi Lee_DRL Lecture 7: Sparse Reward](https://hackmd.io/@shaoeChen/rktZw3xhH)
- [Hung-yi Lee_DRL Lecture 8: Imitation Learning](https://hackmd.io/@shaoeChen/H1aW8iEhS)