2020.11.11 Confirmed the performance of the value-based algorithms on several Atari games, such as Q-Bert, Breakout, Seaquest, Boxing, and Pong.
2021.02.15 Confirmed the performance of the policy-based and distributed algorithms on CartPole and LunarLander.
I'll add detailed comments to all of the code soon.
Online Colab notebooks (click the 'Colab' icon).
- Value-based
- Policy-based
Ray parallel Python package tutorial series.
Paper References
- DQN, Double DQN, Dueling DQN, PER, C51, Noisy DQN, Rainbow
- REINFORCE, Actor-Critic - Sutton's Textbook
Code References
- Sutton - Reinforcement Learning textbook, 2nd ed.
- https://github.com/Curt-Park/rainbow-is-all-you-need
- https://github.com/MrSyee/pg-is-all-you-need
- https://github.com/ShangtongZhang/DeepRL
- https://github.com/sfujim
- https://github.com/yandexdataschool/Practical_RL
- https://github.com/seungeunrho/minimalRL
