-
2020.11.11 Confirmed for the performances of Value-based algorithms for several atari games such as Q-Bert, Breakout, Seaquest, Boxing, Pong, etc.
-
2021.02.15 Confirmed for the performances of Policy-based algorithms and distributed algorithms for CartPole and Lunarlander.
-
I'll write up the detailed comments in all the codes soon.
-
Colab online codes (Click 'colab' icon).
-
Value Based
-
Policy Based
-
-
Ray Pararell python package Totorial series.
-
Paper References
-
DQN, Double DQN, Dueling DQN, PER, C51, Noisy DQN, Rainbow
-
REINFORCE, Actor-Critic - Sutton's Textbook
-
-
Code References
- Sutton - Reinforcement Learning Textbook 2nd ed.
- https://github.com/Curt-Park/rainbow-is-all-you-need
- https://github.com/MrSyee/pg-is-all-you-need
- https://github.com/ShangtongZhang/DeepRL
- https://github.com/sfujim
- https://github.com/yandexdataschool/Practical_RL
- https://github.com/seungeunrho/minimalRL