Authors: Gokul Puthumanaillam, Manav Vora
This is a code for comparing different algorithms to solve the lunar lander problem. The algorithms simulated are:
- Monte Carlo
- Q-Learning
- DQN
- SARSA
Simulation | Final Trajectory |
---|---|
Random policy | |
Monte Carlo | |
QLearning | |
SARSA | |
DQN |