Material for MLT Reinforcement Learning workshops and study sessions.
Also, check out our MLT repo with top Deep RL resources (tutorials, code, books).
- ε Decay
- k-Armed Bandit
- Exploration vs Explotation
- Original concept and Python code: Anugraha Sinha
- Javascript implementation: Francisco Dalla Rosa Soares
Presentation
- Introduction to RL
- Important elements of an RL problem
- Description of Markov Decision Process (MDP) and and Markov Assumption.
- Importance of parametrization of State, Action, Reward and Environment.
- Model Based and Model Free Methods
- Meaning of Control Problem and Evaluation Problem.
- Algorithm of Policy Evaluation and Value iteration methods
Code examples
- Finding the best route through a maze/obstruction avoidance using policy iteration algorithm.
- Above problem statement with value iterations algorithm.
- Code exercise