Machine-Learning-Tokyo / Reinforcement_Learning Public

Material for MLT Reinforcement Learning workshops and study sessions

Reinforcement_Learning

Material for MLT Reinforcement Learning workshops and study sessions.

Also, check out our MLT repo with top Deep RL resources (tutorials, code, books).

RL Interactive Tools

ε Decay
k-Armed Bandit
Exploration vs Exploitation

Original concept and Python code: Anugraha Sinha
JavaScript implementation: Francisco Dalla Rosa Soares
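
To make the three tool topics concrete, here is a minimal sketch of an ε-greedy agent on a k-armed bandit with a decaying ε. It is not the code behind the interactive tools. The function name `run_bandit`, the Gaussian reward model, and every parameter value (`eps_start`, `eps_min`, `decay`) are assumptions chosen for illustration.

```python
import random


def run_bandit(true_means, steps=2000, eps_start=1.0, eps_min=0.01,
               decay=0.995, seed=0):
    """Epsilon-greedy agent on a k-armed bandit (hypothetical helper).

    Assumes each arm pays a Gaussian reward with unit variance around
    its true mean; that reward model is an illustration choice.
    """
    rng = random.Random(seed)
    k = len(true_means)
    q = [0.0] * k   # estimated value of each arm
    n = [0] * k     # number of pulls per arm
    eps = eps_start
    total = 0.0
    for _ in range(steps):
        if rng.random() < eps:
            arm = rng.randrange(k)                   # explore: random arm
        else:
            arm = max(range(k), key=lambda i: q[i])  # exploit: best estimate
        reward = rng.gauss(true_means[arm], 1.0)
        n[arm] += 1
        q[arm] += (reward - q[arm]) / n[arm]         # incremental sample average
        total += reward
        eps = max(eps_min, eps * decay)              # multiplicative epsilon decay
    return q, n, total / steps


estimates, pulls, avg_reward = run_bandit([0.0, 0.5, 2.0])
print("estimates:", [round(v, 2) for v in estimates])
print("pulls per arm:", pulls)
print("average reward:", round(avg_reward, 2))
```

Early on ε is close to 1, so the agent mostly explores. As ε decays toward `eps_min`, it increasingly exploits the arm with the highest estimate. Changing `decay` shifts that balance.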

Intro to Reinforcement Learning – Session #1

by Anugraha Sinha

[Meetup] & [Slides and Code]

Presentation

Introduction to RL
Important elements of an RL problem
Description of the Markov Decision Process (MDP) and the Markov assumption.
Importance of parametrization of State, Action, Reward and Environment.
Model-based and model-free methods
Meaning of Control Problem and Evaluation Problem.
Algorithms for policy evaluation and value iteration
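
As a rough illustration of the evaluation problem listed above, the sketch below runs iterative policy evaluation on a tiny hand-made MDP. It is not taken from the session slides. The two-state MDP, the dictionary layout `P[state][action] = [(prob, next_state, reward), ...]`, and the function name `policy_evaluation` are all assumptions made for this example.

```python
def policy_evaluation(P, policy, gamma=0.9, tol=1e-8):
    """Iteratively apply the Bellman expectation update until values settle.

    P[s][a] is a list of (probability, next_state, reward) triples and
    policy[s][a] is the probability of taking action a in state s
    (both layouts are illustrative assumptions).
    """
    V = {s: 0.0 for s in P}
    while True:
        delta = 0.0
        for s in P:
            v = sum(
                pa * sum(p * (r + gamma * V[s2]) for p, s2, r in P[s][a])
                for a, pa in policy[s].items()
            )
            delta = max(delta, abs(v - V[s]))
            V[s] = v  # in-place update, reused within the same sweep
        if delta < tol:
            return V


# Hypothetical two-state MDP with a known model (model-based setting).
P = {
    "A": {"stay": [(1.0, "A", 0.0)], "go": [(1.0, "B", 1.0)]},
    "B": {"stay": [(1.0, "B", 2.0)]},
}
policy = {"A": {"go": 1.0}, "B": {"stay": 1.0}}

V = policy_evaluation(P, policy)
print({s: round(v, 3) for s, v in V.items()})
```

For this policy the values can be checked by hand: V(B) = 2 / (1 - 0.9) = 20 and V(A) = 1 + 0.9 * 20 = 19, which is what the iteration should approach.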

Code examples

Finding the best route through a maze (obstacle avoidance) using the policy iteration algorithm.
The same problem solved with the value iteration algorithm.
Code exercise
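
For readers without the session code at hand, here is a small sketch of value iteration on a grid maze. It is not the workshop's implementation. The maze layout, the reward of -1 per step, the discount factor, and the helper names (`step`, `value_iteration`, `greedy_route`) are assumptions. Policy iteration would instead alternate a full policy evaluation with a greedy improvement step.

```python
# Hypothetical maze: S = start, G = goal, # = obstacle, . = free cell.
MAZE = [
    "S..#",
    ".#..",
    "...G",
]
ACTIONS = {"U": (-1, 0), "D": (1, 0), "L": (0, -1), "R": (0, 1)}


def step(maze, state, action):
    """Deterministic move; hitting an obstacle or the border keeps the agent in place."""
    r, c = state
    dr, dc = ACTIONS[action]
    nr, nc = r + dr, c + dc
    if 0 <= nr < len(maze) and 0 <= nc < len(maze[0]) and maze[nr][nc] != "#":
        return (nr, nc)
    return state


def value_iteration(maze, gamma=0.95, step_reward=-1.0, tol=1e-6):
    """Apply the Bellman optimality update until the largest change is below tol."""
    states = [(r, c) for r, row in enumerate(maze)
              for c, ch in enumerate(row) if ch != "#"]
    V = {s: 0.0 for s in states}
    while True:
        delta = 0.0
        for s in states:
            if maze[s[0]][s[1]] == "G":
                continue  # terminal state, value stays 0
            best = max(step_reward + gamma * V[step(maze, s, a)] for a in ACTIONS)
            delta = max(delta, abs(best - V[s]))
            V[s] = best
        if delta < tol:
            break
    # Greedy policy with respect to the converged values.
    policy = {
        s: max(ACTIONS, key=lambda a: step_reward + gamma * V[step(maze, s, a)])
        for s in states if maze[s[0]][s[1]] != "G"
    }
    return V, policy


def greedy_route(maze, policy, start, max_steps=100):
    """Follow the greedy policy from start until the goal (or max_steps)."""
    route = [start]
    while maze[route[-1][0]][route[-1][1]] != "G" and len(route) <= max_steps:
        route.append(step(maze, route[-1], policy[route[-1]]))
    return route


V, policy = value_iteration(MAZE)
print(greedy_route(MAZE, policy, (0, 0)))
```

Because every move costs the same, the greedy route is a shortest obstacle-free path. Which of several equally short paths gets printed depends on the order of the entries in `ACTIONS`.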
