# RLBook

## Bandits

Implementations of several bandit algorithms for exploration discussed in Chapter 2.
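
As a rough illustration of the Chapter 2 setup, here is a minimal sketch of an epsilon-greedy agent on a stationary k-armed Gaussian testbed. The function name, parameters, and testbed setup are illustrative assumptions, not the code in this repository.

```python
import numpy as np

def epsilon_greedy_bandit(true_means, epsilon=0.1, steps=1000, seed=0):
    """Run a single epsilon-greedy agent on a stationary k-armed bandit."""
    rng = np.random.default_rng(seed)
    k = len(true_means)
    q_estimates = np.zeros(k)   # sample-average action-value estimates
    counts = np.zeros(k)        # number of pulls per arm
    rewards = np.zeros(steps)

    for t in range(steps):
        # Explore with probability epsilon, otherwise exploit the greedy arm.
        if rng.random() < epsilon:
            action = rng.integers(k)
        else:
            action = int(np.argmax(q_estimates))

        reward = rng.normal(true_means[action], 1.0)  # unit-variance Gaussian reward
        counts[action] += 1
        # Incremental sample-average update of the action-value estimate.
        q_estimates[action] += (reward - q_estimates[action]) / counts[action]
        rewards[t] = reward

    return q_estimates, rewards

if __name__ == "__main__":
    true_means = np.random.default_rng(42).normal(0, 1, size=10)  # 10-armed testbed
    q, r = epsilon_greedy_bandit(true_means, epsilon=0.1, steps=2000)
    print("Estimated values:", np.round(q, 2))
    print("Average reward:", r.mean())
```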

## Policy Iteration

An answer to Exercise 4.7 from the book, along with a recreation of Example 4.2.
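
For reference, below is a minimal sketch of tabular policy iteration on a generic finite MDP (alternating policy evaluation and greedy policy improvement). It is not the Example 4.2 implementation in this repository; the transition/reward arrays in the usage snippet are hypothetical.

```python
import numpy as np

def policy_iteration(P, R, gamma=0.9, theta=1e-8):
    """Tabular policy iteration for a finite MDP.

    P[s, a, s'] : transition probabilities
    R[s, a]     : expected immediate reward for taking action a in state s
    """
    n_states, n_actions, _ = P.shape
    policy = np.zeros(n_states, dtype=int)
    V = np.zeros(n_states)

    while True:
        # Policy evaluation: sweep the Bellman expectation backup to convergence.
        while True:
            delta = 0.0
            for s in range(n_states):
                a = policy[s]
                v_new = R[s, a] + gamma * P[s, a] @ V
                delta = max(delta, abs(v_new - V[s]))
                V[s] = v_new
            if delta < theta:
                break

        # Policy improvement: act greedily with respect to the current V.
        stable = True
        for s in range(n_states):
            q = R[s] + gamma * P[s] @ V   # action values for state s
            best = int(np.argmax(q))
            if best != policy[s]:
                stable = False
            policy[s] = best
        if stable:
            return policy, V

if __name__ == "__main__":
    # Tiny 2-state, 2-action MDP as a smoke test (hypothetical numbers).
    P = np.array([[[0.9, 0.1], [0.2, 0.8]],
                  [[0.7, 0.3], [0.05, 0.95]]])
    R = np.array([[1.0, 0.0],
                  [0.5, 2.0]])
    policy, V = policy_iteration(P, R)
    print("policy:", policy, "V:", np.round(V, 2))
```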

## Value Iteration

A recreation of Example 4.3 and an answer to Exercise 4.9.
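
The following is a minimal sketch of value iteration in the gambler's-problem setting of Example 4.3 (capital states 1 to goal-1, reward +1 only on reaching the goal, undiscounted). It assumes the book's default head probability p_h = 0.4 and is illustrative rather than the repository's code.

```python
import numpy as np

def gamblers_value_iteration(p_h=0.4, goal=100, theta=1e-9):
    """Value iteration for the gambler's problem (Example 4.3 setup)."""
    V = np.zeros(goal + 1)  # V[0] and V[goal] are terminal with value 0

    while True:
        delta = 0.0
        for s in range(1, goal):
            stakes = range(1, min(s, goal - s) + 1)
            # Expected return of each stake: win with prob p_h, lose otherwise.
            returns = [
                p_h * ((1.0 if s + a == goal else 0.0) + V[s + a])
                + (1 - p_h) * V[s - a]
                for a in stakes
            ]
            best = max(returns)
            delta = max(delta, abs(best - V[s]))
            V[s] = best
        if delta < theta:
            break

    # Greedy policy extraction from the converged value function.
    policy = np.zeros(goal + 1, dtype=int)
    for s in range(1, goal):
        stakes = list(range(1, min(s, goal - s) + 1))
        returns = [
            p_h * ((1.0 if s + a == goal else 0.0) + V[s + a])
            + (1 - p_h) * V[s - a]
            for a in stakes
        ]
        policy[s] = stakes[int(np.argmax(returns))]
    return V, policy

if __name__ == "__main__":
    V, policy = gamblers_value_iteration()
    print("V[50] =", round(V[50], 4), "stake at 50 =", policy[50])
```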

## Project Goal

Reinforce the material from the book by working through exercises from *Reinforcement Learning: An Introduction* by Sutton and Barto, along with exercises of my own.