Skip to content

Stanford CS234: Reinforcement Learning assignments and practices

License

Notifications You must be signed in to change notification settings

ksang/cs234-assignments

Repository files navigation

cs234-assignments

Stanford CS234: Reinforcement Learning assignments and practices

Overview

  • This project are assignment solutions and practices of Stanford class CS234.

  • The assignments are for Winter 2020, video recordings are available on Youtube.

  • For detailed information of the class, goto: CS234 Home Page

  • Assignments will be updated with my solutions, currently WIP.

Index

There are totally three assignments, each of them has programming part and written part.

Assignment 1 written

  • Grid World

  • Value of Different Policies

  • Fixed Point

Assignment 1 coding

  • Frozen Lake MDP, policy evaluation, policy improvement, policy iteration, value iteration.

  • Details see source code: vi_and_pi.py

Assignment 2 written

Assignment 2 coding

  • Q-learning

  • Linear Approximation

  • Implementing DeepMind's DQN

  • DQN on Atari

  • n-step Estimators

Assignment 3 written

Assignment 3 coding

  • Policy Gradient Methods

  • Best Arm Identification in Multi-armed Bandit

Notes on minimum LaTex environment installation on OSX

  • Install basic tex package

      brew cask install basictex
    
  • Install missing packages from the assignments.

    • if some package is missing, tex compiler such as pdftex will give you their name, e.g. nicefrac.sty.

    • Search the package name on CTAN, and get the parent package name, e.g. units

    • Below command will install the package:

        sudo tlmgr install units
      
    • I already did this for you, here is the command for install all dependencies for this assignment:

        sudo tlmgr update --self
        sudo tlmgr install units fullpage preprint \
                            wrapfig was apptools appendix \
                            titlesec enumitem breakurl \
                            algorithm2e ifoddpage relsize cm-super \
                            lastpage comment framed biblatex typewriter \
                            tcolorbox environ trimspaces
      
  • Use your favorite editor with LaTeX support and enjoy the math. I'm using Atom with LaTex and pdf-view package.

Any questions or advice, just open an issue or pull request.

About

Stanford CS234: Reinforcement Learning assignments and practices

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages