Skip to content

Latest commit

 

History

History
18 lines (16 loc) · 314 Bytes

README.md

File metadata and controls

18 lines (16 loc) · 314 Bytes

R2D2

RECURRENT EXPERIENCE REPLAY IN DISTRIBUTED REINFORCEMENT LEARNING (n-step Q-learning is not implemented)

Setup

conda env create --file environment.yaml

Usage

OpenAI Gym environment

python models/lstm_burn_in/run.py

Marlo environment

python models/lstm_burn_in/run_marlo.py