An investigation of offline reinforcement learning in factorisable action spaces

This repository contains the code and datasets used to produce the results in our paper published in TMLR.

Installation instructions for the DeepMind Control suite can be found here.

Installation instructions for the Maze environment can be found here.

Datasets can be downloaded from here.

Errata

In Appendix C.2 of the paper the random-medium-expert datasets for the DMC suite environments/tasks are stated to be 1M transitions, the same as medium and expert. This is incorrect and the paper should have stated the random-medium-expert datasets are 200k transitions. These datasets were intentionally smaller than medium and expert as we wanted to create more of a challenge based on sub-optimality, diversity and (relatively) small numbers of transitions.

Instructions for running code

We provide individual examples of running each algorithm for one set of DMC and Maze datasets. To train on a different dataset, simply update the associated parameters.

DMC

For the DMC suite we have created a separate package for loading environments and datasets. This can be installed by cloning this repository, navigating to the root and running

pip install -r requirements.txt

Further instructions, including folder structures for the data, can be accessed here.

Maze

Be sure to update expert and random scores as well as the number of sub-action dimensions.

A full list of expert and random scores is available in the file "Expert_Random_Scores.csv"

Feedback

If you experience any problems or have any queries, please raise an issue or pull request.

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
Algorithms		Algorithms
Utils		Utils
DMC_Example_DQN_CQL.py		DMC_Example_DQN_CQL.py
DMC_Example_DecQN.py		DMC_Example_DecQN.py
DMC_Example_DecQN_BCQ.py		DMC_Example_DecQN_BCQ.py
DMC_Example_DecQN_CQL.py		DMC_Example_DecQN_CQL.py
DMC_Example_DecQN_IQL.py		DMC_Example_DecQN_IQL.py
DMC_Example_DecQN_OneStep.py		DMC_Example_DecQN_OneStep.py
DMC_Example_FactorisedBC.py		DMC_Example_FactorisedBC.py
Expert_Random_Scores.csv		Expert_Random_Scores.csv
Maze_Example_DQN_CQL.py		Maze_Example_DQN_CQL.py
Maze_Example_DecQN.py		Maze_Example_DecQN.py
Maze_Example_DecQN_BCQ.py		Maze_Example_DecQN_BCQ.py
Maze_Example_DecQN_CQL.py		Maze_Example_DecQN_CQL.py
Maze_Example_DecQN_IQL.py		Maze_Example_DecQN_IQL.py
Maze_Example_DecQN_OneStep.py		Maze_Example_DecQN_OneStep.py
Maze_Example_FactorisedBC.py		Maze_Example_FactorisedBC.py
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

An investigation of offline reinforcement learning in factorisable action spaces

Errata

Instructions for running code

DMC

Maze

Feedback

About

Releases

Packages

Contributors 2

Languages

AlexBeesonWarwick/OfflineRLFactorisableActionSpaces

Folders and files

Latest commit

History

Repository files navigation

An investigation of offline reinforcement learning in factorisable action spaces

Errata

Instructions for running code

DMC

Maze

Feedback

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages