This repository is based on the CORL library.
Set up and install the d4rl environments by following the instructions provided in the d4rl documentation until you can successfully run import d4rl
in your Python environment.
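As a quick sanity check, here is a minimal sketch of verifying the installation from Python; the environment name halfcheetah-medium-v2 is only an example, not a dataset required by this repository:
import gym
import d4rl  # registers the D4RL offline environments with gym
# Any D4RL task name works here; halfcheetah-medium-v2 is used as an example.
env = gym.make("halfcheetah-medium-v2")
dataset = env.get_dataset()  # dict with observations, actions, rewards, terminals
print(dataset["observations"].shape)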
Clone the GitHub repository and install required packages:
git clone https://github.com/uiuc-focal-lab/ORL.git && cd ORL
pip install -r requirements/requirements_dev.txt
Initialize Wandb by running the following command inside the repository folder:
wandb init
Follow the prompts to create a new project or connect to an existing one. Make sure your API key and project settings are configured, and update the project argument accordingly.
For more information on how to use Wandb, refer to the Wandb documentation.
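If you prefer to set up logging programmatically, here is a minimal sketch using the standard Wandb Python API; the project and entity names are placeholders, not values used by this repository:
import wandb
# Replace the placeholders with your own Wandb project and entity.
wandb.init(project="my-orl-project", entity="my-team")
wandb.log({"step": 0, "example_metric": 0.0})  # logging works once init succeeds
wandb.finish()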
Run the following shell scripts to generate the datasets; the outputs will be written into the saved folder (a quick check of the output is sketched after the commands):
. generate_pbrl_datasets.sh
. generate_pbrl_datasets_no_overlap.sh
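To confirm that the scripts produced output, a minimal check in Python (assuming the output directory is saved at the repository root):
import os
# List whatever the generation scripts wrote into the saved folder.
for name in sorted(os.listdir("saved")):
    print(name)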
Run the example script, which contains a sample Python command. Make sure the necessary dependencies are installed and the Python environment is properly configured.
. example.sh
To run the full experiment and ablation study, use the following scripts:
main.sh: Contains commands for the full experiment.
abl.sh: Contains commands for the ablation study.
Execute these scripts in your terminal:
. main.sh
. abl.sh
Training logs of learning with different methods on different datasets: Oracle True Reward, ORL, Latent Reward Model, and IPL with True Reward
Training logs of learning with a single method on datasets of different sizes
Comparison of the learning efficiency of ORL combined with different standard offline RL algorithms
Comparison between the cases where a single preference label or multiple preference labels are given to each pair of trajectories
Comparison between datasets with different settings of structured overlapping trajectories