This repository is a reimplementation of Policy Optimization with Demonstrations (ICML 2018).
This code is highly based on OpenAI baselines gail.
To run POfD
on delayed Mujoco tasks:
python baselines/gail/run_mujoco.py --env_id Hopper-v1 --reward-freq 10 --num_epochs 1000