Vivek Myers, Bill Chunyuan Zheng, Oier Mees, Sergey Levine, Kuan Fang
This repository contains the code for Policy Adaptation via Language Optimization (PALO), which combines a handful of demonstrations of a task with language decompositions sampled from a VLM to enable rapid nonparametric adaptation to new tasks, avoiding the need for a large fine-tuning dataset.
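At a high level, the adaptation above can be viewed as a nonparametric search: among candidate language decompositions sampled from a VLM, select the one under which a language-conditioned policy best explains the few demonstrations. The sketch below is purely illustrative and does not use the repository's actual API; `policy_loss`, the toy candidates, and the toy demonstration data are all hypothetical stand-ins:

```python
# Illustrative sketch of PALO-style selection (not the repository's API).
# Given a few demonstrations and VLM-proposed subtask decompositions, pick
# the decomposition minimizing the policy's imitation loss on the demos.

def policy_loss(decomposition, demo):
    # Hypothetical stand-in: score how well a language-conditioned policy
    # reproduces the demo's actions under this decomposition. Here we fake
    # it with a toy distance between subtask strings and "actions".
    return sum(abs(len(step) - a) for step, a in zip(decomposition, demo))

def select_decomposition(candidates, demos):
    # Nonparametric adaptation: no gradient updates, just an argmin
    # over the sampled candidates.
    return min(candidates,
               key=lambda d: sum(policy_loss(d, demo) for demo in demos))

candidates = [
    ["grasp the spoon", "move to the towel", "release"],
    ["pick up spoon", "place on towel"],
]
demos = [[15, 17, 7], [14, 18, 8]]  # toy "action" sequences
best = select_decomposition(candidates, demos)
print(best)
```

The key design point is that adaptation happens by searching over sampled language decompositions rather than by updating policy weights, which is why only a handful of demonstrations are needed.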
conda create -n palo python=3.10
conda activate palo
pip install -e .
pip install -r requirements.txt
For GPU:
pip install --upgrade "jax[cuda]" -f https://storage.googleapis.com/jax-releases/jax_cuda_releases.html
For TPU:
pip install --upgrade "jax[tpu]" -f https://storage.googleapis.com/jax-releases/libtpu_releases.html
See the JAX GitHub page for more details on installing JAX.
To get the best language decomposition from PALO, you can run the following command:
python palo/optimize.py --instruction [Your Instruction Here] --trajectory_path [Your data here] \
--checkpoint_path "./agent/checkpoint/" --im_size 224 --config_dir "./agent/config.pkl"
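As a concrete usage sketch, the placeholders above can be filled in like so; the instruction and trajectory path below are hypothetical examples, not files shipped with the repository:

```shell
# Hypothetical invocation: substitute your own instruction and data path.
python palo/optimize.py \
    --instruction "put the spoon on the towel" \
    --trajectory_path ./data/spoon_demos.npy \
    --checkpoint_path "./agent/checkpoint/" --im_size 224 \
    --config_dir "./agent/config.pkl"
```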
Please consider citing our work if you find it useful:
@inproceedings{myers2024policy,
title={Policy Adaptation via Language Optimization: Decomposing Tasks for Few-Shot Imitation},
author={Vivek Myers and Bill Chunyuan Zheng and Oier Mees and Sergey Levine and Kuan Fang},
booktitle={Conference on Robot Learning},
year={2024}
}