The ability of intelligent systems to predict human behaviors is crucial, particularly in fields such as autonomous vehicle navigation and social robotics. However, the complexity of human motion has prevented the development of a standardized dataset for human motion prediction, thereby hindering the establishment of pre-trained models. In this paper, we address these limitations by integrating multiple datasets, encompassing both trajectory and 3D pose keypoints, to propose a pre-trained model for human motion prediction. We merge seven distinct datasets across varying modalities and standardize their formats. To facilitate multimodal pre-training, we introduce Multi-Transmotion, an innovative transformer-based model designed for cross-modality pre-training. Additionally, we present a novel masking strategy to capture rich representations. Our methodology demonstrates competitive performance across various datasets on several downstream tasks, including trajectory prediction on the NBA and JTA datasets, as well as pose prediction on the AMASS and 3DPW datasets.
Install the requirements using pip:
pip install -r requirements.txt
The framework uses a modified trajnet++ to generate a specific tabular format in '.ndjson' files. Specifically, the first part of each '.ndjson' file represents the trainable scenes/samples, and the second part contains the detailed human motion information. The 207 columns of human data are listed below (a parsing sketch follows the list):
- [0]: Frame ID
- [1]: Agent ID
- [2:4]: Trajectory (x, y)
- [4:8]: 3D Bounding Box (h, w, l, rot_z)
- [8:12]: 2D Bounding Box (bb_left, bb_top, bb_width, bb_height)
- [12:129]: 3D pose keypoints (x0, x1, ..., x38, y0, y1, ..., y38, z0, z1, ..., z38)
- [129:207]: 2D pose keypoints (xx0, xx1, ..., xx38, yy0, yy1, ..., yy38)

Missing data is denoted as 'null'.
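The sketch below shows one way to slice a 207-column motion row into the fields above. It is illustrative only: the helper name and the assumption that each row arrives as a plain 207-element list (with missing entries as None) are ours, not the repository's API.

```python
import numpy as np

def parse_motion_row(row):
    """Slice one 207-column motion row into named fields (illustrative helper)."""
    # Replace missing entries ('null' / None) with NaN so the arrays stay numeric.
    vals = np.array([np.nan if v is None else float(v) for v in row])
    return {
        "frame_id":   vals[0],
        "agent_id":   vals[1],
        "trajectory": vals[2:4],                     # (x, y)
        "bbox_3d":    vals[4:8],                     # (h, w, l, rot_z)
        "bbox_2d":    vals[8:12],                    # (bb_left, bb_top, bb_width, bb_height)
        "pose_3d":    vals[12:129].reshape(3, 39),   # rows: x, y, z for 39 keypoints
        "pose_2d":    vals[129:207].reshape(2, 39),  # rows: x, y for 39 keypoints
    }

# Quick check with a dummy row of the right length (2 ids + 205 values = 207 columns).
dummy_row = [0, 7] + [0.0] * 205
fields = parse_motion_row(dummy_row)
print(fields["pose_3d"].shape)  # (3, 39)
```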
Here we show an example of converting the NBA dataset into the Unified Human Motion (UniHuMotion) data framework. For convenience, we have added the NBA data to the release section of the repository (for license details, please refer to the original papers).
- Download the data from our release and place the raw data subdirectory of NBA under
extract_data/NBA/raw/
- Extract NBA data
python extract_data/NBA/extract_NBA.py --split train
- Convert the extracted data into the UniHuMotion framework.
mv extract_data/NBA/output_csv/nba_train.csv UniHuMotion_trajnetpp/data/UniHuMotion_NBA/
cd UniHuMotion_trajnetpp
python -m trajnetdataset.convert --acceptance 1.0 1.0 1.0 1.0 --train_fraction 1.0 --val_fraction 0.0 --fps 5 --obs_len 10 --pred_len 20 --chunk_stride 1
- Generate the trainable data cache (for efficient data loading).
mv output_pre UHM_NBA
mv UHM_NBA ../data/UniHuMotion/
cd ..
python UniHuMotion_cache/cache_generator.py --UniHuMotion_dataset UHM_NBA
The code uses PyTorch DDP (DistributedDataParallel), so it can easily be deployed on multiple GPUs.
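For context, the block below is a generic sketch of the standard DDP training pattern with a placeholder model and optimizer; it is not the repository's training code, which is launched through the demo commands that follow.

```python
import os
import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP

def main():
    # torchrun sets RANK, LOCAL_RANK, and WORLD_SIZE in the environment.
    local_rank = int(os.environ["LOCAL_RANK"])
    dist.init_process_group(backend="nccl")
    torch.cuda.set_device(local_rank)

    # Placeholder model; the real code builds Multi-Transmotion from its config.
    model = torch.nn.Linear(128, 128).cuda(local_rank)
    model = DDP(model, device_ids=[local_rank])
    optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)

    # One dummy training step: DDP synchronizes gradients across processes
    # during backward(), so each rank ends up with the same update.
    x = torch.randn(32, 128, device=local_rank)
    loss = model(x).pow(2).mean()
    loss.backward()
    optimizer.step()

    dist.destroy_process_group()

if __name__ == "__main__":
    main()
```

A script following this pattern is typically launched with torchrun --nproc_per_node=<num_gpus> script.py.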
Demo command for pre-training:
python pretrain/train.py --cfg pretrain/configs/UniHuMotion.yaml --exp_name test_git
Demo command for fine-tuning on NBA dataset:
python ft_NBA/train.py --cfg ft_NBA/configs/UniHuMotion.yaml --exp_name default
Demo command for evaluation on NBA dataset:
python ft_NBA/evaluate.py --ckpt ./experiments/default_experiment/checkpoints/checkpoint_default/FT_NBA_ckpt.pth.tar --metric ade_fde
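Here, ade_fde refers to the standard Average and Final Displacement Errors used in trajectory prediction. The sketch below shows how these metrics are commonly computed; the function and array layout are illustrative, not the repository's evaluation code.

```python
import numpy as np

def ade_fde(pred, gt):
    """Compute ADE and FDE for predicted vs. ground-truth trajectories.

    pred, gt: arrays of shape (num_agents, pred_len, 2) holding (x, y) positions.
    ADE averages the L2 error over all predicted timesteps;
    FDE uses only the final predicted timestep.
    """
    dist = np.linalg.norm(pred - gt, axis=-1)  # (num_agents, pred_len)
    ade = dist.mean()
    fde = dist[:, -1].mean()
    return ade, fde

# Toy example: 2 agents, 20 predicted timesteps (matching --pred_len 20 above).
pred = np.zeros((2, 20, 2))
gt = np.ones((2, 20, 2))
print(ade_fde(pred, gt))  # both errors equal sqrt(2) here
```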
For ease of use, we have also provided the pre-trained and fine-tuned models in the release section of this repo.
This repository is a work in progress and will continue to be updated and improved over the coming months.
@inproceedings{gao2024multi,
title={Multi-Transmotion: Pre-trained Model for Human Motion Prediction},
author={Gao, Yang and Luan, Po-Chien and Alahi, Alexandre},
booktitle={8th Annual Conference on Robot Learning},
year={2024}
}
This work is a follow-up to Social-Transmotion:
@inproceedings{saadatnejad2024socialtransmotion,
title={Social-Transmotion: Promptable Human Trajectory Prediction},
author={Saadatnejad, Saeed and Gao, Yang and Messaoud, Kaouther and Alahi, Alexandre},
booktitle={International Conference on Learning Representations (ICLR)},
year={2024}
}