Official implementation for "CP-VTON+: Clothing Shape and Texture Preserving Image-Based Virtual Try-On" from CVPRW 2020.
Project page: https://minar09.github.io/cpvtonplus/.
Saved/Pre-trained models: Checkpoints
Dataset: VITON_PLUS
The code and pre-trained models were tested with PyTorch 0.4.1, torchvision 0.2.1, opencv-python 4.1, and Pillow 5.4 (Python 3 environment).
Project page | Paper | Dataset | Model | Video
This pipeline is a combination of consecutive training and testing of GMM + TOM. GMM generates the warped clothes according to the target human. Then, TOM blends the warped clothes outputs from GMM into the target human properties, to generate the final try-on output.
- Install the requirements
- Download/prepare the dataset
- Train the GMM network
- Get warped clothes for the training set with the trained GMM network, and copy the warped clothes & masks into the `data/train` directory
- Train the TOM network
- Test/evaluate with the test set: test GMM first, copy the warped clothes & masks into the `data/test` directory, then test TOM
PyTorch and torchvision are best installed with conda: `conda install pytorch=0.4.1 torchvision=0.2.1 -c pytorch`
For the remaining packages, run `pip install -r requirements.txt`
For training/testing VITON dataset, our full and processed dataset is available here: https://1drv.ms/u/s!Ai8t8GAHdzVUiQQYX0azYhqIDPP6?e=4cpFTI. After downloading, unzip to your data directory. Then skip the following and move to Training/Testing.
If you want to use the viton_resize dataset from cp-vton and do the processing manually, please run the following steps:
- Run `python data_download.py`; it will download the full dataset (both train and test) into the `data/` folder.
- Run `python dataset_neck_skin_correction.py` for both the 'train' and 'test' data. Please set the correct paths in the script before running. It adds a new segmentation label for the neck/skin areas and saves the new segmentation in the "image-parse-new" folder.
- Run `python body_binary_masking.py` for both the 'train' and 'test' data. Please set the correct paths in the script before running. It creates the body binary masks used as inputs to the networks.
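The three manual preprocessing steps above can be scripted. A minimal sketch (the script names come from this README; the 'train'/'test' paths inside each script still need to be set by hand first):

```python
import sys

# Script names are taken from the steps above; the order matters
# (download -> neck/skin correction -> body binary masking).
PREPROCESS_SCRIPTS = [
    "data_download.py",
    "dataset_neck_skin_correction.py",
    "body_binary_masking.py",
]

def preprocess_commands(python=sys.executable):
    """Return the manual preprocessing commands, in the order described above."""
    return [[python, script] for script in PREPROCESS_SCRIPTS]

# Usage (after setting the correct 'train'/'test' paths inside each script):
# import subprocess
# for cmd in preprocess_commands():
#     subprocess.run(cmd, check=True)
```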
Run `python train.py` with the appropriate options for the GMM and TOM stages.
For example, GMM: `python train.py --name GMM --stage GMM --workers 4 --save_count 5000 --shuffle`
Then run test.py for GMM network with the training dataset, which will generate the warped clothes and masks in "warp-cloth" and "warp-mask" folders inside the "result/GMM/train/" directory. Copy the "warp-cloth" and "warp-mask" folders into your data directory, for example inside "data/train" folder.
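The copy step just described can be scripted. A sketch using the default paths mentioned in this README:

```python
import shutil
from pathlib import Path

def copy_warp_outputs(result_dir="result/GMM/train", data_dir="data/train"):
    """Copy GMM's "warp-cloth" and "warp-mask" folders into the data directory."""
    for folder in ("warp-cloth", "warp-mask"):
        src = Path(result_dir) / folder
        dst = Path(data_dir) / folder
        if dst.exists():
            shutil.rmtree(dst)  # replace any stale copy from a previous run
        shutil.copytree(src, dst)
```

The same helper works for the test stage with `copy_warp_outputs("result/GMM/test", "data/test")`.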
Then train the TOM stage: `python train.py --name TOM --stage TOM --workers 4 --save_count 5000 --shuffle`
Run `python test.py` with the appropriate options.
For example, GMM: `python test.py --name GMM --stage GMM --workers 4 --datamode test --data_list test_pairs.txt --checkpoint checkpoints/GMM/gmm_final.pth`
Then run test.py for GMM network with the testing dataset, which will generate the warped clothes and masks in "warp-cloth" and "warp-mask" folders inside the "result/GMM/test/" directory. Copy the "warp-cloth" and "warp-mask" folders into your data directory, for example inside "data/test" folder.
Run the TOM stage: `python test.py --name TOM --stage TOM --workers 4 --datamode test --data_list test_pairs.txt --checkpoint checkpoints/TOM/tom_final.pth`
Download the pre-trained models from here: https://1drv.ms/u/s!Ai8t8GAHdzVUiQA-o3C7cnrfGN6O?e=EaRiFP. Then follow the same steps as in Testing to test/run inference with our model.
To run the model with custom images from the internet, make sure you have the following inputs:
- image (image of a person, crop/resize to 192 x 256 (width x height) pixels)
- image-parse (you can generate this from the person image with the LIP_JPPNet or CIHP_PGN pretrained networks. If you use LIP_JPPNet/LIP_SSL, run dataset_neck_skin_correction.py and body_binary_masking.py after generating the segmentation files. For CIHP_PGN segmentation, see this comment)
- cloth (in-shop cloth image, crop/resize to 192 x 256 (width x height) pixels)
- cloth-mask (binary mask of cloth image, you can generate it with simple pillow/opencv function)
- pose (pose keypoints of the person, generated with the OpenPose COCO-18 model; OpenPose from the official repository is preferred)
- Also, make a test_pairs.txt file for your custom images. Follow the VITON dataset format to keep the same arrangement; otherwise, modify the code accordingly.
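For the cloth-mask input above, simple thresholding is usually enough when the in-shop cloth is photographed on a near-white background (an assumption; adjust the threshold otherwise). A sketch operating on NumPy arrays, which is what `cv2.imread` returns:

```python
import numpy as np

def cloth_binary_mask(image, thresh=240):
    """Binary mask of an in-shop cloth image on a near-white background.

    image  : HxWx3 uint8 array (e.g. from cv2.imread), 192x256 for CP-VTON+.
    thresh : pixels whose channels are ALL at or above this value are background.
    Returns an HxW uint8 array with 255 on the cloth and 0 on the background.
    """
    background = np.all(image >= thresh, axis=-1)
    return np.where(background, 0, 255).astype(np.uint8)
```

The result can be written out with `cv2.imwrite(path, mask)` into your cloth-mask folder.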
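A test_pairs.txt file in the VITON format lists one person image and one cloth image per line, separated by a space. A small sketch to write one for custom images (the filenames in the example are placeholders):

```python
def write_test_pairs(pairs, path="data/test_pairs.txt"):
    """Write (person_image, cloth_image) pairs in the VITON test_pairs.txt format."""
    with open(path, "w") as f:
        for person, cloth in pairs:
            f.write(f"{person} {cloth}\n")

# Example with placeholder filenames:
# write_test_pairs([("person_01.jpg", "cloth_01.jpg")])
```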
Many factors can cause distorted/unexpected results. Please try the following:
- First try the original VITON dataset and test pair combinations, and check the intermediate results and final output. Check whether they are as expected.
- If the original VITON results are not as expected, check the issues raised on GitHub; several problems have been reported there, along with how people solved them.
- If the original VITON test results are as expected, run your custom test sets, check the intermediate results, and debug where it is going wrong.
- If you are testing with custom images, check the GitHub repository README and the related issues on how to run with custom images.
It is difficult to diagnose an issue from a single image/output alone. As mentioned, many factors are involved. Please debug step by step to see where it is going wrong: check all the available inputs/outputs visually, and check multiple cases to see whether the issue happens in all of them. Good luck!
Please cite our paper in your publications if it helps your research:
@InProceedings{Minar_CPP_2020_CVPR_Workshops,
  title = {CP-VTON+: Clothing Shape and Texture Preserving Image-Based Virtual Try-On},
  author = {Minar, Matiur Rahman and Thai Thanh Tuan and Ahn, Heejune and Rosin, Paul and Lai, Yu-Kun},
  booktitle = {The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops},
  month = {June},
  year = {2020}
}
This implementation is largely based on the PyTorch implementation of CP-VTON. We are extremely grateful for their public implementation.