Generative Knowledge Transfer refers to leveraging the generative knowledge of large-scale foundation generative models to improve the performance of downstream machine learning tasks. In this repository, we experiment with this idea by selecting AFHQ as a benchmark dataset and generating synthetic data with Stable Diffusion in a zero-shot setting. We evaluate the quality of the generated dataset by training different classifiers on the datasets and validating their performance on the AFHQ validation set.
conda env create -f environment.yaml
conda activate gkt
or
python3 -m venv gkt
source gkt/bin/activate
pip install -r requirements.txt
The repository uses stargan-v2 to download the dataset. Set up the submodule with the following command:
git submodule update --init
To download the dataset, do the following:
cd ./stargan-v2/
bash download.sh afhq-dataset
For our implementation, we only used the cat and dog images. However, you can include the wild-animal images in your own experiments.
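If you want to restrict the downloaded data to cat and dog, a small helper like the following can prune the other class folders. It assumes the `data/afhq/{train,val}/{cat,dog,wild}` layout produced by the stargan-v2 download script; adjust the root path if your copy differs.

```python
from pathlib import Path
import shutil

def keep_classes(afhq_root, classes=("cat", "dog")):
    """Delete every class folder under each split except the ones listed.

    Hypothetical helper; the afhq_root/{split}/{class} layout is an
    assumption based on the stargan-v2 download script.
    """
    kept = []
    for split_dir in Path(afhq_root).iterdir():
        if not split_dir.is_dir():
            continue
        for class_dir in split_dir.iterdir():
            if class_dir.name in classes:
                kept.append(class_dir)
            else:
                shutil.rmtree(class_dir)  # drops e.g. the wild class
    return kept
```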
To generate synthetic data, we use the stable-diffusion submodule. Follow the steps below to generate the synthetic dataset:
- Download the Stable Diffusion checkpoint from Hugging Face.
- Read `run_data.sh` and choose the parameters for dataset sampling.
- Run the `run_data.sh` file:
./run_data.sh
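The sampling loop boils down to one text prompt per class and a deterministic output path per sample. The sketch below shows one way such jobs might be laid out; the prompt template, class names, and directory naming are illustrative assumptions, not the exact values used in `run_data.sh`.

```python
from itertools import product

def build_sampling_jobs(classes=("cat", "dog"), n_per_class=2,
                        template="a photo of a {cls}"):
    """Enumerate (prompt, output path) pairs for zero-shot sampling.

    Illustrative sketch: the template and path scheme are assumptions.
    """
    jobs = []
    for cls, idx in product(classes, range(n_per_class)):
        jobs.append({
            "prompt": template.format(cls=cls),
            "outfile": f"sd_data/{cls}/{cls}_{idx:05d}.png",
        })
    return jobs
```

Each job would then be handed to the Stable Diffusion sampler, which writes the generated image to `outfile`.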
For training classifiers on the AFHQ dataset:
- Check the `src/config_afhq.yaml` file and set the parameter values for training.
- Run the following command:
accelerate launch src/train.py src/config_afhq.yaml
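For orientation, a training config typically covers the data location, model choice, and optimization settings. The keys below are illustrative assumptions; refer to the actual `src/config_afhq.yaml` for the parameters this repository supports.

```yaml
# Illustrative config sketch -- key names are assumptions,
# see src/config_afhq.yaml for the real parameters.
data_dir: data/afhq
classes: [cat, dog]
batch_size: 64
epochs: 20
learning_rate: 0.001
output_dir: checkpoints/afhq
```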
Similarly, for training classifiers on the synthetic Stable Diffusion dataset:
- Check the `src/config_sd.yaml` file and set the parameter values for training.
- Run the following command:
accelerate launch src/train.py src/config_sd.yaml
For evaluating the classifiers:
- Check the `src/config_eval.yaml` file and set the parameter values for evaluation.
- Run the following command:
python3 src/eval.py src/config_eval.yaml
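At its core, the evaluation compares each classifier's predictions on the AFHQ validation set against the ground-truth labels. The exact metrics reported by `src/eval.py` are not shown here, so the following top-1 accuracy helper is an illustrative stand-in:

```python
def top1_accuracy(predictions, labels):
    """Fraction of predicted class indices that match the ground truth.

    Illustrative sketch of the validation metric; eval.py may report
    additional metrics.
    """
    if len(predictions) != len(labels):
        raise ValueError("predictions and labels must have the same length")
    correct = sum(p == y for p, y in zip(predictions, labels))
    return correct / len(labels)
```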
If you have any questions or comments, please open an issue on our GitHub repository.
The project was implemented during my Ph.D. studies as part of the Deep Generative Models course at Linköping University. This work was supported by the Wallenberg AI, Autonomous Systems and Software Program (WASP), funded by the Knut and Alice Wallenberg Foundation.
The AFHQ dataset was used for the experiments, and the synthetic dataset was created with the Stable Diffusion model developed by CompVis, Stability AI, and Runway. We acknowledge their contributions to the field of generative models.