D-TIIL

This repository contains the code and data for the following paper:

Exposing Text-Image Inconsistency Using Diffusion Models (ICLR 2024)

Dataset

Please download the dataset here: [Google Drive]. When seeking permission, kindly provide your details along with the intended purpose for using this dataset. Please be aware that our dataset is exclusively intended for research purposes.

Installation

We tested on the environment of torch 1.13.1 with a cuda version of 11.7

pip install -r requirements.txt

Quick Start

Please check the provided jupyter notebook for details, or you can easily run the model using following code:

import torch
from PIL import Image
from pipeline import DTIILPipeline
im = Image.open('./asset/exampe.jpg').resize((512,512)).convert("RGB")
model_id = "runwayml/stable-diffusion-v1-5"
pipe = DTIILPipeline.from_pretrained(model_id, safety_checker=None)
mask = pipe(prompt, im)['final_mask']

Citation

If you find our code or dataset useful, please cite:

@inproceedings{
  huang2024exposing,
  title={Exposing Text-Image Inconsistency Using Diffusion Models},
  author={Mingzhen Huang and Shan Jia and Zhou Zhou and Yan Ju and Jialing Cai and Siwei Lyu},
  booktitle={The Twelfth International Conference on Learning Representations},
  year={2024},
}

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
asset		asset
Incon-Det-Demo.ipynb		Incon-Det-Demo.ipynb
README.md		README.md
pipeline.py		pipeline.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

D-TIIL

Dataset

Installation

Quick Start

Citation

About

Releases

Packages

Contributors 2

Languages

Mingzhen-Huang/D-TIIL

Folders and files

Latest commit

History

Repository files navigation

D-TIIL

Dataset

Installation

Quick Start

Citation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages