Skip to content

Commit

Permalink
Init.
Browse files Browse the repository at this point in the history
  • Loading branch information
EzioBy committed Dec 30, 2024
1 parent 82db150 commit 5e9f6ff
Show file tree
Hide file tree
Showing 13 changed files with 450 additions and 1 deletion.
40 changes: 39 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
@@ -1 +1,39 @@
# edicho
# Edicho: Consistent Image Editing in the Wild

> **Edicho: Consistent Image Editing in the Wild** <br>
> Qingyan Bai, Hao Ouyang, Yinghao Xu, Qiuyu Wang, Ceyuan Yang, Ka Leong Cheng, Yujun Shen, Qifeng Chen <br>
<div align=center>
<img src="./docs/assets/teaser.jpg" width=850px>
</div>

**Figure:** Given two images in the wild, Edicho generates consistent editing versions of them in a zero-shot manner. Our approach
achieves precise consistency for editing parts (left), objects (middle), and the entire images (right) by leveraging explicit correspondence.

**[**[**Paper**](https://arxiv.org/pdf/2412.XXXXX)**]**
**[**[**Project Page**](https://ezioby.github.io/edicho/)**]**

As a verified need, consistent editing across in-the-wild images remains a technical challenge arising from various unmanageable factors, like object poses, lighting conditions, and photography environments.
<i>Edicho</i>, embodying "echoing the editing effect", steps in with a training-free solution based on diffusion models, featuring a fundamental design principle of using <b>explicit</b> image correspondence to direct editing.
Specifically, the key components include an attention manipulation module and a carefully refined classifier-free guidance (CFG) denoising strategy, both of which take into account the pre-estimated correspondence.
Such an inference-time algorithm enjoys a plug-and-play nature and is compatible to most diffusion-based editing methods, such as ControlNet and BrushNet.
Extensive results demonstrate the efficacy of <i>Edicho</i> in consistent cross-image editing under diverse settings.

<div align=center>
<img src="./docs/assets/pipeline.jpg" width=900px>
</div>

**Figure:** Method overview. To achieve consistent editing, we propose a training-free and plug-and-play method that injects the pre-computed correspondence into the pre-trained diffusion models and guides the denoising in the two levels of (a) attention features and (b) noisy latents in classifier-free guidance (CFG).


## BibTeX

If you find our work helpful for your research, please consider to cite:
```bibtex
@inproceedings{bai2024edicho,
title = {Edicho: Consistent Image Editing in the Wild},
author = {Bai, Qingyan and Ouyang, Hao and Xu, Yinghao and Wang, Qiuyu and Yang, Ceyuan and Cheng, Ka Leong and Shen, Yujun and Chen, Qifeng},
booktitle = {arXiv preprint arXiv:2412.XXXXX},
year = {2024}
}
```
Binary file added docs/assets/3d_matching.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/assets/additional_results.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
5 changes: 5 additions & 0 deletions docs/assets/bootstrap.min.css

Large diffs are not rendered by default.

Binary file added docs/assets/corr.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/assets/customization.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
37 changes: 37 additions & 0 deletions docs/assets/font.css
Original file line number Diff line number Diff line change
@@ -0,0 +1,37 @@
/* Homepage Font */

/* latin-ext */
@font-face {
font-family: 'Lato';
font-style: normal;
font-weight: 400;
src: local('Lato Regular'), local('Lato-Regular'), url(https://fonts.gstatic.com/s/lato/v16/S6uyw4BMUTPHjxAwXjeu.woff2) format('woff2');
unicode-range: U+0100-024F, U+0259, U+1E00-1EFF, U+2020, U+20A0-20AB, U+20AD-20CF, U+2113, U+2C60-2C7F, U+A720-A7FF;
}

/* latin */
@font-face {
font-family: 'Lato';
font-style: normal;
font-weight: 400;
src: local('Lato Regular'), local('Lato-Regular'), url(https://fonts.gstatic.com/s/lato/v16/S6uyw4BMUTPHjx4wXg.woff2) format('woff2');
unicode-range: U+0000-00FF, U+0131, U+0152-0153, U+02BB-02BC, U+02C6, U+02DA, U+02DC, U+2000-206F, U+2074, U+20AC, U+2122, U+2191, U+2193, U+2212, U+2215, U+FEFF, U+FFFD;
}

/* latin-ext */
@font-face {
font-family: 'Lato';
font-style: normal;
font-weight: 700;
src: local('Lato Bold'), local('Lato-Bold'), url(https://fonts.gstatic.com/s/lato/v16/S6u9w4BMUTPHh6UVSwaPGR_p.woff2) format('woff2');
unicode-range: U+0100-024F, U+0259, U+1E00-1EFF, U+2020, U+20A0-20AB, U+20AD-20CF, U+2113, U+2C60-2C7F, U+A720-A7FF;
}

/* latin */
@font-face {
font-family: 'Lato';
font-style: normal;
font-weight: 700;
src: local('Lato Bold'), local('Lato-Bold'), url(https://fonts.gstatic.com/s/lato/v16/S6u9w4BMUTPHh6UVSwiPGQ.woff2) format('woff2');
unicode-range: U+0000-00FF, U+0131, U+0152-0153, U+02BB-02BC, U+02C6, U+02DA, U+02DC, U+2000-206F, U+2074, U+20AC, U+2122, U+2191, U+2193, U+2212, U+2215, U+FEFF, U+FFFD;
}
Binary file added docs/assets/global_comparison.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/assets/local_comparison.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/assets/pipeline.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
156 changes: 156 additions & 0 deletions docs/assets/style.css
Original file line number Diff line number Diff line change
@@ -0,0 +1,156 @@
/* Body */
body {
background: #e3e5e8;
color: #ffffff;
font-family: 'Lato', Verdana, Helvetica, sans-serif;
font-weight: 300;
font-size: 14pt;
}

/* Hyperlinks */
a {text-decoration: none;}
a:link {color: #1772d0;}
a:visited {color: #1772d0;}
a:active {color: red;}
a:hover {color: #f09228;}

/* Pre-formatted Text */
pre {
margin: 5pt 0;
border: 0;
font-size: 12pt;
background: #fcfcfc;
}

/* Project Page Style */
/* Section */
.section {
width: 768pt;
min-height: 100pt;
margin: 15pt auto;
padding: 20pt 30pt;
border: 1pt hidden #000;
text-align: justify;
color: #000000;
background: #ffffff;
}

/* Header (Title and Logo) */
.section .header {
min-height: 80pt;
margin-top: 30pt;
}
.section .header .logo {
width: 80pt;
margin-left: 10pt;
float: left;
}
.section .header .logo img {
width: 80pt;
object-fit: cover;
}
.section .header .title {
margin: 0 60pt;
text-align: center;
font-size: 22pt;
}

/* Author */
.section .author {
margin: 5pt 0;
text-align: center;
font-size: 16pt;
}

/* Institution */
.section .institution {
margin: 5pt 0;
text-align: center;
font-size: 16pt;
}

/* Hyperlink (such as Paper and Code) */
.section .link {
margin: 5pt 0;
text-align: center;
font-size: 16pt;
}

/* Teaser */
.section .teaser {
margin: 20pt 0;
text-align: center;
}
.section .teaser img {
width: 90%;
}

/* Section Title */
.section .title {
text-align: center;
font-size: 22pt;
margin: 5pt 60pt 20pt 0pt; /* top right bottom left */
}

/* Section Body */
.section .body {
margin-bottom: 15pt;
text-align: justify;
font-size: 14pt;
}

/* Rendering Videos */
.container_rendering {
display: flex;
justify-content: space-between;
align-items: center;
}
.rendering_item {
display: flex;
align-items: center;
margin: 0 10px;
}
.rendering_item img {
width: 50%;
height: auto;
margin-right: 0px;
}
.rendering_item video {
width: 50%;
height: auto;
}

/* BibTeX */
.section .bibtex {
margin: 5pt 0;
text-align: left;
font-size: 22pt;
}

/* Related Work */
.section .ref {
margin: 20pt 0 10pt 0; /* top right bottom left */
text-align: left;
font-size: 18pt;
font-weight: bold;
}

/* Citation */
.section .citation {
min-height: 60pt;
margin: 10pt 0;
}
.section .citation .image {
width: 120pt;
float: left;
}
.section .citation .image img {
max-height: 60pt;
width: 120pt;
object-fit: cover;
}
.section .citation .comment{
margin-left: 130pt;
text-align: left;
font-size: 14pt;
}
Binary file added docs/assets/teaser.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading

0 comments on commit 5e9f6ff

Please sign in to comment.