question about generated images #115

rickyars · 2022-04-02T18:44:42Z

I've run the same data through two GANs: lightweight-gan and Playform. The results are similar but different. I'm wondering if anyone has suggestions for cleaning up the outputs from the lightweight-gan. they seem to be a bit muddier/noisy. See attachments

.

iScriptLex · 2022-04-02T22:55:02Z

These things are called checkerboard artifacts. They are caused by upsampling data between layers of network. These artifacts are highly visible in the early stages of training, but slowly reduces with every step and becomes barely noticeable after ~80000 iterations.
But if you want to get rid of it completely, even in the early stages, you can do it in several ways:

Use --antialias key. This significantly increases training time, but entirely removes these artifacts. You can make training faster by writing your own smoothing function because the default one is unoptimized and slow.
Change upsampling type to bilinear (this also effectively removes such artifacts and works faster than --antialias method). To do so, just edit the file lightweight_gan.py and in upsample function change this line:
return nn.Upsample(scale_factor = scale_factor)
to this:
return nn.Upsample(scale_factor = scale_factor, mode='bilinear', align_corners=False)
Rewrite the code of the main layers of Generator and Discriminator so that the size of the convolution kernel is a multiple of the scaling factor (i.e. 2).

rickyars · 2022-04-02T23:12:01Z

@iScriptLex What timing. Earlier today I found the --antialias flag. (1) Training takes much, much longer and (2) it does fix my problem! Now I can't use --show-progress because of an issue with some dict. I've opened an issue with the error.

rickyars · 2022-04-02T23:52:18Z

Option 1: The intermediate results from --antialias look amazing, but it's very slow. After 4 hours, it also doesn't look anywhere close to converging, but the result is trippy:

rickyars · 2022-04-03T03:17:04Z

@iScriptLex sorry to bother you again. Looking at the code, the generator seems pretty easy to modify. It's currently at 3, so I'll bump it up to 4. However, the discriminator is is a lot more complicated. Which lines would I focus on if I wanted to modify the "main layers"? Thank you!

lucidrains · 2022-04-03T04:43:42Z

@rickyars let me know how well this works for you! bfd0e8a (separate branch for now)

rickyars · 2022-04-03T04:52:10Z

@lucidrains wow! you are fast! i need to sleep but i'll test it out tomorrow. thank you!

iScriptLex · 2022-04-03T14:26:37Z

FastGAN suppresses these artifacts by using interesting method - noise injection. I assume the same method will work for LightweightGAN.
First, add this code before Generator class:

class NoiseInjection(nn.Module):
    def __init__(self):
        super().__init__()

        self.weight = nn.Parameter(torch.zeros(1), requires_grad=True)

    def forward(self, feat, noise=None):
        if noise is None:
            batch, _, height, width = feat.shape
            noise = torch.randn(batch, 1, height, width).to(feat.device)

        return feat + self.weight * noise

Then, change Generator layer structure from this:

upsample(),
Blur(),
nn.Conv2d(chan_in, chan_out * 2, 3, padding = 1),
norm_class(chan_out * 2),
nn.GLU(dim = 1)

to this (just add NoiseInjection before norm_class):

upsample(),
Blur(),
nn.Conv2d(chan_in, chan_out * 2, 3, padding = 1),
NoiseInjection(),
norm_class(chan_out * 2),
nn.GLU(dim = 1)

rickyars · 2022-04-03T14:32:30Z

Again, @iScriptLex, thank you for the quick reply! Do you ever sleep? I'll give this a try after my run.

lucidrains · 2022-04-03T16:31:30Z

@iScriptLex thank you! 🙏

@rickyars added it for you 30b2e05

lucidrains · 2022-04-03T17:57:41Z

adding noise really helps with texture formation for my dataset (probably not for the smooth drawings of @rickyars ' dataset) i'm using bilinear upsample + noise injection. i knew i should have carried over a few more elements from Stylegan2!

@iScriptLex thank you, anonymous stranger :)

lucidrains · 2022-04-03T18:02:31Z

while we are focused on this repository, i'll improve the efficient attention greatly :) i've grown so much (in terms of my knowledge of attention) since leaving GANs behind

lucidrains · 2022-04-03T19:09:03Z

@rickyars cool, let me know how your experiment goes :) the bilinear + noise works very well for me, mainly because my dataset (flowers) has a lot of texture and noisy green nature stuff in the background, and the network learns to utilize the noise well. your dataset seems to be mostly smooth vector graphics, so you probably won't see the benefit

lucidrains · 2022-04-03T19:38:52Z

lucidrains · 2022-04-04T00:03:59Z

@rickyars what is that?? pacman ghosts or atari space invaders?

rickyars · 2022-04-04T01:12:57Z

@lucidrains I'm trying to make a bitGAN: https://bitgans.com/

rickyars · 2022-04-04T01:54:35Z

@lucidrains Heads up. I saw you pushed some changes to attention, but I'm not getting as good of results with version 0.21.1.

lucidrains · 2022-04-04T02:15:49Z

@rickyars darn, do you want to try 0.21.2? I can always revert it back to the old linear attention if it still doesn't do well

rickyars · 2022-04-04T02:21:45Z

@lucidrains I'll try 0.21.2 and see how it looks. Otherwise, I might go back to 0.20.8.

Two questions for you:

Any reason why I'm suddenly seeing the cutout leak through in early training?
How are the initial generator images created? Right now it looks like it's all black. The other GAN I've been using, has a very colorful first guess and then starts to converge rather quickly. I'm hoping to poke around that section of the code.

lucidrains · 2022-04-04T02:38:49Z

it has been a while since i've done any GAN training, nor am i caught up with literature, so take what i say with a grain of salt

iirc, the augmentations were not supposed to leak if you kept it at a low enough probability during training. however, i have vague memories of someone else complaining about cutout augmentation leaking, so perhaps the paper's conclusions weren't true, which happens. you can customize the augmentations with a command-line flag (remove the cutout augmentation altogether)

the initial black images is likely due to the initialization of the network. i don't think the convergence rate should differ that much. lightweight-gan is among the fastest GAN out there for training.

rickyars · 2022-04-04T02:51:57Z

i wasn't seeing any issue with the cutouts until version 0.21, which is why i was asking if something else changed when you pushed through the new noise injection. i looked at the commits and didn't see anything, so i'm baffled!

i'm running 0.21.2 with no cutout and i'll let you know how it goes. my training set is randomly generated so i can also create more images to feed the GAN.

btw, the reason i'm using lightweight GAN is because of it's speed. as you can see, i don't need a lot of detail in my outputs and i'm actually more interested in watching the GAN learn and quick convergence.

lucidrains · 2022-04-04T03:08:36Z

@rickyars i think the cutout leaking happens on and off, regardless of version

yes, watching GANs learn right in front of you is a magical experience :) ok keep me posted about your results. my flowers seem to be blooming inside the machine quite well, but i'll let it train until 10k steps to make sure the new attention didn't break anything

rickyars · 2022-04-04T03:10:39Z

@lucidrains Thanks again for the help today.

Btw, I love this line: "my flowers seem to be blooming inside the machine quite well"

rickyars · 2022-04-04T12:23:58Z

@lucidrains my Google colab instance shut down last night. I restarted this morning and I'm getting very, very different results today. Normally I would be getting something that looks like my end result by this iteration. Now I'm getting:

Did anti-alias somehow get turned on by default? This is the exact same code from yesterday just a fresh pip install lightweight-gan

rickyars · 2022-04-04T13:01:38Z

This is with !pip install lightweight-gan==0.20.8 at the same iteration:

lucidrains · 2022-04-04T15:16:42Z

Ohh I turned on bilinear upsampling by default, which would produce what you are seeing, at least, early in training

rickyars · 2022-04-04T16:10:18Z

Wow. I'm an idiot. I was making code changes but not actually running that code. I will delete my test runs from the thread so as not to confuse other people. Sorry!

rickyars · 2022-04-04T16:36:20Z

@lucidrains i feel like such an idiot. i've gone back to 0.21.0 to test just the noise inject by turning off bilinear filters (i think i know how to do this now). do you know if 0.21.2 has any runtime impacts? i don't know why but my iterations seem to be taking longer.

lucidrains · 2022-04-04T16:57:54Z

@rickyars yea, there is a slight increase in run-time, mainly due to the attention modifications, and i may keep it (you can always turn it off by setting --attn-res-layers [])

rickyars · 2022-04-04T18:31:26Z

@lucidrains now that i know what i'm doing, i started a test with the power of two convolutional window. it's looking pretty good! i'll probably add the noise and use that for my final set of run. will post a pic in a bit.

rickyars · 2022-04-04T19:06:47Z

@lucidrains i'm actually really happy with these results so far:

lucidrains · 2022-04-04T19:09:33Z

@rickyars very nice! maybe i'll add the power of 2 kernels :) it'll be slightly slower yet again, but GPUs keep getting more powerful by the year, so who cares :)

lucidrains · 2022-04-04T19:09:58Z

@rickyars you turned off the bilinear upsampling?

rickyars · 2022-04-04T19:15:41Z

@rickyars you turned off the bilinear upsampling?

Yes. This output was generated on bfd0e8a.

Bilinear is off. It seems to to really hurt my use case.

lucidrains · 2022-04-04T19:20:43Z

@rickyars ok, ill turn it off for now :)

rickyars · 2022-04-04T19:26:39Z

@lucidrains thank you!

lucidrains closed this as completed Apr 4, 2022

lucidrains reopened this Apr 4, 2022

question about generated images #115

question about generated images #115

Comments

rickyars commented Apr 2, 2022 • edited Loading

iScriptLex commented Apr 2, 2022

rickyars commented Apr 2, 2022 • edited Loading

rickyars commented Apr 2, 2022 • edited Loading

rickyars commented Apr 3, 2022

lucidrains commented Apr 3, 2022 • edited Loading

rickyars commented Apr 3, 2022

iScriptLex commented Apr 3, 2022

rickyars commented Apr 3, 2022

lucidrains commented Apr 3, 2022

lucidrains commented Apr 3, 2022

lucidrains commented Apr 3, 2022

lucidrains commented Apr 3, 2022

lucidrains commented Apr 3, 2022

lucidrains commented Apr 4, 2022 • edited Loading

rickyars commented Apr 4, 2022

rickyars commented Apr 4, 2022

lucidrains commented Apr 4, 2022

rickyars commented Apr 4, 2022 • edited Loading

lucidrains commented Apr 4, 2022

rickyars commented Apr 4, 2022 • edited Loading

lucidrains commented Apr 4, 2022

rickyars commented Apr 4, 2022

rickyars commented Apr 4, 2022

rickyars commented Apr 4, 2022

lucidrains commented Apr 4, 2022

rickyars commented Apr 4, 2022 • edited Loading

rickyars commented Apr 4, 2022

lucidrains commented Apr 4, 2022

rickyars commented Apr 4, 2022

rickyars commented Apr 4, 2022

lucidrains commented Apr 4, 2022

lucidrains commented Apr 4, 2022

rickyars commented Apr 4, 2022 • edited Loading

lucidrains commented Apr 4, 2022

rickyars commented Apr 4, 2022

rickyars commented Apr 2, 2022 •

edited

Loading

rickyars commented Apr 2, 2022 •

edited

Loading

rickyars commented Apr 2, 2022 •

edited

Loading

lucidrains commented Apr 3, 2022 •

edited

Loading

lucidrains commented Apr 4, 2022 •

edited

Loading

rickyars commented Apr 4, 2022 •

edited

Loading

rickyars commented Apr 4, 2022 •

edited

Loading

rickyars commented Apr 4, 2022 •

edited

Loading

rickyars commented Apr 4, 2022 •

edited

Loading