🏞️ Seraena

What is Seraena?

Seraena is 🚧 WIP 🚧 PyTorch code for stably training mode-dropping deterministic latent autoencoders like TAESD using only conditional adversarial loss (without LPIPS/L1 or pretraining).

What can you do with the Seraena code?

This repo includes an example TAESDXL training notebook which trains a lightweight single-step decoder for the SDXL VAE using Seraena. It also trains a simple (MSE-distilled) encoder for completeness.

If you find any other interesting uses for the Seraena code / models, LMK and I can link them here.

Are there any pretrained Seraena model checkpoints available?

Yes.

How does Seraena work?

It's basically the usual PatchGAN discriminator + rescaled gradient setup (just with a replay buffer on generated samples). See the code.

Why is Seraena marked 🚧 WIP 🚧 ?

Although Seraena is quite simple, there are still several YOLO'd hyperparameters and design choices present in the Seraena code (learning rates, batch and replay buffer size, discriminator architecture). I haven't done any serious benchmarking, ablations, or tuning of these choices. I also haven't verified if Seraena can match the full performance of released TAESD or SD-VAE.

If you want a serious, battle-tested autoencoder training repo I recommend looking at the Stability or MosaicML codebases.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
checkpoints		checkpoints
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
TAESDXL_Training_Example.ipynb		TAESDXL_Training_Example.ipynb
screenshot.png		screenshot.png
seraena.py		seraena.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🏞️ Seraena

What is Seraena?

What can you do with the Seraena code?

Are there any pretrained Seraena model checkpoints available?

How does Seraena work?

Why is Seraena marked 🚧 WIP 🚧 ?

About

Languages

License

madebyollin/seraena

Folders and files

Latest commit

History

Repository files navigation

🏞️ Seraena

What is Seraena?

What can you do with the Seraena code?

Are there any pretrained Seraena model checkpoints available?

How does Seraena work?

Why is Seraena marked 🚧 WIP 🚧 ?

About

Resources

License

Stars

Watchers

Forks

Languages