A 64x64 pre-trained diffusion model is all you need for 1-step high-resolution SOTA generation
NeurIPS24
Unified framework enables diverse samplers and 1-step generation SOTAs
ICLR24
Applications:
[SoundGen]
![](/sony/creativeai/raw/main/assets/guitarampmodeling.png)
Improving Unsupervised Clean-to-Rendered Guitar Tone Transformation Using GANs and Integrated Unaligned Clean Data
DAFx24
![](/sony/creativeai/raw/main/assets/amt.png)
DiffRoll: Diffusion-based Generative Music Transcription with Unsupervised Pretraining Capability
ICASSP23
![](/sony/creativeai/raw/main/assets/STARSS23.png)
STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events
NeurIPS23
### Contact