Latent Diffusion-GAN: Adversarial Learning in the Autoencoded Latent Space
| dc.contributor.author | Jun, U-Chae | |
| dc.contributor.author | Ko, Jaeeun | |
| dc.contributor.author | Kang, Jiwoo | |
| dc.contributor.editor | Masia, Belen | |
| dc.contributor.editor | Thies, Justus | |
| dc.date.accessioned | 2026-04-17T14:05:50Z | |
| dc.date.available | 2026-04-17T14:05:50Z | |
| dc.date.issued | 2026 | |
| dc.description.abstract | Diffusion models are powerful generative frameworks for producing high-quality images by denoising latent variables from random noise. However, training with likelihood-based objectives can lead to oversmoothed high-frequency details such as textures and sharp edges. Adversarial training with GANs enhances sharpness but usually requires additional discriminator networks. We propose Latent Diffusion Generative Adversarial Networks (LD-GAN), a framework that integrates adversarial learning into diffusion models without modifying their pipeline. LD-GAN leverages the pretrained variational autoencoder as an energy-based discriminator, enabling adversarial training without extra parameters while preserving the latent priors learned from large datasets. We also introduce a structural consistency energy aligning encoder and decoder representations, improving perceptual quality. Experiments show improved sample fidelity, sharpness, and diversity across multiple generation tasks while maintaining efficient training dynamics. | |
| dc.description.number | 2 | |
| dc.description.sectionheaders | Diffusion and Beyond: Controlled Image Generation and Stylization | |
| dc.description.seriesinformation | Computer Graphics Forum | |
| dc.description.volume | 45 | |
| dc.identifier.doi | 10.1111/cgf.70409 | |
| dc.identifier.issn | 1467-8659 | |
| dc.identifier.pages | 21 pages | |
| dc.identifier.uri | https://diglib.eg.org/handle/10.1111/cgf70409 | |
| dc.identifier.uri | https://doi.org/10.1111/cgf.70409 | |
| dc.publisher | The Eurographics Association and John Wiley & Sons Ltd. | |
| dc.rights | CC-BY-4.0 | |
| dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | |
| dc.subject | Computer vision | |
| dc.title | Latent Diffusion-GAN: Adversarial Learning in the Autoencoded Latent Space |
Files
Original bundle
1 - 1 of 1