Latent Diffusion-GAN: Adversarial Learning in the Autoencoded Latent Space

Loading...
Thumbnail Image
Date
2026
Journal Title
Journal ISSN
Volume Title
Publisher
The Eurographics Association and John Wiley & Sons Ltd.
Abstract
Diffusion models are powerful generative frameworks for producing high-quality images by denoising latent variables from random noise. However, training with likelihood-based objectives can lead to oversmoothed high-frequency details such as textures and sharp edges. Adversarial training with GANs enhances sharpness but usually requires additional discriminator networks. We propose Latent Diffusion Generative Adversarial Networks (LD-GAN), a framework that integrates adversarial learning into diffusion models without modifying their pipeline. LD-GAN leverages the pretrained variational autoencoder as an energy-based discriminator, enabling adversarial training without extra parameters while preserving the latent priors learned from large datasets. We also introduce a structural consistency energy aligning encoder and decoder representations, improving perceptual quality. Experiments show improved sample fidelity, sharpness, and diversity across multiple generation tasks while maintaining efficient training dynamics.
Description

        
@article{
10.1111:cgf.70409
, journal = {Computer Graphics Forum}, title = {{
Latent Diffusion-GAN: Adversarial Learning in the Autoencoded Latent Space
}}, author = {
Jun, U-Chae
and
Ko, Jaeeun
and
Kang, Jiwoo
}, year = {
2026
}, publisher = {
The Eurographics Association and John Wiley & Sons Ltd.
}, ISSN = {
1467-8659
}, DOI = {
10.1111/cgf.70409
} }
Citation