Hybrid Sparse Transformer and Feature Alignment for Efficient Image Completion

dc.contributor.authorChen, L.en_US
dc.contributor.authorSun, Haoen_US
dc.contributor.editorChristie, Marcen_US
dc.contributor.editorPietroni, Nicoen_US
dc.contributor.editorWang, Yu-Shuenen_US
dc.date.accessioned2025-10-07T05:02:43Z
dc.date.available2025-10-07T05:02:43Z
dc.date.issued2025
dc.description.abstractIn this paper, we propose an efficient single-stage hybrid architecture for image completion. Existing transformer-based image completion methods often struggle with accurate content restoration, largely due to their ineffective modeling of corrupted channel information and the attention noise introduced by softmax-based mechanisms, which results in blurry textures and distorted structures. Additionally, these methods frequently fail to maintain texture consistency, either relying on imprecise mask sampling or incurring substantial computational costs from complex similarity calculations. To address these limitations, we present two key contributions: a Hybrid Sparse Self-Attention (HSA) module and a Feature Alignment Module (FAM). The HSA module enhances structural recovery by decoupling spatial and channel attention with sparse activation, while the FAM enforces texture consistency by aligning encoder and decoder features via a mask-free, energy-gated mechanism without additional inference cost. Our method achieves state-of-the-art image completion results with the fastest inference speed among single-stage networks, as measured by PSNR, SSIM, FID, and LPIPS on CelebA-HQ, Places2, and Paris datasets.en_US
dc.description.number7
dc.description.sectionheadersImage Creation & Augmentation
dc.description.seriesinformationComputer Graphics Forum
dc.description.volume44
dc.identifier.doi10.1111/cgf.70255
dc.identifier.issn1467-8659
dc.identifier.pages10 pages
dc.identifier.urihttps://doi.org/10.1111/cgf.70255
dc.identifier.urihttps://diglib.eg.org/handle/10.1111/cgf70255
dc.publisherThe Eurographics Association and John Wiley & Sons Ltd.en_US
dc.subjectCCS Concepts: Computing methodologies → Image processing; Computer vision tasks; Image Completion; Machine learning; Neural networks
dc.subjectComputing methodologies → Image processing
dc.subjectComputer vision tasks
dc.subjectImage Completion
dc.subjectMachine learning
dc.subjectNeural networks
dc.titleHybrid Sparse Transformer and Feature Alignment for Efficient Image Completionen_US
Files
Original bundle
Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
cgf70255.pdf
Size:
16.69 MB
Format:
Adobe Portable Document Format
Loading...
Thumbnail Image
Name:
paper1288_mm.pdf
Size:
25.89 MB
Format:
Adobe Portable Document Format
Collections