CharGen: Fast and Fluent Portrait Modification

Loading...
Thumbnail Image
Date
2025
Journal Title
Journal ISSN
Volume Title
Publisher
The Eurographics Association
Abstract
Interactive editing of character images with diffusion models remains challenging due to the inherent trade-off between fine-grained control, generation speed, and visual fidelity. We introduce CharGen, a character-focused editor that combines attribute-specific Concept Sliders, trained to isolate and manipulate attributes such as facial feature size, expression, and decoration with the StreamDiffusion sampling pipeline for more interactive performance. To counteract the loss of detail that often accompanies accelerated sampling, we propose a lightweight Repair Step that reinstates fine textures without compromising structural consistency. Throughout extensive ablation studies and in comparison to open-source InstructPix2Pix and closedsource Google Gemini, and a comprehensive user study, CharGen achieves two-to-four-fold faster edit turnaround with precise editing control and identity-consistent results. Project page: https://chargen.jdihlmann.com/
Description

CCS Concepts: Computing methodologies → Image processing; Computer vision; Applied computing → Arts and humanities

        
@inproceedings{
10.2312:vmv.20251241
, booktitle = {
Vision, Modeling, and Visualization
}, editor = {
Egger, Bernhard
and
Günther, Tobias
}, title = {{
CharGen: Fast and Fluent Portrait Modification
}}, author = {
Dihlmann, Jan-Niklas
and
Killguss, Arnela
and
Lensch, Hendrik
}, year = {
2025
}, publisher = {
The Eurographics Association
}, ISBN = {
978-3-03868-294-3
}, DOI = {
10.2312/vmv.20251241
} }
Citation
Collections