Artist-Inator: Text-based, Gloss-aware Non-photorealistic Stylization

No Thumbnail Available
Date
2025
Journal Title
Journal ISSN
Volume Title
Publisher
The Eurographics Association and John Wiley & Sons Ltd.
Abstract
Large diffusion models have made a remarkable leap synthesizing high-quality artistic images from text descriptions. However, these powerful pre-trained models still lack control to guide key material appearance properties, such as gloss. In this work, we present a threefold contribution: (1) we analyze how gloss is perceived across different artistic styles (i.e., oil painting, watercolor, ink pen, charcoal, and soft crayon); (2) we leverage our findings to create a dataset with 1,336,272 stylized images of many different geometries in all five styles, including automatically-computed text descriptions of their appearance (e.g., ''A glossy bunny hand painted with an orange soft crayon''); and (3) we train ControlNet to condition Stable Diffusion XL synthesizing novel painterly depictions of new objects, using simple inputs such as edge maps, hand-drawn sketches, or clip arts. Compared to previous approaches, our framework yields more accurate results despite the simplified input, as we show both quantitative and qualitatively.
Description

CCS Concepts: Computing methodologies → Non-photorealistic rendering; Image processing; Perception

        
@article{
10.1111:cgf.70182
, journal = {Computer Graphics Forum}, title = {{
Artist-Inator: Text-based, Gloss-aware Non-photorealistic Stylization
}}, author = {
Subias, Jose Daniel
and
Daniel-Soriano, Saúl
and
Gutierrez, Diego
and
Serrano, Ana
}, year = {
2025
}, publisher = {
The Eurographics Association and John Wiley & Sons Ltd.
}, ISSN = {
1467-8659
}, DOI = {
10.1111/cgf.70182
} }
Citation
Collections