WYTIWYR: A User Intent-Aware Framework with Multi-modal Inputs for Visualization Retrieval

dc.contributor.authorXiao, Shishien_US
dc.contributor.authorHou, Yihanen_US
dc.contributor.authorJin, Chengen_US
dc.contributor.authorZeng, Weien_US
dc.contributor.editorBujack, Roxanaen_US
dc.contributor.editorArchambault, Danielen_US
dc.contributor.editorSchreck, Tobiasen_US
dc.date.accessioned2023-06-10T06:17:02Z
dc.date.available2023-06-10T06:17:02Z
dc.date.issued2023
dc.description.abstractRetrieving charts from a large corpus is a fundamental task that can benefit numerous applications such as visualization recommendations. The retrieved results are expected to conform to both explicit visual attributes (e.g., chart type, colormap) and implicit user intents (e.g., design style, context information) that vary upon application scenarios. However, existing examplebased chart retrieval methods are built upon non-decoupled and low-level visual features that are hard to interpret, while definition-based ones are constrained to pre-defined attributes that are hard to extend. In this work, we propose a new framework, namely WYTIWYR (What-You-Think-Is-What-You-Retrieve), that integrates user intents into the chart retrieval process. The framework consists of two stages: first, the Annotation stage disentangles the visual attributes within the query chart; and second, the Retrieval stage embeds the user's intent with customized text prompt as well as bitmap query chart, to recall targeted retrieval result. We develop a prototype WYTIWYR system leveraging a contrastive language-image pre-training (CLIP) model to achieve zero-shot classification as well as multi-modal input encoding, and test the prototype on a large corpus with charts crawled from the Internet. Quantitative experiments, case studies, and qualitative interviews are conducted. The results demonstrate the usability and effectiveness of our proposed framework.en_US
dc.description.number3
dc.description.sectionheadersInteraction and Accessibility
dc.description.seriesinformationComputer Graphics Forum
dc.description.volume42
dc.identifier.doi10.1111/cgf.14832
dc.identifier.issn1467-8659
dc.identifier.pages311-322
dc.identifier.pages12 pages
dc.identifier.urihttps://doi.org/10.1111/cgf.14832
dc.identifier.urihttps://diglib.eg.org:443/handle/10.1111/cgf14832
dc.publisherThe Eurographics Association and John Wiley & Sons Ltd.en_US
dc.subjectCCS Concepts: Human-centered computing -> Visualization; Information systems -> Query intent; Computing methodologies -> Artificial intelligence
dc.subjectHuman centered computing
dc.subjectVisualization
dc.subjectInformation systems
dc.subjectQuery intent
dc.subjectComputing methodologies
dc.subjectArtificial intelligence
dc.titleWYTIWYR: A User Intent-Aware Framework with Multi-modal Inputs for Visualization Retrievalen_US
Files
Original bundle
Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
v42i3pp311-322_cgf14832.pdf
Size:
2.8 MB
Format:
Adobe Portable Document Format
Loading...
Thumbnail Image
Name:
1068-file-i7.pdf
Size:
1.79 MB
Format:
Adobe Portable Document Format
Collections