TextDNA: Visualizing Word Usage with Configurable Colorfields

dc.contributor.authorSzafir, Danielle Albersen_US
dc.contributor.authorStuffer, Deidreen_US
dc.contributor.authorSohail, Yusefen_US
dc.contributor.authorGleicher, Michaelen_US
dc.contributor.editorKwan-Liu Ma and Giuseppe Santucci and Jarke van Wijken_US
dc.date.accessioned2016-06-09T09:33:03Z
dc.date.available2016-06-09T09:33:03Z
dc.date.issued2016en_US
dc.description.abstractPatterns of words used in different text collections can characterize interesting properties of a corpus. However, these patterns are challenging to explore as they often involve complex relationships across many words and collections in a large space of words. In this paper, we propose a configurable colorfield design to aid this exploration. Our approach uses a dense colorfield overview to present large amounts of data in ways that make patterns perceptible. It allows flexible configuration of both data mappings and aggregations to expose different kinds of patterns, and provides interactions to help connect detailed patterns to the corpus overview. TextDNA, our prototype implementation, leverages the GPU to provide interactivity in the web browser even on large corpora. We present five case studies showing how the tool supports inquiry in corpora ranging in size from single document to millions of books. Our work shows how to make a configurable colorfield approach practical for a range of analytic tasks.en_US
dc.description.number3en_US
dc.description.sectionheadersText and Document Dataen_US
dc.description.seriesinformationComputer Graphics Forumen_US
dc.description.volume35en_US
dc.identifier.doi10.1111/cgf.12918en_US
dc.identifier.issn1467-8659en_US
dc.identifier.pages421-430en_US
dc.identifier.urihttps://doi.org/10.1111/cgf.12918en_US
dc.identifier.urihttps://diglib.eg.org/handle/10.1111/cgf12918
dc.publisherThe Eurographics Association and John Wiley & Sons Ltd.en_US
dc.subjectH.5.2 [Information Interfaces and Presentation]en_US
dc.subjectUser Interfacesen_US
dc.subjectGraphical User Interfacesen_US
dc.subjectI.7.m [Document and Text Processing]en_US
dc.subjectMicellaneousen_US
dc.subjectText Analysisen_US
dc.subjectJ.5 [Computer Applications]en_US
dc.subjectArts and Humanitiesen_US
dc.subjectLiteratureen_US
dc.titleTextDNA: Visualizing Word Usage with Configurable Colorfieldsen_US
Files