Assessing Effects of Task and Data Distribution on the Effectiveness of Visual Encodings

Authors: Kim, Younghoon; Heer, Jeffrey
Editors: Jeffrey Heer, Heike Leitte, and Timo Ropinski
Date: 2018-06-02
Year: 2018
ISSN: 1467-8659
DOI: https://doi.org/10.1111/cgf.13409
Handle: https://diglib.eg.org:443/handle/10.1111/cgf13409
Pages: 157-167
Classification: H.5.2 [Information Interfaces]: User Interfaces; Evaluation

Abstract: In addition to the choice of visual encodings, the effectiveness of a data visualization may vary with the analytical task being performed and the distribution of data values. To better assess these effects and create refined rankings of visual encodings, we conduct an experiment measuring subject performance across task types (e.g., comparing individual versus aggregate values) and data distributions (e.g., with varied cardinalities and entropies). We compare performance across 12 encoding specifications of trivariate data involving 1 categorical and 2 quantitative fields, including the use of x, y, color, size, and spatial subdivision (i.e., faceting). Our results extend existing models of encoding effectiveness and suggest improved approaches for automated design. For example, we find that colored scatterplots (with positionally-coded quantities and color-coded categories) perform well for comparing individual points, but perform poorly for summary tasks as the number of categories increases.