An Empirical Study on the Reliability of Perceiving Correlation Indices using Scatterplots

dc.contributor.authorSher, Varshitaen_US
dc.contributor.authorBemis, Karen G.en_US
dc.contributor.authorLiccardi, Ilariaen_US
dc.contributor.authorChen, Minen_US
dc.contributor.editorHeer, Jeffrey and Ropinski, Timo and van Wijk, Jarkeen_US
dc.date.accessioned2017-06-12T05:22:19Z
dc.date.available2017-06-12T05:22:19Z
dc.date.issued2017
dc.description.abstractScatterplots have been in use for about two centuries, primarily for observing the relationship between two variables and commonly for supporting correlation analysis. In this paper, we report an empirical study that examines how humans' perception of correlation using scatterplots relates to the Pearson's product-moment correlation coefficient (PPMCC) - a commonly used statistical measure of correlation. In particular, we study human participants' estimation of correlation under different conditions, e.g., different PPMCC values, different densities of data points, different levels of symmetry of data enclosures, and different patterns of data distribution. As the participants were instructed to estimate the PPMCC of each stimulus scatterplot, the difference between the estimated and actual PPMCC is referred to as an offset. The results of the study show that varying PPMCC values, symmetry of data enclosure, or data distribution does have an impact on the average offsets, while only large variations in density cause an impact that is statistically significant. This study indicates that humans' perception of correlation using scatterplots does not correlate with computed PPMCC in a consistent manner. The magnitude of offsets may be affected not only by the difference between individuals, but also by geometric features of data enclosures. It suggests that visualizing scatterplots does not provide adequate support to the task of retrieving their corresponding PPMCC indicators, while the underlying model of humans' perception of correlation using scatterplots ought to feature other variables in addition to PPMCC. The paper also includes a theoretical discussion on the cost-benefit of using scatterplots.en_US
dc.description.number3
dc.description.sectionheadersEvaluating Visualization
dc.description.seriesinformationComputer Graphics Forum
dc.description.volume36
dc.identifier.doi10.1111/cgf.13168
dc.identifier.issn1467-8659
dc.identifier.pages061-072
dc.identifier.urihttps://doi.org/10.1111/cgf.13168
dc.identifier.urihttps://diglib.eg.org:443/handle/10.1111/cgf13168
dc.publisherThe Eurographics Association and John Wiley & Sons Ltd.en_US
dc.titleAn Empirical Study on the Reliability of Perceiving Correlation Indices using Scatterplotsen_US
Files
Collections