TY - JOUR
T1 - Validation of visual statistical inference, applied to linear models
AU - Majumder, Mahbubul
AU - Hofmann, Heike
AU - Cook, Dianne
N1 - Funding Information:
Mahbubul Majumder is PhD student (E-mail: mahbub72@gmail.com), Heike Hofmann (E-mail: hofmann@iastate.edu) is Associate Professor, and Dianne Cook (E-mail: dicook@iastate.edu) is Professor in the Department of Statistics and Statistical Laboratory, Iowa State University, Ames, IA 50011-1210. This research is supported in part by the National Science Foundation Grant # DMS 1007697.
PY - 2013
Y1 - 2013
N2 - Statistical graphics play a crucial role in exploratory data analysis, model checking, and diagnosis. The lineup protocol enables statistical significance testing of visual findings, bridging the gulf between exploratory and inferential statistics. In this article, inferential methods for statistical graphics are developed further by refining the terminology of visual inference and framing the lineup protocol in a context that allows direct comparison with conventional tests in scenarios when a conventional test exists. This framework is used to compare the performance of the lineup protocol against conventional statistical testing in the scenario of fitting linear models.Ahuman subjects experiment is conducted using simulated data to provide controlled conditions. Results suggest that the lineup protocol performs comparably with the conventional tests, and expectedly outperforms them when data are contaminated, a scenario where assumptions required for performing a conventional test are violated. Surprisingly, visual tests have higher power than the conventional tests when the effect size is large. And, interestingly, there may be some super-visual individuals who yield better performance and power than the conventional test even in the most difficult tasks. Supplementary materials for this article are available online.
AB - Statistical graphics play a crucial role in exploratory data analysis, model checking, and diagnosis. The lineup protocol enables statistical significance testing of visual findings, bridging the gulf between exploratory and inferential statistics. In this article, inferential methods for statistical graphics are developed further by refining the terminology of visual inference and framing the lineup protocol in a context that allows direct comparison with conventional tests in scenarios when a conventional test exists. This framework is used to compare the performance of the lineup protocol against conventional statistical testing in the scenario of fitting linear models.Ahuman subjects experiment is conducted using simulated data to provide controlled conditions. Results suggest that the lineup protocol performs comparably with the conventional tests, and expectedly outperforms them when data are contaminated, a scenario where assumptions required for performing a conventional test are violated. Surprisingly, visual tests have higher power than the conventional tests when the effect size is large. And, interestingly, there may be some super-visual individuals who yield better performance and power than the conventional test even in the most difficult tasks. Supplementary materials for this article are available online.
KW - Data mining
KW - Effect size
KW - Exploratory data analysis
KW - Lineup
KW - Nonparametric test
KW - Practical significance
KW - Statistical graphics
KW - Visualization
UR - http://www.scopus.com/inward/record.url?scp=84890085949&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84890085949&partnerID=8YFLogxK
U2 - 10.1080/01621459.2013.808157
DO - 10.1080/01621459.2013.808157
M3 - Article
AN - SCOPUS:84890085949
SN - 0162-1459
VL - 108
SP - 942
EP - 956
JO - Journal of the American Statistical Association
JF - Journal of the American Statistical Association
IS - 503
ER -