Testing for Marginal Independence between Two Categorical Variables with Multiple Responses

Christopher R. Bilder, Thomas M. Loughin

Research output: Contribution to journalReview articlepeer-review

29 Scopus citations

Abstract

Questions that ask respondents to "choose all that apply" from a set of items occur frequently in surveys. Categorical variables that summarize this type of survey data are called both pick any/c variables and multiple-response categorical variables. It is often of interest to test for independence between two categorical variables. When both categorical variables can have multiple responses, traditional Pearson chi-square tests for independence should not be used because of the within-subject dependence among responses. An intuitively constructed version of the Pearson statistic is proposed to perform the test using bootstrap procedures to approximate its sampling distribution. First- and second-order adjustments to the proposed statistic are given in order to use a chi-square distribution approximation. A Bonferroni adjustment is proposed to perform the test when the joint set of responses for individual subjects is unavailable. Simulations show that the bootstrap procedures hold the correct size more consistently than the other procedures.

Original languageEnglish (US)
Pages (from-to)241-248
Number of pages8
JournalBiometrics
Volume60
Issue number1
DOIs
StatePublished - Mar 2004
Externally publishedYes

Keywords

  • Bootstrap
  • Correlated binary data
  • Pearson statistic
  • Pick any/c
  • Simultaneous pairwise marginal independence

ASJC Scopus subject areas

  • Statistics and Probability
  • Biochemistry, Genetics and Molecular Biology(all)
  • Immunology and Microbiology(all)
  • Agricultural and Biological Sciences(all)
  • Applied Mathematics

Fingerprint

Dive into the research topics of 'Testing for Marginal Independence between Two Categorical Variables with Multiple Responses'. Together they form a unique fingerprint.

Cite this