TY - JOUR
T1 - From mixed effects modeling to spike and slab variable selection
T2 - A Bayesian regression model for group testing data
AU - Joyner, Chase N.
AU - McMahan, Christopher S.
AU - Tebbs, Joshua M.
AU - Bilder, Christopher R.
N1 - Funding Information:
The authors are grateful to the editor, the associate editor, and two referees for their helpful comments on earlier versions of this article. We thank Jeffrey Benfer, Dr. Lucy DesJardin, and Kristofer Eveland at the State Hygienic Laboratory (University of Iowa). This work was funded by Grant R01 AI121351 from the National Institutes of Health. Dr. McMahan also acknowledges the support of Grant OIA-1826715 from the National Science Foundation and Grant N00014-19-1-2295 from the Department of Defense's Office of Naval Research.
Funding Information:
The authors are grateful to the editor, the associate editor, and two referees for their helpful comments on earlier versions of this article. We thank Jeffrey Benfer, Dr. Lucy DesJardin, and Kristofer Eveland at the State Hygienic Laboratory (University of Iowa). This work was funded by Grant R01 AI121351 from the National Institutes of Health. Dr. McMahan also acknowledges the support of Grant OIA‐1826715 from the National Science Foundation and Grant N00014‐19‐1‐2295 from the Department of Defense's Office of Naval Research.
Publisher Copyright:
© 2019 The International Biometric Society
PY - 2020/9/1
Y1 - 2020/9/1
N2 - Due to reductions in both time and cost, group testing is a popular alternative to individual-level testing for disease screening. These reductions are obtained by testing pooled biospecimens (eg, blood, urine, swabs, etc.) for the presence of an infectious agent. However, these reductions come at the expense of data complexity, making the task of conducting disease surveillance more tenuous when compared to using individual-level data. This is because an individual's disease status may be obscured by a group testing protocol and the effect of imperfect testing. Furthermore, unlike individual-level testing, a given participant could be involved in multiple testing outcomes and/or may never be tested individually. To circumvent these complexities and to incorporate all available information, we propose a Bayesian generalized linear mixed model that accommodates data arising from any group testing protocol, estimates unknown assay accuracy probabilities and accounts for potential heterogeneity in the covariate effects across population subgroups (eg, clinic sites, etc.); this latter feature is of key interest to practitioners tasked with conducting disease surveillance. To achieve model selection, our proposal uses spike and slab priors for both fixed and random effects. The methodology is illustrated through numerical studies and is applied to chlamydia surveillance data collected in Iowa.
AB - Due to reductions in both time and cost, group testing is a popular alternative to individual-level testing for disease screening. These reductions are obtained by testing pooled biospecimens (eg, blood, urine, swabs, etc.) for the presence of an infectious agent. However, these reductions come at the expense of data complexity, making the task of conducting disease surveillance more tenuous when compared to using individual-level data. This is because an individual's disease status may be obscured by a group testing protocol and the effect of imperfect testing. Furthermore, unlike individual-level testing, a given participant could be involved in multiple testing outcomes and/or may never be tested individually. To circumvent these complexities and to incorporate all available information, we propose a Bayesian generalized linear mixed model that accommodates data arising from any group testing protocol, estimates unknown assay accuracy probabilities and accounts for potential heterogeneity in the covariate effects across population subgroups (eg, clinic sites, etc.); this latter feature is of key interest to practitioners tasked with conducting disease surveillance. To achieve model selection, our proposal uses spike and slab priors for both fixed and random effects. The methodology is illustrated through numerical studies and is applied to chlamydia surveillance data collected in Iowa.
KW - binary regression
KW - generalized linear mixed model
KW - latent variable modeling
KW - pooled testing
KW - random effects
KW - spike and slab prior
UR - http://www.scopus.com/inward/record.url?scp=85076228397&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85076228397&partnerID=8YFLogxK
U2 - 10.1111/biom.13176
DO - 10.1111/biom.13176
M3 - Article
C2 - 31729015
AN - SCOPUS:85076228397
SN - 0006-341X
VL - 76
SP - 913
EP - 923
JO - Biometrics
JF - Biometrics
IS - 3
ER -