Modeling multiple-response categorical data from complex surveys

Christopher R. Bilder, Thomas M. Loughin

Research output: Contribution to journalArticle

4 Scopus citations

Abstract

Although choose all that apply questions are common in modern surveys, methods for analyzing associations among responses to such questions have only recently been developed. These methods are generally valid only for simple random sampling, but these types of questions often appear in surveys conducted under more complex sampling plans. The purpose of this article is to provide statistical analysis methods that can be applied to choose all that apply questions in complex survey sampling situations. Loglinear models are developed to incorporate the multiple responses inherent in these types of questions. Statistics to compare models and to measure association are proposed and their asymptotic distributions are derived. Monte Carlo simulations show that tests based on adjusted Pearson statistics generally hold their correct size when comparing models. These simulations also show that confidence intervals for odds ratios estimated from loglinear models have good coverage properties, while being shorter than those constructed using empirical estimates. Furthermore, the methods are shown to be applicable to more general problems of modeling associations between elements of two or more binary vectors. The proposed analysis methods are applied to data from the National Health and Nutrition Examination Survey.

Original languageEnglish (US)
Pages (from-to)553-570
Number of pages18
JournalCanadian Journal of Statistics
Volume37
Issue number4
DOIs
Publication statusPublished - Dec 1 2009

    Fingerprint

Keywords

  • Choose all that apply
  • Correlated binary data
  • Loglinear model
  • NHANES
  • Pearson statistic
  • Pick any/c
  • Rao-scott adjustments

ASJC Scopus subject areas

  • Statistics and Probability
  • Statistics, Probability and Uncertainty

Cite this