Inverse sampling regression for pooled data

Osval A. Montesinos-López, Abelardo Montesinos-López, Kent Eskridge, José Crossa

Research output: Contribution to journalArticle

1 Citation (Scopus)

Abstract

Because pools are tested instead of individuals in group testing, this technique is helpful for estimating prevalence in a population or for classifying a large number of individuals into two groups at a low cost. For this reason, group testing is a well-known means of saving costs and producing precise estimates. In this paper, we developed a mixed-effect group testing regression that is useful when the data-collecting process is performed using inverse sampling. This model allows including covariate information at the individual level to incorporate heterogeneity among individuals and identify which covariates are associated with positive individuals. We present an approach to fit this model using maximum likelihood and we performed a simulation study to evaluate the quality of the estimates. Based on the simulation study, we found that the proposed regression method for inverse sampling with group testing produces parameter estimates with low bias when the pre-specified number of positive pools (r) to stop the sampling process is at least 10 and the number of clusters in the sample is also at least 10. We performed an application with real data and we provide an NLMIXED code that researchers can use to implement this method.

Original languageEnglish (US)
Pages (from-to)1093-1109
Number of pages17
JournalStatistical Methods in Medical Research
Volume26
Issue number3
DOIs
StatePublished - Jun 1 2017

Fingerprint

Inverse Sampling
Group Testing
Regression
Costs and Cost Analysis
Covariates
Research Personnel
Simulation Study
Estimate
Mixed Effects
Number of Clusters
Population
Maximum Likelihood
Evaluate
Costs
Model

Keywords

  • Group testing
  • classification
  • inverse sampling
  • precision
  • prevalence

ASJC Scopus subject areas

  • Epidemiology
  • Statistics and Probability
  • Health Information Management

Cite this

Montesinos-López, O. A., Montesinos-López, A., Eskridge, K., & Crossa, J. (2017). Inverse sampling regression for pooled data. Statistical Methods in Medical Research, 26(3), 1093-1109. https://doi.org/10.1177/0962280214568047

Inverse sampling regression for pooled data. / Montesinos-López, Osval A.; Montesinos-López, Abelardo; Eskridge, Kent; Crossa, José.

In: Statistical Methods in Medical Research, Vol. 26, No. 3, 01.06.2017, p. 1093-1109.

Research output: Contribution to journalArticle

Montesinos-López, OA, Montesinos-López, A, Eskridge, K & Crossa, J 2017, 'Inverse sampling regression for pooled data', Statistical Methods in Medical Research, vol. 26, no. 3, pp. 1093-1109. https://doi.org/10.1177/0962280214568047
Montesinos-López, Osval A. ; Montesinos-López, Abelardo ; Eskridge, Kent ; Crossa, José. / Inverse sampling regression for pooled data. In: Statistical Methods in Medical Research. 2017 ; Vol. 26, No. 3. pp. 1093-1109.
@article{2f99161266cc48b8a59b3ffc892df5b0,
title = "Inverse sampling regression for pooled data",
abstract = "Because pools are tested instead of individuals in group testing, this technique is helpful for estimating prevalence in a population or for classifying a large number of individuals into two groups at a low cost. For this reason, group testing is a well-known means of saving costs and producing precise estimates. In this paper, we developed a mixed-effect group testing regression that is useful when the data-collecting process is performed using inverse sampling. This model allows including covariate information at the individual level to incorporate heterogeneity among individuals and identify which covariates are associated with positive individuals. We present an approach to fit this model using maximum likelihood and we performed a simulation study to evaluate the quality of the estimates. Based on the simulation study, we found that the proposed regression method for inverse sampling with group testing produces parameter estimates with low bias when the pre-specified number of positive pools (r) to stop the sampling process is at least 10 and the number of clusters in the sample is also at least 10. We performed an application with real data and we provide an NLMIXED code that researchers can use to implement this method.",
keywords = "Group testing, classification, inverse sampling, precision, prevalence",
author = "Montesinos-L{\'o}pez, {Osval A.} and Abelardo Montesinos-L{\'o}pez and Kent Eskridge and Jos{\'e} Crossa",
year = "2017",
month = "6",
day = "1",
doi = "10.1177/0962280214568047",
language = "English (US)",
volume = "26",
pages = "1093--1109",
journal = "Statistical Methods in Medical Research",
issn = "0962-2802",
publisher = "SAGE Publications Ltd",
number = "3",

}

TY - JOUR

T1 - Inverse sampling regression for pooled data

AU - Montesinos-López, Osval A.

AU - Montesinos-López, Abelardo

AU - Eskridge, Kent

AU - Crossa, José

PY - 2017/6/1

Y1 - 2017/6/1

N2 - Because pools are tested instead of individuals in group testing, this technique is helpful for estimating prevalence in a population or for classifying a large number of individuals into two groups at a low cost. For this reason, group testing is a well-known means of saving costs and producing precise estimates. In this paper, we developed a mixed-effect group testing regression that is useful when the data-collecting process is performed using inverse sampling. This model allows including covariate information at the individual level to incorporate heterogeneity among individuals and identify which covariates are associated with positive individuals. We present an approach to fit this model using maximum likelihood and we performed a simulation study to evaluate the quality of the estimates. Based on the simulation study, we found that the proposed regression method for inverse sampling with group testing produces parameter estimates with low bias when the pre-specified number of positive pools (r) to stop the sampling process is at least 10 and the number of clusters in the sample is also at least 10. We performed an application with real data and we provide an NLMIXED code that researchers can use to implement this method.

AB - Because pools are tested instead of individuals in group testing, this technique is helpful for estimating prevalence in a population or for classifying a large number of individuals into two groups at a low cost. For this reason, group testing is a well-known means of saving costs and producing precise estimates. In this paper, we developed a mixed-effect group testing regression that is useful when the data-collecting process is performed using inverse sampling. This model allows including covariate information at the individual level to incorporate heterogeneity among individuals and identify which covariates are associated with positive individuals. We present an approach to fit this model using maximum likelihood and we performed a simulation study to evaluate the quality of the estimates. Based on the simulation study, we found that the proposed regression method for inverse sampling with group testing produces parameter estimates with low bias when the pre-specified number of positive pools (r) to stop the sampling process is at least 10 and the number of clusters in the sample is also at least 10. We performed an application with real data and we provide an NLMIXED code that researchers can use to implement this method.

KW - Group testing

KW - classification

KW - inverse sampling

KW - precision

KW - prevalence

UR - http://www.scopus.com/inward/record.url?scp=85020728400&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85020728400&partnerID=8YFLogxK

U2 - 10.1177/0962280214568047

DO - 10.1177/0962280214568047

M3 - Article

C2 - 25601742

AN - SCOPUS:85020728400

VL - 26

SP - 1093

EP - 1109

JO - Statistical Methods in Medical Research

JF - Statistical Methods in Medical Research

SN - 0962-2802

IS - 3

ER -