Using a Monotonic Density Ratio Model to Find the Asymptotically Optimal Combination of Multiple Diagnostic Tests

Baojiang Chen, Pengfei Li, Jing Qin, Tao Yu

Research output: Contribution to journalArticle

2 Citations (Scopus)

Abstract

With the advent of new technology, new biomarker studies have become essential in cancer research. To achieve optimal sensitivity and specificity, one needs to combine different diagnostic tests. The celebrated Neyman–Pearson lemma enables us to use the density ratio to optimally combine different diagnostic tests. In this article, we propose a semiparametric model by directly modeling the density ratio between the diseased and nondiseased population as an unspecified monotonic nondecreasing function of a linear or nonlinear combination of multiple diagnostic tests. This method is appealing in that it is not necessary to assume separate models for the diseased and nondiseased populations. Further, the proposed method provides an asymptotically optimal way to combine multiple test results. We use a pool-adjacent-violation-algorithm to find the semiparametric maximum likelihood estimate of the receiver operating characteristic (ROC) curve. Using modern empirical process theory we show cubic root n consistency for the ROC curve and the underlying Euclidean parameter estimation. Extensive simulations show that the proposed method outperforms its competitors. We apply the method to two real-data applications. Supplementary materials for this article are available online.

Original languageEnglish (US)
Pages (from-to)861-874
Number of pages14
JournalJournal of the American Statistical Association
Volume111
Issue number514
DOIs
StatePublished - Apr 2 2016

Fingerprint

Multiple Tests
Diagnostic Tests
Asymptotically Optimal
Monotonic
Receiver Operating Characteristic Curve
Monotonic Function
Empirical Process
Semiparametric Model
Biomarkers
Maximum Likelihood Estimate
Model
Specificity
Parameter Estimation
Lemma
Euclidean
Cancer
Adjacent
Roots
Necessary
Diagnostic tests

Keywords

  • Optimal combination of biomarkers
  • Pool-adjacent-violation-algorithm
  • ROC curve
  • Semiparametric likelihood
  • Sensitivity
  • Specificity

ASJC Scopus subject areas

  • Statistics and Probability
  • Statistics, Probability and Uncertainty

Cite this

Using a Monotonic Density Ratio Model to Find the Asymptotically Optimal Combination of Multiple Diagnostic Tests. / Chen, Baojiang; Li, Pengfei; Qin, Jing; Yu, Tao.

In: Journal of the American Statistical Association, Vol. 111, No. 514, 02.04.2016, p. 861-874.

Research output: Contribution to journalArticle

@article{e2b55a7ea8f241aea32bc3fd4ebb5690,
title = "Using a Monotonic Density Ratio Model to Find the Asymptotically Optimal Combination of Multiple Diagnostic Tests",
abstract = "With the advent of new technology, new biomarker studies have become essential in cancer research. To achieve optimal sensitivity and specificity, one needs to combine different diagnostic tests. The celebrated Neyman–Pearson lemma enables us to use the density ratio to optimally combine different diagnostic tests. In this article, we propose a semiparametric model by directly modeling the density ratio between the diseased and nondiseased population as an unspecified monotonic nondecreasing function of a linear or nonlinear combination of multiple diagnostic tests. This method is appealing in that it is not necessary to assume separate models for the diseased and nondiseased populations. Further, the proposed method provides an asymptotically optimal way to combine multiple test results. We use a pool-adjacent-violation-algorithm to find the semiparametric maximum likelihood estimate of the receiver operating characteristic (ROC) curve. Using modern empirical process theory we show cubic root n consistency for the ROC curve and the underlying Euclidean parameter estimation. Extensive simulations show that the proposed method outperforms its competitors. We apply the method to two real-data applications. Supplementary materials for this article are available online.",
keywords = "Optimal combination of biomarkers, Pool-adjacent-violation-algorithm, ROC curve, Semiparametric likelihood, Sensitivity, Specificity",
author = "Baojiang Chen and Pengfei Li and Jing Qin and Tao Yu",
year = "2016",
month = "4",
day = "2",
doi = "10.1080/01621459.2015.1066681",
language = "English (US)",
volume = "111",
pages = "861--874",
journal = "Journal of the American Statistical Association",
issn = "0162-1459",
publisher = "Taylor and Francis Ltd.",
number = "514",

}

TY - JOUR

T1 - Using a Monotonic Density Ratio Model to Find the Asymptotically Optimal Combination of Multiple Diagnostic Tests

AU - Chen, Baojiang

AU - Li, Pengfei

AU - Qin, Jing

AU - Yu, Tao

PY - 2016/4/2

Y1 - 2016/4/2

N2 - With the advent of new technology, new biomarker studies have become essential in cancer research. To achieve optimal sensitivity and specificity, one needs to combine different diagnostic tests. The celebrated Neyman–Pearson lemma enables us to use the density ratio to optimally combine different diagnostic tests. In this article, we propose a semiparametric model by directly modeling the density ratio between the diseased and nondiseased population as an unspecified monotonic nondecreasing function of a linear or nonlinear combination of multiple diagnostic tests. This method is appealing in that it is not necessary to assume separate models for the diseased and nondiseased populations. Further, the proposed method provides an asymptotically optimal way to combine multiple test results. We use a pool-adjacent-violation-algorithm to find the semiparametric maximum likelihood estimate of the receiver operating characteristic (ROC) curve. Using modern empirical process theory we show cubic root n consistency for the ROC curve and the underlying Euclidean parameter estimation. Extensive simulations show that the proposed method outperforms its competitors. We apply the method to two real-data applications. Supplementary materials for this article are available online.

AB - With the advent of new technology, new biomarker studies have become essential in cancer research. To achieve optimal sensitivity and specificity, one needs to combine different diagnostic tests. The celebrated Neyman–Pearson lemma enables us to use the density ratio to optimally combine different diagnostic tests. In this article, we propose a semiparametric model by directly modeling the density ratio between the diseased and nondiseased population as an unspecified monotonic nondecreasing function of a linear or nonlinear combination of multiple diagnostic tests. This method is appealing in that it is not necessary to assume separate models for the diseased and nondiseased populations. Further, the proposed method provides an asymptotically optimal way to combine multiple test results. We use a pool-adjacent-violation-algorithm to find the semiparametric maximum likelihood estimate of the receiver operating characteristic (ROC) curve. Using modern empirical process theory we show cubic root n consistency for the ROC curve and the underlying Euclidean parameter estimation. Extensive simulations show that the proposed method outperforms its competitors. We apply the method to two real-data applications. Supplementary materials for this article are available online.

KW - Optimal combination of biomarkers

KW - Pool-adjacent-violation-algorithm

KW - ROC curve

KW - Semiparametric likelihood

KW - Sensitivity

KW - Specificity

UR - http://www.scopus.com/inward/record.url?scp=84983283637&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84983283637&partnerID=8YFLogxK

U2 - 10.1080/01621459.2015.1066681

DO - 10.1080/01621459.2015.1066681

M3 - Article

VL - 111

SP - 861

EP - 874

JO - Journal of the American Statistical Association

JF - Journal of the American Statistical Association

SN - 0162-1459

IS - 514

ER -