A heuristic solution of the identifiability problem of the age-period-cohort analysis of cancer occurrence

Lung cancer example

Tengiz Mdzinarishvili, Simon Sherman

Research output: Contribution to journalArticle

5 Citations (Scopus)

Abstract

Background: The Age-Period-Cohort (APC) analysis is aimed at estimating the following effects on disease incidence: (i) the age of the subject at the time of disease diagnosis; (ii) the time period, when the disease occurred; and (iii) the date of birth of the subject. These effects can help in evaluating the biological events leading to the disease, in estimating the influence of distinct risk factors on disease occurrence, and in the development of new strategies for disease prevention and treatment. Methodology/Principal Findings: We developed a novel approach for estimating the APC effects on disease incidence rates in the frame of the Log-Linear Age-Period-Cohort (LLAPC) model. Since the APC effects are linearly interdependent and cannot be uniquely estimated, solving this identifiability problem requires setting four redundant parameters within a set of unknown parameters. By setting three parameters (one of the time-period and the birth-cohort effects and the corresponding age effect) to zero, we reduced this problem to the problem of determining one redundant parameter and, used as such, the effect of the time-period adjacent to the anchored time period. By varying this identification parameter, a family of estimates of the APC effects can be obtained. Using a heuristic assumption that the differences between the adjacent birth-cohort effects are small, we developed a numerical method for determining the optimal value of the identification parameter, by which a unique set of all APC effects is determined and the identifiability problem is solved. Conclusions/Significance: We tested this approach while estimating the APC effects on lung cancer occurrence in white men and women using the SEER data, collected during 1975-2004. We showed that the LLAPC models with the corresponding unique sets of the APC effects estimated by the proposed approach fit very well with the observational data.

Original languageEnglish (US)
Article numbere34362
JournalPloS one
Volume7
Issue number4
DOIs
StatePublished - Apr 4 2012

Fingerprint

Cohort Effect
lung neoplasms
Lung Neoplasms
Cohort Studies
Demography
neoplasms
Parturition
Identification (control systems)
disease incidence
Heuristics
Incidence
Numerical methods
disease occurrence
disease prevention
disease diagnosis
risk factors

ASJC Scopus subject areas

  • Biochemistry, Genetics and Molecular Biology(all)
  • Agricultural and Biological Sciences(all)

Cite this

A heuristic solution of the identifiability problem of the age-period-cohort analysis of cancer occurrence : Lung cancer example. / Mdzinarishvili, Tengiz; Sherman, Simon.

In: PloS one, Vol. 7, No. 4, e34362, 04.04.2012.

Research output: Contribution to journalArticle

@article{3f65d81164fd4c4fb960b7c27ddbab49,
title = "A heuristic solution of the identifiability problem of the age-period-cohort analysis of cancer occurrence: Lung cancer example",
abstract = "Background: The Age-Period-Cohort (APC) analysis is aimed at estimating the following effects on disease incidence: (i) the age of the subject at the time of disease diagnosis; (ii) the time period, when the disease occurred; and (iii) the date of birth of the subject. These effects can help in evaluating the biological events leading to the disease, in estimating the influence of distinct risk factors on disease occurrence, and in the development of new strategies for disease prevention and treatment. Methodology/Principal Findings: We developed a novel approach for estimating the APC effects on disease incidence rates in the frame of the Log-Linear Age-Period-Cohort (LLAPC) model. Since the APC effects are linearly interdependent and cannot be uniquely estimated, solving this identifiability problem requires setting four redundant parameters within a set of unknown parameters. By setting three parameters (one of the time-period and the birth-cohort effects and the corresponding age effect) to zero, we reduced this problem to the problem of determining one redundant parameter and, used as such, the effect of the time-period adjacent to the anchored time period. By varying this identification parameter, a family of estimates of the APC effects can be obtained. Using a heuristic assumption that the differences between the adjacent birth-cohort effects are small, we developed a numerical method for determining the optimal value of the identification parameter, by which a unique set of all APC effects is determined and the identifiability problem is solved. Conclusions/Significance: We tested this approach while estimating the APC effects on lung cancer occurrence in white men and women using the SEER data, collected during 1975-2004. We showed that the LLAPC models with the corresponding unique sets of the APC effects estimated by the proposed approach fit very well with the observational data.",
author = "Tengiz Mdzinarishvili and Simon Sherman",
year = "2012",
month = "4",
day = "4",
doi = "10.1371/journal.pone.0034362",
language = "English (US)",
volume = "7",
journal = "PLoS One",
issn = "1932-6203",
publisher = "Public Library of Science",
number = "4",

}

TY - JOUR

T1 - A heuristic solution of the identifiability problem of the age-period-cohort analysis of cancer occurrence

T2 - Lung cancer example

AU - Mdzinarishvili, Tengiz

AU - Sherman, Simon

PY - 2012/4/4

Y1 - 2012/4/4

N2 - Background: The Age-Period-Cohort (APC) analysis is aimed at estimating the following effects on disease incidence: (i) the age of the subject at the time of disease diagnosis; (ii) the time period, when the disease occurred; and (iii) the date of birth of the subject. These effects can help in evaluating the biological events leading to the disease, in estimating the influence of distinct risk factors on disease occurrence, and in the development of new strategies for disease prevention and treatment. Methodology/Principal Findings: We developed a novel approach for estimating the APC effects on disease incidence rates in the frame of the Log-Linear Age-Period-Cohort (LLAPC) model. Since the APC effects are linearly interdependent and cannot be uniquely estimated, solving this identifiability problem requires setting four redundant parameters within a set of unknown parameters. By setting three parameters (one of the time-period and the birth-cohort effects and the corresponding age effect) to zero, we reduced this problem to the problem of determining one redundant parameter and, used as such, the effect of the time-period adjacent to the anchored time period. By varying this identification parameter, a family of estimates of the APC effects can be obtained. Using a heuristic assumption that the differences between the adjacent birth-cohort effects are small, we developed a numerical method for determining the optimal value of the identification parameter, by which a unique set of all APC effects is determined and the identifiability problem is solved. Conclusions/Significance: We tested this approach while estimating the APC effects on lung cancer occurrence in white men and women using the SEER data, collected during 1975-2004. We showed that the LLAPC models with the corresponding unique sets of the APC effects estimated by the proposed approach fit very well with the observational data.

AB - Background: The Age-Period-Cohort (APC) analysis is aimed at estimating the following effects on disease incidence: (i) the age of the subject at the time of disease diagnosis; (ii) the time period, when the disease occurred; and (iii) the date of birth of the subject. These effects can help in evaluating the biological events leading to the disease, in estimating the influence of distinct risk factors on disease occurrence, and in the development of new strategies for disease prevention and treatment. Methodology/Principal Findings: We developed a novel approach for estimating the APC effects on disease incidence rates in the frame of the Log-Linear Age-Period-Cohort (LLAPC) model. Since the APC effects are linearly interdependent and cannot be uniquely estimated, solving this identifiability problem requires setting four redundant parameters within a set of unknown parameters. By setting three parameters (one of the time-period and the birth-cohort effects and the corresponding age effect) to zero, we reduced this problem to the problem of determining one redundant parameter and, used as such, the effect of the time-period adjacent to the anchored time period. By varying this identification parameter, a family of estimates of the APC effects can be obtained. Using a heuristic assumption that the differences between the adjacent birth-cohort effects are small, we developed a numerical method for determining the optimal value of the identification parameter, by which a unique set of all APC effects is determined and the identifiability problem is solved. Conclusions/Significance: We tested this approach while estimating the APC effects on lung cancer occurrence in white men and women using the SEER data, collected during 1975-2004. We showed that the LLAPC models with the corresponding unique sets of the APC effects estimated by the proposed approach fit very well with the observational data.

UR - http://www.scopus.com/inward/record.url?scp=84859364137&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84859364137&partnerID=8YFLogxK

U2 - 10.1371/journal.pone.0034362

DO - 10.1371/journal.pone.0034362

M3 - Article

VL - 7

JO - PLoS One

JF - PLoS One

SN - 1932-6203

IS - 4

M1 - e34362

ER -