Predicting Asthma Prevalence by Linking Social Media Data and Traditional Surveys

Hongying Dai, Brian R. Lee, Jianqiang Hao

Research output: Contribution to journalArticle

1 Citation (Scopus)

Abstract

Asthma is one of the most common chronic diseases that has a profound impact on people’s well-being and our society. In this study, we link multiple large-scale data sources to construct an epidemiological model to predict asthma prevalence across geographic regions. We use: (1) the Social Media Monitoring (SMM) data from Twitter (N = 500 million tweets/day), (2) the 2014 Behavioral Risk Factor Surveillance System (BRFSS) (N = 464,664), and (3) the 2014 American Community Survey (ACS) conducted by the U.S. Census Bureau (N = 3.5 million per year). We predict asthma prevalence in the traditional survey (BRFSS) using social media information collected from Twitter and socioeconomic factors collected from ACS. The evidence suggests that monitoring asthma-related tweets may provide real-time information that can be used to predict outcomes from traditional surveys.

Original languageEnglish (US)
Pages (from-to)75-92
Number of pages18
JournalAnnals of the American Academy of Political and Social Science
Volume669
Issue number1
DOIs
StatePublished - Jan 1 2017

Fingerprint

social media
twitter
surveillance
monitoring
socioeconomic factors
community
census
well-being
Disease
evidence

Keywords

  • ACS
  • BRFSS
  • SMM
  • asthma
  • data linkage
  • social media monitoring

ASJC Scopus subject areas

  • Sociology and Political Science
  • Social Sciences(all)

Cite this

Predicting Asthma Prevalence by Linking Social Media Data and Traditional Surveys. / Dai, Hongying; Lee, Brian R.; Hao, Jianqiang.

In: Annals of the American Academy of Political and Social Science, Vol. 669, No. 1, 01.01.2017, p. 75-92.

Research output: Contribution to journalArticle

@article{53324a9a0f574df7949e03b8b95e03c4,
title = "Predicting Asthma Prevalence by Linking Social Media Data and Traditional Surveys",
abstract = "Asthma is one of the most common chronic diseases that has a profound impact on people’s well-being and our society. In this study, we link multiple large-scale data sources to construct an epidemiological model to predict asthma prevalence across geographic regions. We use: (1) the Social Media Monitoring (SMM) data from Twitter (N = 500 million tweets/day), (2) the 2014 Behavioral Risk Factor Surveillance System (BRFSS) (N = 464,664), and (3) the 2014 American Community Survey (ACS) conducted by the U.S. Census Bureau (N = 3.5 million per year). We predict asthma prevalence in the traditional survey (BRFSS) using social media information collected from Twitter and socioeconomic factors collected from ACS. The evidence suggests that monitoring asthma-related tweets may provide real-time information that can be used to predict outcomes from traditional surveys.",
keywords = "ACS, BRFSS, SMM, asthma, data linkage, social media monitoring",
author = "Hongying Dai and Lee, {Brian R.} and Jianqiang Hao",
year = "2017",
month = "1",
day = "1",
doi = "10.1177/0002716216678399",
language = "English (US)",
volume = "669",
pages = "75--92",
journal = "Annals of the American Academy of Political and Social Science",
issn = "0002-7162",
publisher = "SAGE Publications Inc.",
number = "1",

}

TY - JOUR

T1 - Predicting Asthma Prevalence by Linking Social Media Data and Traditional Surveys

AU - Dai, Hongying

AU - Lee, Brian R.

AU - Hao, Jianqiang

PY - 2017/1/1

Y1 - 2017/1/1

N2 - Asthma is one of the most common chronic diseases that has a profound impact on people’s well-being and our society. In this study, we link multiple large-scale data sources to construct an epidemiological model to predict asthma prevalence across geographic regions. We use: (1) the Social Media Monitoring (SMM) data from Twitter (N = 500 million tweets/day), (2) the 2014 Behavioral Risk Factor Surveillance System (BRFSS) (N = 464,664), and (3) the 2014 American Community Survey (ACS) conducted by the U.S. Census Bureau (N = 3.5 million per year). We predict asthma prevalence in the traditional survey (BRFSS) using social media information collected from Twitter and socioeconomic factors collected from ACS. The evidence suggests that monitoring asthma-related tweets may provide real-time information that can be used to predict outcomes from traditional surveys.

AB - Asthma is one of the most common chronic diseases that has a profound impact on people’s well-being and our society. In this study, we link multiple large-scale data sources to construct an epidemiological model to predict asthma prevalence across geographic regions. We use: (1) the Social Media Monitoring (SMM) data from Twitter (N = 500 million tweets/day), (2) the 2014 Behavioral Risk Factor Surveillance System (BRFSS) (N = 464,664), and (3) the 2014 American Community Survey (ACS) conducted by the U.S. Census Bureau (N = 3.5 million per year). We predict asthma prevalence in the traditional survey (BRFSS) using social media information collected from Twitter and socioeconomic factors collected from ACS. The evidence suggests that monitoring asthma-related tweets may provide real-time information that can be used to predict outcomes from traditional surveys.

KW - ACS

KW - BRFSS

KW - SMM

KW - asthma

KW - data linkage

KW - social media monitoring

UR - http://www.scopus.com/inward/record.url?scp=85007021319&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85007021319&partnerID=8YFLogxK

U2 - 10.1177/0002716216678399

DO - 10.1177/0002716216678399

M3 - Article

AN - SCOPUS:85007021319

VL - 669

SP - 75

EP - 92

JO - Annals of the American Academy of Political and Social Science

JF - Annals of the American Academy of Political and Social Science

SN - 0002-7162

IS - 1

ER -