Sequential Association Rule Mining with Time Lags

Sherri K. Harms, Jitender S. Deogun

Research output: Contribution to journalArticle

85 Citations (Scopus)

Abstract

This paper presents MOWCATL, an efficient method for mining frequent association rules from multiple sequential data sets. Our goal is to find patterns in one or more sequences that precede the occurrence of patterns in other sequences. Recent work has highlighted the importance of using constraints to focus the mining process on the association rules relevant to the user. To refine the data mining process, this approach introduces the use of separate antecedent and consequent inclusion constraints, in addition to the traditional frequency and support constraints in sequential data mining. Moreover, separate antecedent and consequent maximum window widths are used to specify the antecedent and consequent patterns that are separated by either a maximal width time lag or a fixed width time lag. Multiple time series drought risk management data are used to show that our approach can be effectively employed in real-life problems. This approach is compared to existing methods to show how they complement each other to discover associations in the drought risk management domain. The experimental results validate the superior performance of our method for efficiently finding relationships between global climatic episodes and local drought conditions. Both the maximal and fixed width time lags are shown to be useful when finding interesting associations.

Original languageEnglish (US)
Pages (from-to)7-22
Number of pages16
JournalJournal of Intelligent Information Systems
Volume22
Issue number1
DOIs
StatePublished - Jan 1 2004

Fingerprint

Drought
Association rules
Risk management
Data mining
Time series

Keywords

  • Drought risk management
  • Knowledge discovery
  • Sequential rule discovery
  • Time lag

ASJC Scopus subject areas

  • Software
  • Information Systems
  • Hardware and Architecture
  • Computer Networks and Communications
  • Artificial Intelligence

Cite this

Sequential Association Rule Mining with Time Lags. / Harms, Sherri K.; Deogun, Jitender S.

In: Journal of Intelligent Information Systems, Vol. 22, No. 1, 01.01.2004, p. 7-22.

Research output: Contribution to journalArticle

@article{52ea8c2e8b1c464eb30d17e5cf371458,
title = "Sequential Association Rule Mining with Time Lags",
abstract = "This paper presents MOWCATL, an efficient method for mining frequent association rules from multiple sequential data sets. Our goal is to find patterns in one or more sequences that precede the occurrence of patterns in other sequences. Recent work has highlighted the importance of using constraints to focus the mining process on the association rules relevant to the user. To refine the data mining process, this approach introduces the use of separate antecedent and consequent inclusion constraints, in addition to the traditional frequency and support constraints in sequential data mining. Moreover, separate antecedent and consequent maximum window widths are used to specify the antecedent and consequent patterns that are separated by either a maximal width time lag or a fixed width time lag. Multiple time series drought risk management data are used to show that our approach can be effectively employed in real-life problems. This approach is compared to existing methods to show how they complement each other to discover associations in the drought risk management domain. The experimental results validate the superior performance of our method for efficiently finding relationships between global climatic episodes and local drought conditions. Both the maximal and fixed width time lags are shown to be useful when finding interesting associations.",
keywords = "Drought risk management, Knowledge discovery, Sequential rule discovery, Time lag",
author = "Harms, {Sherri K.} and Deogun, {Jitender S.}",
year = "2004",
month = "1",
day = "1",
doi = "10.1023/A:1025824629047",
language = "English (US)",
volume = "22",
pages = "7--22",
journal = "Journal of Intelligent Information Systems",
issn = "0925-9902",
publisher = "Springer Netherlands",
number = "1",

}

TY - JOUR

T1 - Sequential Association Rule Mining with Time Lags

AU - Harms, Sherri K.

AU - Deogun, Jitender S.

PY - 2004/1/1

Y1 - 2004/1/1

N2 - This paper presents MOWCATL, an efficient method for mining frequent association rules from multiple sequential data sets. Our goal is to find patterns in one or more sequences that precede the occurrence of patterns in other sequences. Recent work has highlighted the importance of using constraints to focus the mining process on the association rules relevant to the user. To refine the data mining process, this approach introduces the use of separate antecedent and consequent inclusion constraints, in addition to the traditional frequency and support constraints in sequential data mining. Moreover, separate antecedent and consequent maximum window widths are used to specify the antecedent and consequent patterns that are separated by either a maximal width time lag or a fixed width time lag. Multiple time series drought risk management data are used to show that our approach can be effectively employed in real-life problems. This approach is compared to existing methods to show how they complement each other to discover associations in the drought risk management domain. The experimental results validate the superior performance of our method for efficiently finding relationships between global climatic episodes and local drought conditions. Both the maximal and fixed width time lags are shown to be useful when finding interesting associations.

AB - This paper presents MOWCATL, an efficient method for mining frequent association rules from multiple sequential data sets. Our goal is to find patterns in one or more sequences that precede the occurrence of patterns in other sequences. Recent work has highlighted the importance of using constraints to focus the mining process on the association rules relevant to the user. To refine the data mining process, this approach introduces the use of separate antecedent and consequent inclusion constraints, in addition to the traditional frequency and support constraints in sequential data mining. Moreover, separate antecedent and consequent maximum window widths are used to specify the antecedent and consequent patterns that are separated by either a maximal width time lag or a fixed width time lag. Multiple time series drought risk management data are used to show that our approach can be effectively employed in real-life problems. This approach is compared to existing methods to show how they complement each other to discover associations in the drought risk management domain. The experimental results validate the superior performance of our method for efficiently finding relationships between global climatic episodes and local drought conditions. Both the maximal and fixed width time lags are shown to be useful when finding interesting associations.

KW - Drought risk management

KW - Knowledge discovery

KW - Sequential rule discovery

KW - Time lag

UR - http://www.scopus.com/inward/record.url?scp=0347900774&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0347900774&partnerID=8YFLogxK

U2 - 10.1023/A:1025824629047

DO - 10.1023/A:1025824629047

M3 - Article

AN - SCOPUS:0347900774

VL - 22

SP - 7

EP - 22

JO - Journal of Intelligent Information Systems

JF - Journal of Intelligent Information Systems

SN - 0925-9902

IS - 1

ER -