Discovering dynamic developer relationships from software version histories by time series segmentation

Harvey Pe Siy, Parvathi Chundi, Daniel J. Rosenkrantz, Mahadevan Subramaniam

Research output: Chapter in Book/Report/Conference proceedingConference contribution

6 Citations (Scopus)

Abstract

Time series analysis is a pmmising appmach to discover temporal patterns from time stamped, numeric data. A novel approach to apply time series analysis to discern temporal information from software version repositories is proposed. Version logs containing numeric as well as non-numeric data are represented as an item-set time series. A dynamic programming based algorithm to optimally segment an item-set time series is presented. The algorithm automatically produces a compacted item-set time series that can be analyzed to discern temporal patterns. The effectiveness of the approach is illustrated by applying to the Mozilla data set to study the change frequency and developer activity profiles. The experimental results show that the segmentation algorithm produces segments that capture meaningful information and is superior to the information content obtaining by arbitrarily segmenting time period into regular time intervals.

Original languageEnglish (US)
Title of host publicationICSM 2007 - Proceedings of the 2007 IEEE International Conference on Software Maintenance
Pages415-424
Number of pages10
DOIs
StatePublished - Dec 1 2007
Event23rd International Conference on Software Maintenance, ICSM - Paris, France
Duration: Oct 2 2007Oct 5 2007

Publication series

NameIEEE International Conference on Software Maintenance, ICSM

Conference

Conference23rd International Conference on Software Maintenance, ICSM
CountryFrance
CityParis
Period10/2/0710/5/07

Fingerprint

Time series
Time series analysis
Dynamic programming

ASJC Scopus subject areas

  • Software

Cite this

Siy, H. P., Chundi, P., Rosenkrantz, D. J., & Subramaniam, M. (2007). Discovering dynamic developer relationships from software version histories by time series segmentation. In ICSM 2007 - Proceedings of the 2007 IEEE International Conference on Software Maintenance (pp. 415-424). [4362654] (IEEE International Conference on Software Maintenance, ICSM). https://doi.org/10.1109/ICSM.2007.4362654

Discovering dynamic developer relationships from software version histories by time series segmentation. / Siy, Harvey Pe; Chundi, Parvathi; Rosenkrantz, Daniel J.; Subramaniam, Mahadevan.

ICSM 2007 - Proceedings of the 2007 IEEE International Conference on Software Maintenance. 2007. p. 415-424 4362654 (IEEE International Conference on Software Maintenance, ICSM).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Siy, HP, Chundi, P, Rosenkrantz, DJ & Subramaniam, M 2007, Discovering dynamic developer relationships from software version histories by time series segmentation. in ICSM 2007 - Proceedings of the 2007 IEEE International Conference on Software Maintenance., 4362654, IEEE International Conference on Software Maintenance, ICSM, pp. 415-424, 23rd International Conference on Software Maintenance, ICSM, Paris, France, 10/2/07. https://doi.org/10.1109/ICSM.2007.4362654
Siy HP, Chundi P, Rosenkrantz DJ, Subramaniam M. Discovering dynamic developer relationships from software version histories by time series segmentation. In ICSM 2007 - Proceedings of the 2007 IEEE International Conference on Software Maintenance. 2007. p. 415-424. 4362654. (IEEE International Conference on Software Maintenance, ICSM). https://doi.org/10.1109/ICSM.2007.4362654
Siy, Harvey Pe ; Chundi, Parvathi ; Rosenkrantz, Daniel J. ; Subramaniam, Mahadevan. / Discovering dynamic developer relationships from software version histories by time series segmentation. ICSM 2007 - Proceedings of the 2007 IEEE International Conference on Software Maintenance. 2007. pp. 415-424 (IEEE International Conference on Software Maintenance, ICSM).
@inproceedings{89011ca501ae487c8e51ce416964a0a1,
title = "Discovering dynamic developer relationships from software version histories by time series segmentation",
abstract = "Time series analysis is a pmmising appmach to discover temporal patterns from time stamped, numeric data. A novel approach to apply time series analysis to discern temporal information from software version repositories is proposed. Version logs containing numeric as well as non-numeric data are represented as an item-set time series. A dynamic programming based algorithm to optimally segment an item-set time series is presented. The algorithm automatically produces a compacted item-set time series that can be analyzed to discern temporal patterns. The effectiveness of the approach is illustrated by applying to the Mozilla data set to study the change frequency and developer activity profiles. The experimental results show that the segmentation algorithm produces segments that capture meaningful information and is superior to the information content obtaining by arbitrarily segmenting time period into regular time intervals.",
author = "Siy, {Harvey Pe} and Parvathi Chundi and Rosenkrantz, {Daniel J.} and Mahadevan Subramaniam",
year = "2007",
month = "12",
day = "1",
doi = "10.1109/ICSM.2007.4362654",
language = "English (US)",
isbn = "1424412560",
series = "IEEE International Conference on Software Maintenance, ICSM",
pages = "415--424",
booktitle = "ICSM 2007 - Proceedings of the 2007 IEEE International Conference on Software Maintenance",

}

TY - GEN

T1 - Discovering dynamic developer relationships from software version histories by time series segmentation

AU - Siy, Harvey Pe

AU - Chundi, Parvathi

AU - Rosenkrantz, Daniel J.

AU - Subramaniam, Mahadevan

PY - 2007/12/1

Y1 - 2007/12/1

N2 - Time series analysis is a pmmising appmach to discover temporal patterns from time stamped, numeric data. A novel approach to apply time series analysis to discern temporal information from software version repositories is proposed. Version logs containing numeric as well as non-numeric data are represented as an item-set time series. A dynamic programming based algorithm to optimally segment an item-set time series is presented. The algorithm automatically produces a compacted item-set time series that can be analyzed to discern temporal patterns. The effectiveness of the approach is illustrated by applying to the Mozilla data set to study the change frequency and developer activity profiles. The experimental results show that the segmentation algorithm produces segments that capture meaningful information and is superior to the information content obtaining by arbitrarily segmenting time period into regular time intervals.

AB - Time series analysis is a pmmising appmach to discover temporal patterns from time stamped, numeric data. A novel approach to apply time series analysis to discern temporal information from software version repositories is proposed. Version logs containing numeric as well as non-numeric data are represented as an item-set time series. A dynamic programming based algorithm to optimally segment an item-set time series is presented. The algorithm automatically produces a compacted item-set time series that can be analyzed to discern temporal patterns. The effectiveness of the approach is illustrated by applying to the Mozilla data set to study the change frequency and developer activity profiles. The experimental results show that the segmentation algorithm produces segments that capture meaningful information and is superior to the information content obtaining by arbitrarily segmenting time period into regular time intervals.

UR - http://www.scopus.com/inward/record.url?scp=47349086101&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=47349086101&partnerID=8YFLogxK

U2 - 10.1109/ICSM.2007.4362654

DO - 10.1109/ICSM.2007.4362654

M3 - Conference contribution

SN - 1424412560

SN - 9781424412563

T3 - IEEE International Conference on Software Maintenance, ICSM

SP - 415

EP - 424

BT - ICSM 2007 - Proceedings of the 2007 IEEE International Conference on Software Maintenance

ER -