Discovering meaningful clusters from mining software engineering literature

Yan Wu, Harvey Siy, Li Fan

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Document clustering is becoming an increasingly popular technique for identifying relationships in unstructured text. In this paper, we attempt to make sense of the output of a clustering algorithm applied to software engineering research papers. We introduce a notion of cluster "stability" as a measure of the meaningfulness of a cluster. We assess its usefulness and limitations in identifying meaningful clusters. In the process, we track how important research topics may have changed from year to year.

Original languageEnglish (US)
Title of host publication20th International Conference on Software Engineering and Knowledge Engineering, SEKE 2008
Pages613-618
Number of pages6
StatePublished - Dec 1 2008
Event20th International Conference on Software Engineering and Knowledge Engineering, SEKE 2008 - San Francisco Bay, CA, United States
Duration: Jul 1 2008Jul 3 2008

Publication series

Name20th International Conference on Software Engineering and Knowledge Engineering, SEKE 2008

Conference

Conference20th International Conference on Software Engineering and Knowledge Engineering, SEKE 2008
CountryUnited States
CitySan Francisco Bay, CA
Period7/1/087/3/08

Fingerprint

Mining engineering
Engineering research
Clustering algorithms
Software engineering

ASJC Scopus subject areas

  • Software

Cite this

Wu, Y., Siy, H., & Fan, L. (2008). Discovering meaningful clusters from mining software engineering literature. In 20th International Conference on Software Engineering and Knowledge Engineering, SEKE 2008 (pp. 613-618). (20th International Conference on Software Engineering and Knowledge Engineering, SEKE 2008).

Discovering meaningful clusters from mining software engineering literature. / Wu, Yan; Siy, Harvey; Fan, Li.

20th International Conference on Software Engineering and Knowledge Engineering, SEKE 2008. 2008. p. 613-618 (20th International Conference on Software Engineering and Knowledge Engineering, SEKE 2008).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Wu, Y, Siy, H & Fan, L 2008, Discovering meaningful clusters from mining software engineering literature. in 20th International Conference on Software Engineering and Knowledge Engineering, SEKE 2008. 20th International Conference on Software Engineering and Knowledge Engineering, SEKE 2008, pp. 613-618, 20th International Conference on Software Engineering and Knowledge Engineering, SEKE 2008, San Francisco Bay, CA, United States, 7/1/08.
Wu Y, Siy H, Fan L. Discovering meaningful clusters from mining software engineering literature. In 20th International Conference on Software Engineering and Knowledge Engineering, SEKE 2008. 2008. p. 613-618. (20th International Conference on Software Engineering and Knowledge Engineering, SEKE 2008).
Wu, Yan ; Siy, Harvey ; Fan, Li. / Discovering meaningful clusters from mining software engineering literature. 20th International Conference on Software Engineering and Knowledge Engineering, SEKE 2008. 2008. pp. 613-618 (20th International Conference on Software Engineering and Knowledge Engineering, SEKE 2008).
@inproceedings{7d4f8b530ffa476eaba1b27e34e998cc,
title = "Discovering meaningful clusters from mining software engineering literature",
abstract = "Document clustering is becoming an increasingly popular technique for identifying relationships in unstructured text. In this paper, we attempt to make sense of the output of a clustering algorithm applied to software engineering research papers. We introduce a notion of cluster {"}stability{"} as a measure of the meaningfulness of a cluster. We assess its usefulness and limitations in identifying meaningful clusters. In the process, we track how important research topics may have changed from year to year.",
author = "Yan Wu and Harvey Siy and Li Fan",
year = "2008",
month = "12",
day = "1",
language = "English (US)",
isbn = "9781627486620",
series = "20th International Conference on Software Engineering and Knowledge Engineering, SEKE 2008",
pages = "613--618",
booktitle = "20th International Conference on Software Engineering and Knowledge Engineering, SEKE 2008",

}

TY - GEN

T1 - Discovering meaningful clusters from mining software engineering literature

AU - Wu, Yan

AU - Siy, Harvey

AU - Fan, Li

PY - 2008/12/1

Y1 - 2008/12/1

N2 - Document clustering is becoming an increasingly popular technique for identifying relationships in unstructured text. In this paper, we attempt to make sense of the output of a clustering algorithm applied to software engineering research papers. We introduce a notion of cluster "stability" as a measure of the meaningfulness of a cluster. We assess its usefulness and limitations in identifying meaningful clusters. In the process, we track how important research topics may have changed from year to year.

AB - Document clustering is becoming an increasingly popular technique for identifying relationships in unstructured text. In this paper, we attempt to make sense of the output of a clustering algorithm applied to software engineering research papers. We introduce a notion of cluster "stability" as a measure of the meaningfulness of a cluster. We assess its usefulness and limitations in identifying meaningful clusters. In the process, we track how important research topics may have changed from year to year.

UR - http://www.scopus.com/inward/record.url?scp=84886892151&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84886892151&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:84886892151

SN - 9781627486620

T3 - 20th International Conference on Software Engineering and Knowledge Engineering, SEKE 2008

SP - 613

EP - 618

BT - 20th International Conference on Software Engineering and Knowledge Engineering, SEKE 2008

ER -