Comparative analysis of software repository metrics in BioPerl, BioJava and BioRuby

M. Rahmani, D. Bastola, L. Najjar

Research output: Contribution to journalConference article

Abstract

The open source programming languages, often with a bio-suffix, i.e. BioPerl, BioJava, and BioRuby, have been widely used in bio informatics and computational biology research. The computational tools written in these languages provide multiple functionalities as the languages make them flexible to create customized analysis and examination of biological data. In this paper, we investigate one of the software quality parameters, " maintainability", in BioPerl, BioJava, and BioRuby projects using comment density metric in their source code repositories. Furthermore, we evaluate the "popularity" of these three languages in bioinformatics communities using three other software metrics such as number of committers, commit frequency, and lines of code. To perform this study, source code repositories of these three open source projects have been analyzed from the first release, which covers all the programming activities of the projects from the starting date until July 2011. Our results show BioPerl to be the most popular language among the three languages in open source communities. In addition, investigation on comment density of these three open source projects has shown that BioPerl is the most promising one in terms of future maintainability and quality of the project. The results of this research can be useful for developers in choosing an appropriate language for the development of bioinformatics applications.

Original languageEnglish (US)
Pages (from-to)518-521
Number of pages4
JournalProcedia Computer Science
Volume9
DOIs
StatePublished - Jan 1 2012
Event12th Annual International Conference on Computational Science, ICCS 2012 - Omaha, NB, United States
Duration: Jun 4 2012Jun 6 2012

Fingerprint

Bioinformatics
Maintainability
Computer programming languages

Keywords

  • BioJava
  • BioPerl
  • BioRuby
  • Bioinformatics programming language
  • Software maintainability
  • Software repository metrics

ASJC Scopus subject areas

  • Computer Science(all)

Cite this

Comparative analysis of software repository metrics in BioPerl, BioJava and BioRuby. / Rahmani, M.; Bastola, D.; Najjar, L.

In: Procedia Computer Science, Vol. 9, 01.01.2012, p. 518-521.

Research output: Contribution to journalConference article

@article{717582a15ab44936b76f823f807b5b52,
title = "Comparative analysis of software repository metrics in BioPerl, BioJava and BioRuby",
abstract = "The open source programming languages, often with a bio-suffix, i.e. BioPerl, BioJava, and BioRuby, have been widely used in bio informatics and computational biology research. The computational tools written in these languages provide multiple functionalities as the languages make them flexible to create customized analysis and examination of biological data. In this paper, we investigate one of the software quality parameters, {"} maintainability{"}, in BioPerl, BioJava, and BioRuby projects using comment density metric in their source code repositories. Furthermore, we evaluate the {"}popularity{"} of these three languages in bioinformatics communities using three other software metrics such as number of committers, commit frequency, and lines of code. To perform this study, source code repositories of these three open source projects have been analyzed from the first release, which covers all the programming activities of the projects from the starting date until July 2011. Our results show BioPerl to be the most popular language among the three languages in open source communities. In addition, investigation on comment density of these three open source projects has shown that BioPerl is the most promising one in terms of future maintainability and quality of the project. The results of this research can be useful for developers in choosing an appropriate language for the development of bioinformatics applications.",
keywords = "BioJava, BioPerl, BioRuby, Bioinformatics programming language, Software maintainability, Software repository metrics",
author = "M. Rahmani and D. Bastola and L. Najjar",
year = "2012",
month = "1",
day = "1",
doi = "10.1016/j.procs.2012.04.055",
language = "English (US)",
volume = "9",
pages = "518--521",
journal = "Procedia Computer Science",
issn = "1877-0509",
publisher = "Elsevier BV",

}

TY - JOUR

T1 - Comparative analysis of software repository metrics in BioPerl, BioJava and BioRuby

AU - Rahmani, M.

AU - Bastola, D.

AU - Najjar, L.

PY - 2012/1/1

Y1 - 2012/1/1

N2 - The open source programming languages, often with a bio-suffix, i.e. BioPerl, BioJava, and BioRuby, have been widely used in bio informatics and computational biology research. The computational tools written in these languages provide multiple functionalities as the languages make them flexible to create customized analysis and examination of biological data. In this paper, we investigate one of the software quality parameters, " maintainability", in BioPerl, BioJava, and BioRuby projects using comment density metric in their source code repositories. Furthermore, we evaluate the "popularity" of these three languages in bioinformatics communities using three other software metrics such as number of committers, commit frequency, and lines of code. To perform this study, source code repositories of these three open source projects have been analyzed from the first release, which covers all the programming activities of the projects from the starting date until July 2011. Our results show BioPerl to be the most popular language among the three languages in open source communities. In addition, investigation on comment density of these three open source projects has shown that BioPerl is the most promising one in terms of future maintainability and quality of the project. The results of this research can be useful for developers in choosing an appropriate language for the development of bioinformatics applications.

AB - The open source programming languages, often with a bio-suffix, i.e. BioPerl, BioJava, and BioRuby, have been widely used in bio informatics and computational biology research. The computational tools written in these languages provide multiple functionalities as the languages make them flexible to create customized analysis and examination of biological data. In this paper, we investigate one of the software quality parameters, " maintainability", in BioPerl, BioJava, and BioRuby projects using comment density metric in their source code repositories. Furthermore, we evaluate the "popularity" of these three languages in bioinformatics communities using three other software metrics such as number of committers, commit frequency, and lines of code. To perform this study, source code repositories of these three open source projects have been analyzed from the first release, which covers all the programming activities of the projects from the starting date until July 2011. Our results show BioPerl to be the most popular language among the three languages in open source communities. In addition, investigation on comment density of these three open source projects has shown that BioPerl is the most promising one in terms of future maintainability and quality of the project. The results of this research can be useful for developers in choosing an appropriate language for the development of bioinformatics applications.

KW - BioJava

KW - BioPerl

KW - BioRuby

KW - Bioinformatics programming language

KW - Software maintainability

KW - Software repository metrics

UR - http://www.scopus.com/inward/record.url?scp=84897002717&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84897002717&partnerID=8YFLogxK

U2 - 10.1016/j.procs.2012.04.055

DO - 10.1016/j.procs.2012.04.055

M3 - Conference article

AN - SCOPUS:84897002717

VL - 9

SP - 518

EP - 521

JO - Procedia Computer Science

JF - Procedia Computer Science

SN - 1877-0509

ER -