A new sequence distance measure for phylogenetic tree construction

Research output: Contribution to journalArticle

254 Citations (Scopus)

Abstract

Motivation: Most existing approaches for phylogenetic inference use multiple alignment of sequences and assume some sort of an evolutionary model. The multiple alignment strategy does not work for all types of data, e.g. whole genome phylogeny, and the evolutionary models may not always be correct. We propose a new sequence distance measure based on the relative information between the sequences using Lempel-Ziv complexity. The distance matrix thus obtained can be used to construct phylogenetic trees. Results: The proposed approach does not require sequence alignment and is totally automatic. The algorithm has successfully constructed consistent phylogenies for real and simulated data sets.

Original languageEnglish (US)
Pages (from-to)2122-2130
Number of pages9
JournalBioinformatics
Volume19
Issue number16
DOIs
StatePublished - Nov 1 2003

Fingerprint

Sequence Alignment
Phylogenetic Tree
Distance Measure
Phylogeny
Alignment
Distance Matrix
Genome
Phylogenetics
Sort
Genes
Model
Datasets

ASJC Scopus subject areas

  • Statistics and Probability
  • Biochemistry
  • Molecular Biology
  • Computer Science Applications
  • Computational Theory and Mathematics
  • Computational Mathematics

Cite this

A new sequence distance measure for phylogenetic tree construction. / Otu, Hasan H; Sayood, Khalid.

In: Bioinformatics, Vol. 19, No. 16, 01.11.2003, p. 2122-2130.

Research output: Contribution to journalArticle

@article{9bf3d540ddcb4e0196c6eb3bf9ea46c6,
title = "A new sequence distance measure for phylogenetic tree construction",
abstract = "Motivation: Most existing approaches for phylogenetic inference use multiple alignment of sequences and assume some sort of an evolutionary model. The multiple alignment strategy does not work for all types of data, e.g. whole genome phylogeny, and the evolutionary models may not always be correct. We propose a new sequence distance measure based on the relative information between the sequences using Lempel-Ziv complexity. The distance matrix thus obtained can be used to construct phylogenetic trees. Results: The proposed approach does not require sequence alignment and is totally automatic. The algorithm has successfully constructed consistent phylogenies for real and simulated data sets.",
author = "Otu, {Hasan H} and Khalid Sayood",
year = "2003",
month = "11",
day = "1",
doi = "10.1093/bioinformatics/btg295",
language = "English (US)",
volume = "19",
pages = "2122--2130",
journal = "Bioinformatics",
issn = "1367-4803",
publisher = "Oxford University Press",
number = "16",

}

TY - JOUR

T1 - A new sequence distance measure for phylogenetic tree construction

AU - Otu, Hasan H

AU - Sayood, Khalid

PY - 2003/11/1

Y1 - 2003/11/1

N2 - Motivation: Most existing approaches for phylogenetic inference use multiple alignment of sequences and assume some sort of an evolutionary model. The multiple alignment strategy does not work for all types of data, e.g. whole genome phylogeny, and the evolutionary models may not always be correct. We propose a new sequence distance measure based on the relative information between the sequences using Lempel-Ziv complexity. The distance matrix thus obtained can be used to construct phylogenetic trees. Results: The proposed approach does not require sequence alignment and is totally automatic. The algorithm has successfully constructed consistent phylogenies for real and simulated data sets.

AB - Motivation: Most existing approaches for phylogenetic inference use multiple alignment of sequences and assume some sort of an evolutionary model. The multiple alignment strategy does not work for all types of data, e.g. whole genome phylogeny, and the evolutionary models may not always be correct. We propose a new sequence distance measure based on the relative information between the sequences using Lempel-Ziv complexity. The distance matrix thus obtained can be used to construct phylogenetic trees. Results: The proposed approach does not require sequence alignment and is totally automatic. The algorithm has successfully constructed consistent phylogenies for real and simulated data sets.

UR - http://www.scopus.com/inward/record.url?scp=0242643741&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0242643741&partnerID=8YFLogxK

U2 - 10.1093/bioinformatics/btg295

DO - 10.1093/bioinformatics/btg295

M3 - Article

VL - 19

SP - 2122

EP - 2130

JO - Bioinformatics

JF - Bioinformatics

SN - 1367-4803

IS - 16

ER -