Statistical methods for estimating sequence divergence

Takashi Gojobori, Etsuko N. Moriyama, Motoo Kimura

Research output: Contribution to journalArticle

32 Citations (Scopus)

Abstract

This chapter describes methods for estimating total number of nucleotide substitutions. The initiation and termination codons are excluded from the comparison, because the former is usually invariant and changes of the latter are quite restrictive. When the number of differences varies depending on the types of base pairs, and furthermore when the value of K is expected to become more than 1.0, the four- and six-parameter methods are more suitable than other methods. In particular, the estimate obtained by the six-parameter method is often close to the true value even though the value of K becomes much larger. The formula for the six-parameter method, however, frequently tends to become inapplicable owing to sampling and stochastic errors unless the DNA sequences compared are sufficiently long. If the number of nucleotide differences between two DNA sequences is very small, the number of synonymous and nonsynonymous substitutions can be obtained simply by counting synonymous and nonsynonymous nucleotide differences.

Original languageEnglish (US)
Pages (from-to)531-550
Number of pages20
JournalMethods in enzymology
Volume183
Issue numberC
DOIs
StatePublished - Jan 1 1990

Fingerprint

Statistical methods
Nucleotides
DNA sequences
Substitution reactions
Initiator Codon
Selection Bias
Terminator Codon
Sampling
Base Pairing

ASJC Scopus subject areas

  • Biochemistry
  • Molecular Biology

Cite this

Statistical methods for estimating sequence divergence. / Gojobori, Takashi; Moriyama, Etsuko N.; Kimura, Motoo.

In: Methods in enzymology, Vol. 183, No. C, 01.01.1990, p. 531-550.

Research output: Contribution to journalArticle

Gojobori, Takashi ; Moriyama, Etsuko N. ; Kimura, Motoo. / Statistical methods for estimating sequence divergence. In: Methods in enzymology. 1990 ; Vol. 183, No. C. pp. 531-550.
@article{3e456e24c87e4ed5bc510a48f0490797,
title = "Statistical methods for estimating sequence divergence",
abstract = "This chapter describes methods for estimating total number of nucleotide substitutions. The initiation and termination codons are excluded from the comparison, because the former is usually invariant and changes of the latter are quite restrictive. When the number of differences varies depending on the types of base pairs, and furthermore when the value of K is expected to become more than 1.0, the four- and six-parameter methods are more suitable than other methods. In particular, the estimate obtained by the six-parameter method is often close to the true value even though the value of K becomes much larger. The formula for the six-parameter method, however, frequently tends to become inapplicable owing to sampling and stochastic errors unless the DNA sequences compared are sufficiently long. If the number of nucleotide differences between two DNA sequences is very small, the number of synonymous and nonsynonymous substitutions can be obtained simply by counting synonymous and nonsynonymous nucleotide differences.",
author = "Takashi Gojobori and Moriyama, {Etsuko N.} and Motoo Kimura",
year = "1990",
month = "1",
day = "1",
doi = "10.1016/0076-6879(90)83035-8",
language = "English (US)",
volume = "183",
pages = "531--550",
journal = "Methods in Enzymology",
issn = "0076-6879",
publisher = "Academic Press Inc.",
number = "C",

}

TY - JOUR

T1 - Statistical methods for estimating sequence divergence

AU - Gojobori, Takashi

AU - Moriyama, Etsuko N.

AU - Kimura, Motoo

PY - 1990/1/1

Y1 - 1990/1/1

N2 - This chapter describes methods for estimating total number of nucleotide substitutions. The initiation and termination codons are excluded from the comparison, because the former is usually invariant and changes of the latter are quite restrictive. When the number of differences varies depending on the types of base pairs, and furthermore when the value of K is expected to become more than 1.0, the four- and six-parameter methods are more suitable than other methods. In particular, the estimate obtained by the six-parameter method is often close to the true value even though the value of K becomes much larger. The formula for the six-parameter method, however, frequently tends to become inapplicable owing to sampling and stochastic errors unless the DNA sequences compared are sufficiently long. If the number of nucleotide differences between two DNA sequences is very small, the number of synonymous and nonsynonymous substitutions can be obtained simply by counting synonymous and nonsynonymous nucleotide differences.

AB - This chapter describes methods for estimating total number of nucleotide substitutions. The initiation and termination codons are excluded from the comparison, because the former is usually invariant and changes of the latter are quite restrictive. When the number of differences varies depending on the types of base pairs, and furthermore when the value of K is expected to become more than 1.0, the four- and six-parameter methods are more suitable than other methods. In particular, the estimate obtained by the six-parameter method is often close to the true value even though the value of K becomes much larger. The formula for the six-parameter method, however, frequently tends to become inapplicable owing to sampling and stochastic errors unless the DNA sequences compared are sufficiently long. If the number of nucleotide differences between two DNA sequences is very small, the number of synonymous and nonsynonymous substitutions can be obtained simply by counting synonymous and nonsynonymous nucleotide differences.

UR - http://www.scopus.com/inward/record.url?scp=0025320104&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0025320104&partnerID=8YFLogxK

U2 - 10.1016/0076-6879(90)83035-8

DO - 10.1016/0076-6879(90)83035-8

M3 - Article

C2 - 2314291

AN - SCOPUS:0025320104

VL - 183

SP - 531

EP - 550

JO - Methods in Enzymology

JF - Methods in Enzymology

SN - 0076-6879

IS - C

ER -