Reconstruction of ancestral protein sequences and its applications

Wei Cai, Jimin Pei, Nick V. Grishin

Research output: Contribution to journalArticle

73 Citations (Scopus)

Abstract

Background: Modern-day proteins were selected during long evolutionary history as descendants of ancient life forms. In silico reconstruction of such ancestral protein sequences facilitates our understanding of evolutionary processes, protein classification and biological function. Additionally, reconstructed ancestral protein sequences could serve to fill in sequence space thus aiding remote homology inference. Results: We developed ANCESCON, a package for distance-based phylogenetic inference and reconstruction of ancestral protein sequences that takes into account the observed variation of evolutionary rates between positions that more precisely describes the evolution of protein families. To improve the accuracy of evolutionary distance estimation and ancestral sequence reconstruction, two approaches are proposed to estimate position-specific evolutionary rates. Comparisons show that at large evolutionary distances our method gives more accurate ancestral sequence reconstruction than PAML, PHYLIP and PAUP*. We apply the reconstructed ancestral sequences to homology inference and functional site prediction. We show that the usage of hypothetical ancestors together with the present day sequences improves profile-based sequence similarity searches; and that ancestral sequence reconstruction methods can be used to predict positions with functional specificity. Conclusions: As a computational tool to reconstruct ancestral protein sequences from a given multiple sequence alignment, ANCESCON shows high accuracy in tests and helps detection of remote homologs and prediction of functional sites. ANCESCON is freely available for noncommercial use. Pre-compiled versions for several platforms can be downloaded from ftp:// iole.swmed.edu/pub/ANCESCON/.

Original languageEnglish (US)
Article number33
JournalBMC Evolutionary Biology
Volume4
DOIs
StatePublished - Sep 17 2004

Fingerprint

amino acid sequences
protein
Proteins
prediction
proteins
homology
sequence alignment
ancestry
Sequence Alignment
history
Sequence Homology
phylogeny
Computer Simulation
methodology
History
phylogenetics
testing
rate
method

ASJC Scopus subject areas

  • Medicine(all)
  • Ecology, Evolution, Behavior and Systematics

Cite this

Reconstruction of ancestral protein sequences and its applications. / Cai, Wei; Pei, Jimin; Grishin, Nick V.

In: BMC Evolutionary Biology, Vol. 4, 33, 17.09.2004.

Research output: Contribution to journalArticle

@article{68de9db7d8c24062aa6455750890765b,
title = "Reconstruction of ancestral protein sequences and its applications",
abstract = "Background: Modern-day proteins were selected during long evolutionary history as descendants of ancient life forms. In silico reconstruction of such ancestral protein sequences facilitates our understanding of evolutionary processes, protein classification and biological function. Additionally, reconstructed ancestral protein sequences could serve to fill in sequence space thus aiding remote homology inference. Results: We developed ANCESCON, a package for distance-based phylogenetic inference and reconstruction of ancestral protein sequences that takes into account the observed variation of evolutionary rates between positions that more precisely describes the evolution of protein families. To improve the accuracy of evolutionary distance estimation and ancestral sequence reconstruction, two approaches are proposed to estimate position-specific evolutionary rates. Comparisons show that at large evolutionary distances our method gives more accurate ancestral sequence reconstruction than PAML, PHYLIP and PAUP*. We apply the reconstructed ancestral sequences to homology inference and functional site prediction. We show that the usage of hypothetical ancestors together with the present day sequences improves profile-based sequence similarity searches; and that ancestral sequence reconstruction methods can be used to predict positions with functional specificity. Conclusions: As a computational tool to reconstruct ancestral protein sequences from a given multiple sequence alignment, ANCESCON shows high accuracy in tests and helps detection of remote homologs and prediction of functional sites. ANCESCON is freely available for noncommercial use. Pre-compiled versions for several platforms can be downloaded from ftp:// iole.swmed.edu/pub/ANCESCON/.",
author = "Wei Cai and Jimin Pei and Grishin, {Nick V.}",
year = "2004",
month = "9",
day = "17",
doi = "10.1186/1471-2148-4-33",
language = "English (US)",
volume = "4",
journal = "BMC Evolutionary Biology",
issn = "1471-2148",
publisher = "BioMed Central",

}

TY - JOUR

T1 - Reconstruction of ancestral protein sequences and its applications

AU - Cai, Wei

AU - Pei, Jimin

AU - Grishin, Nick V.

PY - 2004/9/17

Y1 - 2004/9/17

N2 - Background: Modern-day proteins were selected during long evolutionary history as descendants of ancient life forms. In silico reconstruction of such ancestral protein sequences facilitates our understanding of evolutionary processes, protein classification and biological function. Additionally, reconstructed ancestral protein sequences could serve to fill in sequence space thus aiding remote homology inference. Results: We developed ANCESCON, a package for distance-based phylogenetic inference and reconstruction of ancestral protein sequences that takes into account the observed variation of evolutionary rates between positions that more precisely describes the evolution of protein families. To improve the accuracy of evolutionary distance estimation and ancestral sequence reconstruction, two approaches are proposed to estimate position-specific evolutionary rates. Comparisons show that at large evolutionary distances our method gives more accurate ancestral sequence reconstruction than PAML, PHYLIP and PAUP*. We apply the reconstructed ancestral sequences to homology inference and functional site prediction. We show that the usage of hypothetical ancestors together with the present day sequences improves profile-based sequence similarity searches; and that ancestral sequence reconstruction methods can be used to predict positions with functional specificity. Conclusions: As a computational tool to reconstruct ancestral protein sequences from a given multiple sequence alignment, ANCESCON shows high accuracy in tests and helps detection of remote homologs and prediction of functional sites. ANCESCON is freely available for noncommercial use. Pre-compiled versions for several platforms can be downloaded from ftp:// iole.swmed.edu/pub/ANCESCON/.

AB - Background: Modern-day proteins were selected during long evolutionary history as descendants of ancient life forms. In silico reconstruction of such ancestral protein sequences facilitates our understanding of evolutionary processes, protein classification and biological function. Additionally, reconstructed ancestral protein sequences could serve to fill in sequence space thus aiding remote homology inference. Results: We developed ANCESCON, a package for distance-based phylogenetic inference and reconstruction of ancestral protein sequences that takes into account the observed variation of evolutionary rates between positions that more precisely describes the evolution of protein families. To improve the accuracy of evolutionary distance estimation and ancestral sequence reconstruction, two approaches are proposed to estimate position-specific evolutionary rates. Comparisons show that at large evolutionary distances our method gives more accurate ancestral sequence reconstruction than PAML, PHYLIP and PAUP*. We apply the reconstructed ancestral sequences to homology inference and functional site prediction. We show that the usage of hypothetical ancestors together with the present day sequences improves profile-based sequence similarity searches; and that ancestral sequence reconstruction methods can be used to predict positions with functional specificity. Conclusions: As a computational tool to reconstruct ancestral protein sequences from a given multiple sequence alignment, ANCESCON shows high accuracy in tests and helps detection of remote homologs and prediction of functional sites. ANCESCON is freely available for noncommercial use. Pre-compiled versions for several platforms can be downloaded from ftp:// iole.swmed.edu/pub/ANCESCON/.

UR - http://www.scopus.com/inward/record.url?scp=8744259751&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=8744259751&partnerID=8YFLogxK

U2 - 10.1186/1471-2148-4-33

DO - 10.1186/1471-2148-4-33

M3 - Article

C2 - 15377393

AN - SCOPUS:8744259751

VL - 4

JO - BMC Evolutionary Biology

JF - BMC Evolutionary Biology

SN - 1471-2148

M1 - 33

ER -