SCOPmap

Automated assignment of protein structures to evolutionary superfamilies

Sara Cheek, Yuan Qi, S. Sri Krishna, Lisa N. Kinch, Nick V. Grishin

Research output: Contribution to journalArticle

34 Citations (Scopus)

Abstract

Background: Inference of remote homology between proteins is very challenging and remains a prerogative of an expert. Thus a significant drawback to the use of evolutionary-based protein structure classifications is the difficulty in assigning new proteins to unique positions in the classification scheme with automatic methods. To address this issue, we have developed an algorithm to map protein domains to an existing structural classification scheme and have applied it to the SCOP database. Results: The general strategy employed by this algorithm is to combine the results of several existing sequence and structure comparison tools applied to a query protein of known structure in order to find the homologs already classified in SCOP database and thus determine classification assignments. The algorithm is able to map domains within newly solved structures to the appropriate SCOP superfamily level with ∼95% accuracy. Examples of correctly mapped remote homologs are discussed. The algorithm is also capable of identifying potential evolutionary relationships not specified in the SCOP database, thus helping to make it better. The strategy of the mapping algorithm is not limited to SCOP and can be applied to any other evolutionary-based classification scheme as well. SCOPmap is available for download. Conclusion: The SCOPmap program is useful for assigning domains in newly solved structures to appropriate superfamilies and for identifying evolutionary links between different superfamilies.

Original languageEnglish (US)
Article number197
JournalBMC Bioinformatics
Volume5
DOIs
StatePublished - Dec 14 2004

Fingerprint

Protein Structure
Assignment
Proteins
Protein
Databases
Homology
Query
Strategy

ASJC Scopus subject areas

  • Medicine(all)
  • Structural Biology
  • Applied Mathematics

Cite this

SCOPmap : Automated assignment of protein structures to evolutionary superfamilies. / Cheek, Sara; Qi, Yuan; Krishna, S. Sri; Kinch, Lisa N.; Grishin, Nick V.

In: BMC Bioinformatics, Vol. 5, 197, 14.12.2004.

Research output: Contribution to journalArticle

Cheek, Sara ; Qi, Yuan ; Krishna, S. Sri ; Kinch, Lisa N. ; Grishin, Nick V. / SCOPmap : Automated assignment of protein structures to evolutionary superfamilies. In: BMC Bioinformatics. 2004 ; Vol. 5.
@article{bbb871a611714e319992b8b023d88c22,
title = "SCOPmap: Automated assignment of protein structures to evolutionary superfamilies",
abstract = "Background: Inference of remote homology between proteins is very challenging and remains a prerogative of an expert. Thus a significant drawback to the use of evolutionary-based protein structure classifications is the difficulty in assigning new proteins to unique positions in the classification scheme with automatic methods. To address this issue, we have developed an algorithm to map protein domains to an existing structural classification scheme and have applied it to the SCOP database. Results: The general strategy employed by this algorithm is to combine the results of several existing sequence and structure comparison tools applied to a query protein of known structure in order to find the homologs already classified in SCOP database and thus determine classification assignments. The algorithm is able to map domains within newly solved structures to the appropriate SCOP superfamily level with ∼95{\%} accuracy. Examples of correctly mapped remote homologs are discussed. The algorithm is also capable of identifying potential evolutionary relationships not specified in the SCOP database, thus helping to make it better. The strategy of the mapping algorithm is not limited to SCOP and can be applied to any other evolutionary-based classification scheme as well. SCOPmap is available for download. Conclusion: The SCOPmap program is useful for assigning domains in newly solved structures to appropriate superfamilies and for identifying evolutionary links between different superfamilies.",
author = "Sara Cheek and Yuan Qi and Krishna, {S. Sri} and Kinch, {Lisa N.} and Grishin, {Nick V.}",
year = "2004",
month = "12",
day = "14",
doi = "10.1186/1471-2105-5-197",
language = "English (US)",
volume = "5",
journal = "BMC Bioinformatics",
issn = "1471-2105",
publisher = "BioMed Central",

}

TY - JOUR

T1 - SCOPmap

T2 - Automated assignment of protein structures to evolutionary superfamilies

AU - Cheek, Sara

AU - Qi, Yuan

AU - Krishna, S. Sri

AU - Kinch, Lisa N.

AU - Grishin, Nick V.

PY - 2004/12/14

Y1 - 2004/12/14

N2 - Background: Inference of remote homology between proteins is very challenging and remains a prerogative of an expert. Thus a significant drawback to the use of evolutionary-based protein structure classifications is the difficulty in assigning new proteins to unique positions in the classification scheme with automatic methods. To address this issue, we have developed an algorithm to map protein domains to an existing structural classification scheme and have applied it to the SCOP database. Results: The general strategy employed by this algorithm is to combine the results of several existing sequence and structure comparison tools applied to a query protein of known structure in order to find the homologs already classified in SCOP database and thus determine classification assignments. The algorithm is able to map domains within newly solved structures to the appropriate SCOP superfamily level with ∼95% accuracy. Examples of correctly mapped remote homologs are discussed. The algorithm is also capable of identifying potential evolutionary relationships not specified in the SCOP database, thus helping to make it better. The strategy of the mapping algorithm is not limited to SCOP and can be applied to any other evolutionary-based classification scheme as well. SCOPmap is available for download. Conclusion: The SCOPmap program is useful for assigning domains in newly solved structures to appropriate superfamilies and for identifying evolutionary links between different superfamilies.

AB - Background: Inference of remote homology between proteins is very challenging and remains a prerogative of an expert. Thus a significant drawback to the use of evolutionary-based protein structure classifications is the difficulty in assigning new proteins to unique positions in the classification scheme with automatic methods. To address this issue, we have developed an algorithm to map protein domains to an existing structural classification scheme and have applied it to the SCOP database. Results: The general strategy employed by this algorithm is to combine the results of several existing sequence and structure comparison tools applied to a query protein of known structure in order to find the homologs already classified in SCOP database and thus determine classification assignments. The algorithm is able to map domains within newly solved structures to the appropriate SCOP superfamily level with ∼95% accuracy. Examples of correctly mapped remote homologs are discussed. The algorithm is also capable of identifying potential evolutionary relationships not specified in the SCOP database, thus helping to make it better. The strategy of the mapping algorithm is not limited to SCOP and can be applied to any other evolutionary-based classification scheme as well. SCOPmap is available for download. Conclusion: The SCOPmap program is useful for assigning domains in newly solved structures to appropriate superfamilies and for identifying evolutionary links between different superfamilies.

UR - http://www.scopus.com/inward/record.url?scp=13244264810&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=13244264810&partnerID=8YFLogxK

U2 - 10.1186/1471-2105-5-197

DO - 10.1186/1471-2105-5-197

M3 - Article

VL - 5

JO - BMC Bioinformatics

JF - BMC Bioinformatics

SN - 1471-2105

M1 - 197

ER -