PROMALS3D: A tool for multiple protein sequence and structure alignments

Jimin Pei, Bong Hyun Kim, Nick V. Grishin

Research output: Contribution to journalArticle

646 Citations (Scopus)

Abstract

Although multiple sequence alignments (MSAs) are essential for a wide range of applications from structure modeling to prediction of functional sites, construction of accurate MSAs for distantly related proteins remains a largely unsolved problem. The rapidly increasing database of spatial structures is a valuable source to improve alignment quality. We explore the use of 3D structural information to guide sequence alignments constructed by our MSA program PROMALS. The resulting tool, PROMALS3D, automatically identifies homologs with known 3D structures for the input sequences, derives structural constraints through structure-based alignments and combines them with sequence constraints to construct consistency-based multiple sequence alignments. The output is a consensus alignment that brings together sequence and structural information about input proteins and their homologs. PROMALS3D can also align sequences of multiple input structures, with the output representing a multiple structure-based alignment refined in combination with sequence constraints. The advantage of PROMALS3D is that it gives researchers an easy way to produce high-quality alignments consistent with both sequences and structures of proteins. PROMALS3D outperforms a number of existing methods for constructing multiple sequence or structural alignments using both reference-dependent and reference-independent evaluation methods.

Original languageEnglish (US)
Pages (from-to)2295-2300
Number of pages6
JournalNucleic Acids Research
Volume36
Issue number7
DOIs
StatePublished - Apr 2008

Fingerprint

Sequence Alignment
Proteins
Research Personnel
Databases

ASJC Scopus subject areas

  • Genetics

Cite this

PROMALS3D : A tool for multiple protein sequence and structure alignments. / Pei, Jimin; Kim, Bong Hyun; Grishin, Nick V.

In: Nucleic Acids Research, Vol. 36, No. 7, 04.2008, p. 2295-2300.

Research output: Contribution to journalArticle

@article{d33921bd724e44378a5ffe52e2d8d519,
title = "PROMALS3D: A tool for multiple protein sequence and structure alignments",
abstract = "Although multiple sequence alignments (MSAs) are essential for a wide range of applications from structure modeling to prediction of functional sites, construction of accurate MSAs for distantly related proteins remains a largely unsolved problem. The rapidly increasing database of spatial structures is a valuable source to improve alignment quality. We explore the use of 3D structural information to guide sequence alignments constructed by our MSA program PROMALS. The resulting tool, PROMALS3D, automatically identifies homologs with known 3D structures for the input sequences, derives structural constraints through structure-based alignments and combines them with sequence constraints to construct consistency-based multiple sequence alignments. The output is a consensus alignment that brings together sequence and structural information about input proteins and their homologs. PROMALS3D can also align sequences of multiple input structures, with the output representing a multiple structure-based alignment refined in combination with sequence constraints. The advantage of PROMALS3D is that it gives researchers an easy way to produce high-quality alignments consistent with both sequences and structures of proteins. PROMALS3D outperforms a number of existing methods for constructing multiple sequence or structural alignments using both reference-dependent and reference-independent evaluation methods.",
author = "Jimin Pei and Kim, {Bong Hyun} and Grishin, {Nick V.}",
year = "2008",
month = "4",
doi = "10.1093/nar/gkn072",
language = "English (US)",
volume = "36",
pages = "2295--2300",
journal = "Nucleic Acids Research",
issn = "0305-1048",
publisher = "Oxford University Press",
number = "7",

}

TY - JOUR

T1 - PROMALS3D

T2 - A tool for multiple protein sequence and structure alignments

AU - Pei, Jimin

AU - Kim, Bong Hyun

AU - Grishin, Nick V.

PY - 2008/4

Y1 - 2008/4

N2 - Although multiple sequence alignments (MSAs) are essential for a wide range of applications from structure modeling to prediction of functional sites, construction of accurate MSAs for distantly related proteins remains a largely unsolved problem. The rapidly increasing database of spatial structures is a valuable source to improve alignment quality. We explore the use of 3D structural information to guide sequence alignments constructed by our MSA program PROMALS. The resulting tool, PROMALS3D, automatically identifies homologs with known 3D structures for the input sequences, derives structural constraints through structure-based alignments and combines them with sequence constraints to construct consistency-based multiple sequence alignments. The output is a consensus alignment that brings together sequence and structural information about input proteins and their homologs. PROMALS3D can also align sequences of multiple input structures, with the output representing a multiple structure-based alignment refined in combination with sequence constraints. The advantage of PROMALS3D is that it gives researchers an easy way to produce high-quality alignments consistent with both sequences and structures of proteins. PROMALS3D outperforms a number of existing methods for constructing multiple sequence or structural alignments using both reference-dependent and reference-independent evaluation methods.

AB - Although multiple sequence alignments (MSAs) are essential for a wide range of applications from structure modeling to prediction of functional sites, construction of accurate MSAs for distantly related proteins remains a largely unsolved problem. The rapidly increasing database of spatial structures is a valuable source to improve alignment quality. We explore the use of 3D structural information to guide sequence alignments constructed by our MSA program PROMALS. The resulting tool, PROMALS3D, automatically identifies homologs with known 3D structures for the input sequences, derives structural constraints through structure-based alignments and combines them with sequence constraints to construct consistency-based multiple sequence alignments. The output is a consensus alignment that brings together sequence and structural information about input proteins and their homologs. PROMALS3D can also align sequences of multiple input structures, with the output representing a multiple structure-based alignment refined in combination with sequence constraints. The advantage of PROMALS3D is that it gives researchers an easy way to produce high-quality alignments consistent with both sequences and structures of proteins. PROMALS3D outperforms a number of existing methods for constructing multiple sequence or structural alignments using both reference-dependent and reference-independent evaluation methods.

UR - http://www.scopus.com/inward/record.url?scp=42449090264&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=42449090264&partnerID=8YFLogxK

U2 - 10.1093/nar/gkn072

DO - 10.1093/nar/gkn072

M3 - Article

C2 - 18287115

AN - SCOPUS:42449090264

VL - 36

SP - 2295

EP - 2300

JO - Nucleic Acids Research

JF - Nucleic Acids Research

SN - 0305-1048

IS - 7

ER -