COMPASS: A tool for comparison of multiple protein alignments with assessment of statistical significance

Ruslan Sadreyev, Nick Grishin

Research output: Contribution to journalArticlepeer-review

243 Scopus citations

Abstract

We present a novel method for the comparison of multiple protein alignments with assessment of statistical significance (COMPASS). The method derives numerical profiles from alignments, constructs optimal local profile-profile alignments and analytically estimates E-values for the detected similarities. The scoring system and E-value calculation are based on a generalization of the PSI-BLAST approach to profile-sequence comparison, which is adapted for the profile-profile case. Tested along with existing methods for profile-sequence (PSI-BLAST) and profile-profile (prof_sim) comparison, COMPASS shows increased abilities for sensitive and selective detection of remote sequence similarities, as well as improved quality of local alignments. The method allows prediction of relationships between protein families in the PFAM database beyond the range of conventional methods. Two predicted relations with high significance are similarities between various Rossmann-type folds and between various helix-turn-helix-containing families. The potential value of COMPASS for structure/function predictions is illustrated by the detection of an intricate homology between the DNA-binding domain of the CTF/NFI family and the MH1 domain of the Smad family.

Original languageEnglish (US)
Pages (from-to)317-336
Number of pages20
JournalJournal of Molecular Biology
Volume326
Issue number1
DOIs
StatePublished - Feb 7 2003

Keywords

  • CTF/NFI
  • Profile-profile comparison
  • Protein structure prediction
  • Sequence profiles
  • Sequence similarity searches

ASJC Scopus subject areas

  • Molecular Biology
  • Biophysics
  • Structural Biology

Fingerprint

Dive into the research topics of 'COMPASS: A tool for comparison of multiple protein alignments with assessment of statistical significance'. Together they form a unique fingerprint.

Cite this