Performance evaluation of existing de novo sequencing algorithms

Sergey Pevtsov, Irina Fedulova, Hamid Mirzaei, Charles Buck, Xiang Zhang

Research output: Contribution to journal › Article › peer-review

88 Scopus citations

Abstract

Two methods have been developed for protein identification from tandem mass spectra: database searching and de novo sequencing. De novo sequencing identifies peptides directly from tandem mass spectra. Among the many proposed algorithms, we evaluated the performance of five de novo sequencing algorithms: AUDENS, Lutefisk, NovoHMM, PepNovo, and PEAKS. Our evaluation methods are based on the calculation of relative sequence distance (RSD), algorithm sensitivity, and spectrum quality. We found that de novo sequencing algorithms perform differently on QSTAR and LCQ mass spectrometer data but, in general, perform better on QSTAR data than on LCQ data. For the QSTAR data, the performance order of the five algorithms is PEAKS > Lutefisk, PepNovo > AUDENS, NovoHMM. The performance of PEAKS, Lutefisk, and PepNovo depends strongly on spectrum quality and improves as spectrum quality increases. However, AUDENS and NovoHMM are not sensitive to spectrum quality. Compared with the other four algorithms, PEAKS has the best sensitivity and also the best performance over the entire range of spectrum quality. For the LCQ data, the performance order is NovoHMM > PepNovo, PEAKS > Lutefisk > AUDENS. NovoHMM has the best sensitivity, and its performance is the best over the entire range of spectrum quality. However, the overall performance of NovoHMM is not significantly different from that of PEAKS and PepNovo. AUDENS does not perform well on either QSTAR or LCQ data.

Original language: English (US)
Pages (from-to): 3018-3028
Number of pages: 11
Journal: Journal of Proteome Research
Volume: 5
Issue number: 11
State: Published - Nov 2006

Keywords

  • De novo sequencing
  • Mass spectral quality
  • Mass spectrometry
  • Peptide identification

ASJC Scopus subject areas

  • Biochemistry
  • General Chemistry
