Strategies for metagenomic-guided whole-community proteomics of complex microbial environments

Brandi L. Cantarel; Alison R. Erickson; Nathan C. VerBerkmoes; Brian K. Erickson; Patricia A. Carey; Chongle Pan; Manesh Shah; Emmanuel F. Mongodin; Janet K. Jansson; Claire M. Fraser-Liggett; Robert L. Hettich

doi:10.1371/journal.pone.0027173

Research output: Contribution to journal › Article › peer-review

52 Scopus citations

Abstract

Accurate protein identification in large-scale proteomics experiments relies upon a detailed, accurate protein catalogue, which is derived from predictions of open reading frames based on genome sequence data. Integration of mass spectrometry-based proteomics data with computational proteome predictions from environmental metagenomic sequences has been challenging because of the variable overlap between proteomic datasets and corresponding short-read nucleotide sequence data. In this study, we have benchmarked several strategies for increasing microbial peptide spectral matching in metaproteomic datasets using protein predictions generated from matched metagenomic sequences from the same human fecal samples. Additionally, we investigated the impact of mass spectrometry-based filters (high mass accuracy, delta correlation), and de novo peptide sequencing on the number and robustness of peptide-spectrum assignments in these complex datasets. In summary, we find that high mass accuracy peptide measurements searched against non-assembled reads from DNA sequencing of the same samples significantly increased identifiable proteins without sacrificing accuracy.
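The two mass spectrometry-based filters named in the abstract (high mass accuracy and delta correlation) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the `PSM` field names, the toy mass values, and the thresholds (10 ppm, delta correlation of 0.08) are hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PSM:
    """One peptide-spectrum match. Field names are hypothetical."""
    peptide: str
    observed_mass: float     # measured precursor mass, in Da
    theoretical_mass: float  # mass computed from the candidate peptide, in Da
    delta_cn: float          # score gap between best and second-best candidate


def ppm_error(psm: PSM) -> float:
    """Signed precursor mass error in parts per million."""
    return (psm.observed_mass - psm.theoretical_mass) / psm.theoretical_mass * 1e6


def filter_psms(psms, max_abs_ppm=10.0, min_delta_cn=0.08):
    """Keep PSMs that pass both filters. Default thresholds are illustrative only."""
    return [
        p for p in psms
        if abs(ppm_error(p)) <= max_abs_ppm and p.delta_cn >= min_delta_cn
    ]


# Toy values chosen to exercise each filter; they are not real peptide masses.
psms = [
    PSM("PEPTIDEK", 1000.005, 1000.000, 0.20),  # about 5 ppm, clear score gap
    PSM("SAMPLER", 1000.050, 1000.000, 0.30),   # about 50 ppm, fails mass accuracy
    PSM("ANOTHERK", 1000.002, 1000.000, 0.01),  # about 2 ppm, fails delta correlation
]
kept = filter_psms(psms)
print([p.peptide for p in kept])
```

In this sketch a match survives only if it passes both filters, so only the first toy record is retained. A real pipeline would take these values from the search engine's output and tune the thresholds against a false-discovery estimate.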

Original language	English (US)
Article number	e27173
Journal	PloS one
Volume	6
Issue number	11
DOIs	https://doi.org/10.1371/journal.pone.0027173
State	Published - Nov 23 2011

ASJC Scopus subject areas

General Biochemistry, Genetics and Molecular Biology
General Agricultural and Biological Sciences
General

Cite this

@article{5526ebb3888a4c70b829cec44154079c,
title = "Strategies for metagenomic-guided whole-community proteomics of complex microbial environments",
abstract = "Accurate protein identification in large-scale proteomics experiments relies upon a detailed, accurate protein catalogue, which is derived from predictions of open reading frames based on genome sequence data. Integration of mass spectrometry-based proteomics data with computational proteome predictions from environmental metagenomic sequences has been challenging because of the variable overlap between proteomic datasets and corresponding short-read nucleotide sequence data. In this study, we have benchmarked several strategies for increasing microbial peptide spectral matching in metaproteomic datasets using protein predictions generated from matched metagenomic sequences from the same human fecal samples. Additionally, we investigated the impact of mass spectrometry-based filters (high mass accuracy, delta correlation), and de novo peptide sequencing on the number and robustness of peptide-spectrum assignments in these complex datasets. In summary, we find that high mass accuracy peptide measurements searched against non-assembled reads from DNA sequencing of the same samples significantly increased identifiable proteins without sacrificing accuracy.",

author = "Cantarel, {Brandi L.} and Erickson, {Alison R.} and VerBerkmoes, {Nathan C.} and Erickson, {Brian K.} and Carey, {Patricia A.} and Pan, Chongle and Shah, Manesh and Mongodin, {Emmanuel F.} and Jansson, {Janet K.} and Fraser-Liggett, {Claire M.} and Hettich, {Robert L.}",
year = "2011",
month = nov,
day = "23",
doi = "10.1371/journal.pone.0027173",
language = "English (US)",
volume = "6",
journal = "PloS one",
issn = "1932-6203",
publisher = "Public Library of Science",
number = "11",
}

TY - JOUR
T1 - Strategies for metagenomic-guided whole-community proteomics of complex microbial environments
AU - Cantarel, Brandi L.
AU - Erickson, Alison R.
AU - VerBerkmoes, Nathan C.
AU - Erickson, Brian K.
AU - Carey, Patricia A.
AU - Pan, Chongle
AU - Shah, Manesh
AU - Mongodin, Emmanuel F.
AU - Jansson, Janet K.
AU - Fraser-Liggett, Claire M.
AU - Hettich, Robert L.
PY - 2011/11/23
Y1 - 2011/11/23
N2 - Accurate protein identification in large-scale proteomics experiments relies upon a detailed, accurate protein catalogue, which is derived from predictions of open reading frames based on genome sequence data. Integration of mass spectrometry-based proteomics data with computational proteome predictions from environmental metagenomic sequences has been challenging because of the variable overlap between proteomic datasets and corresponding short-read nucleotide sequence data. In this study, we have benchmarked several strategies for increasing microbial peptide spectral matching in metaproteomic datasets using protein predictions generated from matched metagenomic sequences from the same human fecal samples. Additionally, we investigated the impact of mass spectrometry-based filters (high mass accuracy, delta correlation), and de novo peptide sequencing on the number and robustness of peptide-spectrum assignments in these complex datasets. In summary, we find that high mass accuracy peptide measurements searched against non-assembled reads from DNA sequencing of the same samples significantly increased identifiable proteins without sacrificing accuracy.

UR - http://www.scopus.com/inward/record.url?scp=81755176064&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=81755176064&partnerID=8YFLogxK
U2 - 10.1371/journal.pone.0027173
DO - 10.1371/journal.pone.0027173
M3 - Article
C2 - 22132090
AN - SCOPUS:81755176064
SN - 1932-6203
VL - 6
JO - PloS one
JF - PloS one
IS - 11
M1 - e27173
ER -
