TY - JOUR
T1 - The pangenome structure of Escherichia coli
T2 - Comparative genomic analysis of E. coli commensal and pathogenic isolates
AU - Rasko, David A.
AU - Rosovitz, M. J.
AU - Myers, Garry S A
AU - Mongodin, Emmanuel F.
AU - Fricke, W. Florian
AU - Gajer, Pawel
AU - Crabtree, Jonathan
AU - Sebaihia, Mohammed
AU - Thomson, Nicholas R.
AU - Chaudhuri, Roy
AU - Henderson, Ian R.
AU - Sperandio, Vanessa
AU - Ravel, Jacques
PY - 2008/10
Y1 - 2008/10
N2 - Whole-genome sequencing has been skewed toward bacterial pathogens as a consequence of the prioritization of medical and veterinary diseases. However, it is becoming clear that in order to accurately measure genetic variation within and between pathogenic groups, multiple isolates, as well as commensal species, must be sequenced. This study examined the pangenomic content of Escherichia coli. Six distinct E. coli pathovars can be distinguished using molecular or phenotypic markers, but only two of the six pathovars have been subjected to any genome sequencing previously. Thus, this report provides a seminal description of the genomic contents and unique features of three unsequenced pathovars, enterotoxigenic E. coli, enteropathogenic E. coli, and enteroaggregative E. coli. We also determined the first genome sequence of a human commensal E. coli isolate, E. coli HS, which will undoubtedly provide a new baseline from which workers can examine the evolution of pathogenic E. coli. Comparison of 17 E. coli genomes, 8 of which are new, resulted in identification of ∼2,200 genes conserved in all isolates. We were also able to identify genes that were isolate and pathovar specific. Fewer pathovar-specific genes were identified than anticipated, suggesting that each isolate may have independently developed virulence capabilities. Pangenome calculations indicate that E. coli genomic diversity represents an open pangenome model containing a reservoir of more than 13,000 genes, many of which may be uncharacterized but important virulence factors. This comparative study of the species E. coli, while descriptive, should provide the basis for future functional work on this important group of pathogens.
AB - Whole-genome sequencing has been skewed toward bacterial pathogens as a consequence of the prioritization of medical and veterinary diseases. However, it is becoming clear that in order to accurately measure genetic variation within and between pathogenic groups, multiple isolates, as well as commensal species, must be sequenced. This study examined the pangenomic content of Escherichia coli. Six distinct E. coli pathovars can be distinguished using molecular or phenotypic markers, but only two of the six pathovars have been subjected to any genome sequencing previously. Thus, this report provides a seminal description of the genomic contents and unique features of three unsequenced pathovars, enterotoxigenic E. coli, enteropathogenic E. coli, and enteroaggregative E. coli. We also determined the first genome sequence of a human commensal E. coli isolate, E. coli HS, which will undoubtedly provide a new baseline from which workers can examine the evolution of pathogenic E. coli. Comparison of 17 E. coli genomes, 8 of which are new, resulted in identification of ∼2,200 genes conserved in all isolates. We were also able to identify genes that were isolate and pathovar specific. Fewer pathovar-specific genes were identified than anticipated, suggesting that each isolate may have independently developed virulence capabilities. Pangenome calculations indicate that E. coli genomic diversity represents an open pangenome model containing a reservoir of more than 13,000 genes, many of which may be uncharacterized but important virulence factors. This comparative study of the species E. coli, while descriptive, should provide the basis for future functional work on this important group of pathogens.
UR - http://www.scopus.com/inward/record.url?scp=53849104729&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=53849104729&partnerID=8YFLogxK
U2 - 10.1128/JB.00619-08
DO - 10.1128/JB.00619-08
M3 - Article
C2 - 18676672
AN - SCOPUS:53849104729
SN - 0021-9193
VL - 190
SP - 6881
EP - 6893
JO - Journal of bacteriology
JF - Journal of bacteriology
IS - 20
ER -