MAKER

An easy-to-use annotation pipeline designed for emerging model organism genomes

Brandi L. Cantarel, Ian Korf, Sofia M.C. Robb, Genis Parra, Eric Ross, Barry Moore, Carson Holt, Alejandro Sánchez Alvarado, Mark Yandell

Research output: Contribution to journalArticle

540 Citations (Scopus)

Abstract

We have developed a portable and easily configurable genome annotation pipeline called MAKER. Its purpose is to allow investigators to independently annotate eukaryotic genomes and create genome databases. MAKER identifies repeats, aligns ESTs and proteins to a genome, produces ab initio gene predictions, and automatically synthesizes these data into gene annotations having evidence-based quality indices. MAKER is also easily trainable: Outputs of preliminary runs are used to automatically retrain its gene-prediction algorithm, producing higher-quality gene-models on subsequent runs. MAKER's inputs are minimal, and its outputs can be used to create a GMOD database. Its outputs can also be viewed in the Apollo Genome browser; this feature of MAKER provides an easy means to annotate, view, and edit individual contigs and BACs without the overhead of a database. As proof of principle, we have used MAKER to annotate the genome of the planarian Schmidtea mediterranea and to create a new genome database, SmedGD. We have also compared MAKER's performance to other published annotation pipelines. Our results demonstrate that MAKER provides a simple and effective means to convert a genome sequence into a community-accessible genome database. MAKER should prove especially useful for emerging model organism genome projects for which extensive bioinformatics resources may not be readily available.

Original languageEnglish (US)
Pages (from-to)188-196
Number of pages9
JournalGenome Research
Volume18
Issue number1
DOIs
StatePublished - Jan 1 2008

Fingerprint

Genome
Databases
Planarians
Genes
Molecular Sequence Annotation
Expressed Sequence Tags
Computational Biology
Research Personnel
Proteins

ASJC Scopus subject areas

  • Genetics
  • Genetics(clinical)

Cite this

Cantarel, B. L., Korf, I., Robb, S. M. C., Parra, G., Ross, E., Moore, B., ... Yandell, M. (2008). MAKER: An easy-to-use annotation pipeline designed for emerging model organism genomes. Genome Research, 18(1), 188-196. https://doi.org/10.1101/gr.6743907

MAKER : An easy-to-use annotation pipeline designed for emerging model organism genomes. / Cantarel, Brandi L.; Korf, Ian; Robb, Sofia M.C.; Parra, Genis; Ross, Eric; Moore, Barry; Holt, Carson; Alvarado, Alejandro Sánchez; Yandell, Mark.

In: Genome Research, Vol. 18, No. 1, 01.01.2008, p. 188-196.

Research output: Contribution to journalArticle

Cantarel, BL, Korf, I, Robb, SMC, Parra, G, Ross, E, Moore, B, Holt, C, Alvarado, AS & Yandell, M 2008, 'MAKER: An easy-to-use annotation pipeline designed for emerging model organism genomes', Genome Research, vol. 18, no. 1, pp. 188-196. https://doi.org/10.1101/gr.6743907
Cantarel, Brandi L. ; Korf, Ian ; Robb, Sofia M.C. ; Parra, Genis ; Ross, Eric ; Moore, Barry ; Holt, Carson ; Alvarado, Alejandro Sánchez ; Yandell, Mark. / MAKER : An easy-to-use annotation pipeline designed for emerging model organism genomes. In: Genome Research. 2008 ; Vol. 18, No. 1. pp. 188-196.
@article{386971dc3f944dfbb91be26777327af5,
title = "MAKER: An easy-to-use annotation pipeline designed for emerging model organism genomes",
abstract = "We have developed a portable and easily configurable genome annotation pipeline called MAKER. Its purpose is to allow investigators to independently annotate eukaryotic genomes and create genome databases. MAKER identifies repeats, aligns ESTs and proteins to a genome, produces ab initio gene predictions, and automatically synthesizes these data into gene annotations having evidence-based quality indices. MAKER is also easily trainable: Outputs of preliminary runs are used to automatically retrain its gene-prediction algorithm, producing higher-quality gene-models on subsequent runs. MAKER's inputs are minimal, and its outputs can be used to create a GMOD database. Its outputs can also be viewed in the Apollo Genome browser; this feature of MAKER provides an easy means to annotate, view, and edit individual contigs and BACs without the overhead of a database. As proof of principle, we have used MAKER to annotate the genome of the planarian Schmidtea mediterranea and to create a new genome database, SmedGD. We have also compared MAKER's performance to other published annotation pipelines. Our results demonstrate that MAKER provides a simple and effective means to convert a genome sequence into a community-accessible genome database. MAKER should prove especially useful for emerging model organism genome projects for which extensive bioinformatics resources may not be readily available.",
author = "Cantarel, {Brandi L.} and Ian Korf and Robb, {Sofia M.C.} and Genis Parra and Eric Ross and Barry Moore and Carson Holt and Alvarado, {Alejandro S{\'a}nchez} and Mark Yandell",
year = "2008",
month = "1",
day = "1",
doi = "10.1101/gr.6743907",
language = "English (US)",
volume = "18",
pages = "188--196",
journal = "Genome Research",
issn = "1088-9051",
publisher = "Cold Spring Harbor Laboratory Press",
number = "1",

}

TY - JOUR

T1 - MAKER

T2 - An easy-to-use annotation pipeline designed for emerging model organism genomes

AU - Cantarel, Brandi L.

AU - Korf, Ian

AU - Robb, Sofia M.C.

AU - Parra, Genis

AU - Ross, Eric

AU - Moore, Barry

AU - Holt, Carson

AU - Alvarado, Alejandro Sánchez

AU - Yandell, Mark

PY - 2008/1/1

Y1 - 2008/1/1

N2 - We have developed a portable and easily configurable genome annotation pipeline called MAKER. Its purpose is to allow investigators to independently annotate eukaryotic genomes and create genome databases. MAKER identifies repeats, aligns ESTs and proteins to a genome, produces ab initio gene predictions, and automatically synthesizes these data into gene annotations having evidence-based quality indices. MAKER is also easily trainable: Outputs of preliminary runs are used to automatically retrain its gene-prediction algorithm, producing higher-quality gene-models on subsequent runs. MAKER's inputs are minimal, and its outputs can be used to create a GMOD database. Its outputs can also be viewed in the Apollo Genome browser; this feature of MAKER provides an easy means to annotate, view, and edit individual contigs and BACs without the overhead of a database. As proof of principle, we have used MAKER to annotate the genome of the planarian Schmidtea mediterranea and to create a new genome database, SmedGD. We have also compared MAKER's performance to other published annotation pipelines. Our results demonstrate that MAKER provides a simple and effective means to convert a genome sequence into a community-accessible genome database. MAKER should prove especially useful for emerging model organism genome projects for which extensive bioinformatics resources may not be readily available.

AB - We have developed a portable and easily configurable genome annotation pipeline called MAKER. Its purpose is to allow investigators to independently annotate eukaryotic genomes and create genome databases. MAKER identifies repeats, aligns ESTs and proteins to a genome, produces ab initio gene predictions, and automatically synthesizes these data into gene annotations having evidence-based quality indices. MAKER is also easily trainable: Outputs of preliminary runs are used to automatically retrain its gene-prediction algorithm, producing higher-quality gene-models on subsequent runs. MAKER's inputs are minimal, and its outputs can be used to create a GMOD database. Its outputs can also be viewed in the Apollo Genome browser; this feature of MAKER provides an easy means to annotate, view, and edit individual contigs and BACs without the overhead of a database. As proof of principle, we have used MAKER to annotate the genome of the planarian Schmidtea mediterranea and to create a new genome database, SmedGD. We have also compared MAKER's performance to other published annotation pipelines. Our results demonstrate that MAKER provides a simple and effective means to convert a genome sequence into a community-accessible genome database. MAKER should prove especially useful for emerging model organism genome projects for which extensive bioinformatics resources may not be readily available.

UR - http://www.scopus.com/inward/record.url?scp=38049057159&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=38049057159&partnerID=8YFLogxK

U2 - 10.1101/gr.6743907

DO - 10.1101/gr.6743907

M3 - Article

VL - 18

SP - 188

EP - 196

JO - Genome Research

JF - Genome Research

SN - 1088-9051

IS - 1

ER -