Principles of metadata organization at the ENCODE data coordination center

Eurie L. Hong, Cricket A. Sloan, Esther T. Chan, Jean M. Davidson, Venkat S. Malladi, J. Seth Strattan, Benjamin C. Hitz, Idan Gabdank, Aditi K. Narayanan, Marcus Ho, Brian T. Lee, Laurence D. Rowe, Timothy R. Dreszer, Greg R. Roe, Nikhil R. Podduturi, Forrest Tanaka, Jason A. Hilton, J. Michael Cherry

Research output: Contribution to journalArticle

17 Citations (Scopus)

Abstract

The Encyclopedia of DNA Elements (ENCODE) Data Coordinating Center (DCC) is responsible for organizing, describing and providing access to the diverse data generated by the ENCODE project. The description of these data, known as metadata, includes the biological sample used as input, the protocols and assays performed on these samples, the data files generated from the results and the computational methods used to analyze the data. Here, we outline the principles and philosophy used to define the ENCODE metadata in order to create a metadata standard that can be applied to diverse assays and multiple genomic projects. In addition, we present how the data are validated and used by the ENCODE DCC in creating the ENCODE Portal (https://www.encodeproject.org/).

Original languageEnglish (US)
Article numberbaw001
JournalDatabase
Volume2016
DOIs
StatePublished - Jan 1 2016
Externally publishedYes

Fingerprint

Encyclopedias
Metadata
DNA
Assays
Information Storage and Retrieval
assays
Computational methods
genomics
sampling

ASJC Scopus subject areas

  • Information Systems
  • Biochemistry, Genetics and Molecular Biology(all)
  • Agricultural and Biological Sciences(all)

Cite this

Hong, E. L., Sloan, C. A., Chan, E. T., Davidson, J. M., Malladi, V. S., Strattan, J. S., ... Cherry, J. M. (2016). Principles of metadata organization at the ENCODE data coordination center. Database, 2016, [baw001]. https://doi.org/10.1093/database/baw001

Principles of metadata organization at the ENCODE data coordination center. / Hong, Eurie L.; Sloan, Cricket A.; Chan, Esther T.; Davidson, Jean M.; Malladi, Venkat S.; Strattan, J. Seth; Hitz, Benjamin C.; Gabdank, Idan; Narayanan, Aditi K.; Ho, Marcus; Lee, Brian T.; Rowe, Laurence D.; Dreszer, Timothy R.; Roe, Greg R.; Podduturi, Nikhil R.; Tanaka, Forrest; Hilton, Jason A.; Cherry, J. Michael.

In: Database, Vol. 2016, baw001, 01.01.2016.

Research output: Contribution to journalArticle

Hong, EL, Sloan, CA, Chan, ET, Davidson, JM, Malladi, VS, Strattan, JS, Hitz, BC, Gabdank, I, Narayanan, AK, Ho, M, Lee, BT, Rowe, LD, Dreszer, TR, Roe, GR, Podduturi, NR, Tanaka, F, Hilton, JA & Cherry, JM 2016, 'Principles of metadata organization at the ENCODE data coordination center', Database, vol. 2016, baw001. https://doi.org/10.1093/database/baw001
Hong EL, Sloan CA, Chan ET, Davidson JM, Malladi VS, Strattan JS et al. Principles of metadata organization at the ENCODE data coordination center. Database. 2016 Jan 1;2016. baw001. https://doi.org/10.1093/database/baw001
Hong, Eurie L. ; Sloan, Cricket A. ; Chan, Esther T. ; Davidson, Jean M. ; Malladi, Venkat S. ; Strattan, J. Seth ; Hitz, Benjamin C. ; Gabdank, Idan ; Narayanan, Aditi K. ; Ho, Marcus ; Lee, Brian T. ; Rowe, Laurence D. ; Dreszer, Timothy R. ; Roe, Greg R. ; Podduturi, Nikhil R. ; Tanaka, Forrest ; Hilton, Jason A. ; Cherry, J. Michael. / Principles of metadata organization at the ENCODE data coordination center. In: Database. 2016 ; Vol. 2016.
@article{d5ceee9a2d79439b9b77c91db63b654c,
title = "Principles of metadata organization at the ENCODE data coordination center",
abstract = "The Encyclopedia of DNA Elements (ENCODE) Data Coordinating Center (DCC) is responsible for organizing, describing and providing access to the diverse data generated by the ENCODE project. The description of these data, known as metadata, includes the biological sample used as input, the protocols and assays performed on these samples, the data files generated from the results and the computational methods used to analyze the data. Here, we outline the principles and philosophy used to define the ENCODE metadata in order to create a metadata standard that can be applied to diverse assays and multiple genomic projects. In addition, we present how the data are validated and used by the ENCODE DCC in creating the ENCODE Portal (https://www.encodeproject.org/).",
author = "Hong, {Eurie L.} and Sloan, {Cricket A.} and Chan, {Esther T.} and Davidson, {Jean M.} and Malladi, {Venkat S.} and Strattan, {J. Seth} and Hitz, {Benjamin C.} and Idan Gabdank and Narayanan, {Aditi K.} and Marcus Ho and Lee, {Brian T.} and Rowe, {Laurence D.} and Dreszer, {Timothy R.} and Roe, {Greg R.} and Podduturi, {Nikhil R.} and Forrest Tanaka and Hilton, {Jason A.} and Cherry, {J. Michael}",
year = "2016",
month = "1",
day = "1",
doi = "10.1093/database/baw001",
language = "English (US)",
volume = "2016",
journal = "Database : the journal of biological databases and curation",
issn = "1758-0463",
publisher = "Oxford University Press",

}

TY - JOUR

T1 - Principles of metadata organization at the ENCODE data coordination center

AU - Hong, Eurie L.

AU - Sloan, Cricket A.

AU - Chan, Esther T.

AU - Davidson, Jean M.

AU - Malladi, Venkat S.

AU - Strattan, J. Seth

AU - Hitz, Benjamin C.

AU - Gabdank, Idan

AU - Narayanan, Aditi K.

AU - Ho, Marcus

AU - Lee, Brian T.

AU - Rowe, Laurence D.

AU - Dreszer, Timothy R.

AU - Roe, Greg R.

AU - Podduturi, Nikhil R.

AU - Tanaka, Forrest

AU - Hilton, Jason A.

AU - Cherry, J. Michael

PY - 2016/1/1

Y1 - 2016/1/1

N2 - The Encyclopedia of DNA Elements (ENCODE) Data Coordinating Center (DCC) is responsible for organizing, describing and providing access to the diverse data generated by the ENCODE project. The description of these data, known as metadata, includes the biological sample used as input, the protocols and assays performed on these samples, the data files generated from the results and the computational methods used to analyze the data. Here, we outline the principles and philosophy used to define the ENCODE metadata in order to create a metadata standard that can be applied to diverse assays and multiple genomic projects. In addition, we present how the data are validated and used by the ENCODE DCC in creating the ENCODE Portal (https://www.encodeproject.org/).

AB - The Encyclopedia of DNA Elements (ENCODE) Data Coordinating Center (DCC) is responsible for organizing, describing and providing access to the diverse data generated by the ENCODE project. The description of these data, known as metadata, includes the biological sample used as input, the protocols and assays performed on these samples, the data files generated from the results and the computational methods used to analyze the data. Here, we outline the principles and philosophy used to define the ENCODE metadata in order to create a metadata standard that can be applied to diverse assays and multiple genomic projects. In addition, we present how the data are validated and used by the ENCODE DCC in creating the ENCODE Portal (https://www.encodeproject.org/).

UR - http://www.scopus.com/inward/record.url?scp=84964862392&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84964862392&partnerID=8YFLogxK

U2 - 10.1093/database/baw001

DO - 10.1093/database/baw001

M3 - Article

C2 - 26980513

AN - SCOPUS:84964862392

VL - 2016

JO - Database : the journal of biological databases and curation

JF - Database : the journal of biological databases and curation

SN - 1758-0463

M1 - baw001

ER -