Genomic privacy

Abraham P. Schwab, Hung S Luu, Jason Wang, Jason Y Park

Research output: Contribution to journalReview article

1 Citation (Scopus)

Abstract

BACKGROUND: Genetic information is unique among all laboratory data because it not only informs the current health of the specific person tested but may also be predictive of the future health of the individual and, to varying degrees, all biological relatives. CONTENT: As DNA sequencing has become ubiquitous with decreasing cost, large repositories of genomic data have emerged from the domains of research, healthcare, law enforcement, international security, and recreational consumer interest (i.e., genealogy). Broadly shared genomic data are believed to be a key element for future discoveries in human disease. For example, the National Cancer Institute's Genomic Data Commons is designed to promote cancer research discoveries by providing free access to the genome data sets of 12000 cancer patients. However, in parallel with the promise of curing diseases, genomic data also have the potential for harm. Genomic data that are deidentified by standard healthcare practices (e.g., removal of name, date of birth) can be reidentified by methods that combine genomic software with publicly available demographic databases (e.g., phone book). Recent law enforcement cases (i.e., Bear Brook Murders, Golden State Killer) in theUShave demonstrated the power of combiningDNA profiles with genealogy databases. SUMMARY: We examine the current environment of genomic privacy and confidentiality in the US and describe current and future risks to genomic privacy. Reidentification and inference of genetic information of biological relatives will become more important as larger databases of clinical, criminal, and recreational genomic information are developed over the next decade.

Original languageEnglish (US)
Pages (from-to)1696-1703
Number of pages8
JournalClinical Chemistry
Volume64
Issue number12
DOIs
StatePublished - Dec 1 2018

Fingerprint

Privacy
Genealogy and Heraldry
Law Enforcement
Law enforcement
Databases
Health
Ursidae
National Cancer Institute (U.S.)
Homicide
Health Services Research
Confidentiality
DNA Sequence Analysis
Names
Curing
Neoplasms
Software
Genes
Demography
Parturition
Genome

ASJC Scopus subject areas

  • Clinical Biochemistry
  • Biochemistry, medical

Cite this

Genomic privacy. / Schwab, Abraham P.; Luu, Hung S; Wang, Jason; Park, Jason Y.

In: Clinical Chemistry, Vol. 64, No. 12, 01.12.2018, p. 1696-1703.

Research output: Contribution to journalReview article

Schwab, Abraham P. ; Luu, Hung S ; Wang, Jason ; Park, Jason Y. / Genomic privacy. In: Clinical Chemistry. 2018 ; Vol. 64, No. 12. pp. 1696-1703.
@article{344ca549279049d5b7f5c88d42bf1b5d,
title = "Genomic privacy",
abstract = "BACKGROUND: Genetic information is unique among all laboratory data because it not only informs the current health of the specific person tested but may also be predictive of the future health of the individual and, to varying degrees, all biological relatives. CONTENT: As DNA sequencing has become ubiquitous with decreasing cost, large repositories of genomic data have emerged from the domains of research, healthcare, law enforcement, international security, and recreational consumer interest (i.e., genealogy). Broadly shared genomic data are believed to be a key element for future discoveries in human disease. For example, the National Cancer Institute's Genomic Data Commons is designed to promote cancer research discoveries by providing free access to the genome data sets of 12000 cancer patients. However, in parallel with the promise of curing diseases, genomic data also have the potential for harm. Genomic data that are deidentified by standard healthcare practices (e.g., removal of name, date of birth) can be reidentified by methods that combine genomic software with publicly available demographic databases (e.g., phone book). Recent law enforcement cases (i.e., Bear Brook Murders, Golden State Killer) in theUShave demonstrated the power of combiningDNA profiles with genealogy databases. SUMMARY: We examine the current environment of genomic privacy and confidentiality in the US and describe current and future risks to genomic privacy. Reidentification and inference of genetic information of biological relatives will become more important as larger databases of clinical, criminal, and recreational genomic information are developed over the next decade.",
author = "Schwab, {Abraham P.} and Luu, {Hung S} and Jason Wang and Park, {Jason Y}",
year = "2018",
month = "12",
day = "1",
doi = "10.1373/clinchem.2018.289512",
language = "English (US)",
volume = "64",
pages = "1696--1703",
journal = "Clinical Chemistry",
issn = "0009-9147",
publisher = "American Association for Clinical Chemistry Inc.",
number = "12",

}

TY - JOUR

T1 - Genomic privacy

AU - Schwab, Abraham P.

AU - Luu, Hung S

AU - Wang, Jason

AU - Park, Jason Y

PY - 2018/12/1

Y1 - 2018/12/1

N2 - BACKGROUND: Genetic information is unique among all laboratory data because it not only informs the current health of the specific person tested but may also be predictive of the future health of the individual and, to varying degrees, all biological relatives. CONTENT: As DNA sequencing has become ubiquitous with decreasing cost, large repositories of genomic data have emerged from the domains of research, healthcare, law enforcement, international security, and recreational consumer interest (i.e., genealogy). Broadly shared genomic data are believed to be a key element for future discoveries in human disease. For example, the National Cancer Institute's Genomic Data Commons is designed to promote cancer research discoveries by providing free access to the genome data sets of 12000 cancer patients. However, in parallel with the promise of curing diseases, genomic data also have the potential for harm. Genomic data that are deidentified by standard healthcare practices (e.g., removal of name, date of birth) can be reidentified by methods that combine genomic software with publicly available demographic databases (e.g., phone book). Recent law enforcement cases (i.e., Bear Brook Murders, Golden State Killer) in theUShave demonstrated the power of combiningDNA profiles with genealogy databases. SUMMARY: We examine the current environment of genomic privacy and confidentiality in the US and describe current and future risks to genomic privacy. Reidentification and inference of genetic information of biological relatives will become more important as larger databases of clinical, criminal, and recreational genomic information are developed over the next decade.

AB - BACKGROUND: Genetic information is unique among all laboratory data because it not only informs the current health of the specific person tested but may also be predictive of the future health of the individual and, to varying degrees, all biological relatives. CONTENT: As DNA sequencing has become ubiquitous with decreasing cost, large repositories of genomic data have emerged from the domains of research, healthcare, law enforcement, international security, and recreational consumer interest (i.e., genealogy). Broadly shared genomic data are believed to be a key element for future discoveries in human disease. For example, the National Cancer Institute's Genomic Data Commons is designed to promote cancer research discoveries by providing free access to the genome data sets of 12000 cancer patients. However, in parallel with the promise of curing diseases, genomic data also have the potential for harm. Genomic data that are deidentified by standard healthcare practices (e.g., removal of name, date of birth) can be reidentified by methods that combine genomic software with publicly available demographic databases (e.g., phone book). Recent law enforcement cases (i.e., Bear Brook Murders, Golden State Killer) in theUShave demonstrated the power of combiningDNA profiles with genealogy databases. SUMMARY: We examine the current environment of genomic privacy and confidentiality in the US and describe current and future risks to genomic privacy. Reidentification and inference of genetic information of biological relatives will become more important as larger databases of clinical, criminal, and recreational genomic information are developed over the next decade.

UR - http://www.scopus.com/inward/record.url?scp=85057537226&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85057537226&partnerID=8YFLogxK

U2 - 10.1373/clinchem.2018.289512

DO - 10.1373/clinchem.2018.289512

M3 - Review article

C2 - 29991478

AN - SCOPUS:85057537226

VL - 64

SP - 1696

EP - 1703

JO - Clinical Chemistry

JF - Clinical Chemistry

SN - 0009-9147

IS - 12

ER -