Sensitive and accurate detection of copy number variants using read depth of coverage

Seungtai Yoon, Zhenyu Xuan, Vladimir Makarov, Kenny Ye, Jonathan Sebat

Research output: Contribution to journalArticle

364 Citations (Scopus)

Abstract

Methods for the direct detection of copy number variation (CNV) genome-wide have become effective instruments for identifying genetic risk factors for disease. The application of next-generation sequencing platforms to genetic studies promises to improve sensitivity to detect CNVs as well as inversions, indels, and SNPs. New computational approaches are needed to systematically detect these variants from genome sequence data. Existing sequence-based approaches for CNV detection are primarily based on paired-end read mapping (PEM) as reported previously by Tuzun et al. and Korbel et al. Due to limitations of the PEM approach, some classes of CNVs are difficult to ascertain, including large insertions and variants located within complex genomic regions. To overcome these limitations, we developed a method for CNV detection using read depth of coverage. Event-wise testing (EWT) is a method based on significance testing. In contrast to standard segmentation algorithms that typically operate by performing likelihood evaluation for every point in the genome, EWT works on intervals of data points, rapidly searching for specific classes of events. Overall false-positive rate is controlled by testing the significance of each possible event and adjusting for multiple testing. Deletions and duplications detected in an individual genome by EWT are examined across multiple genomes to identify polymorphism between individuals. We estimated error rates using simulations based on real data, and we applied EWT to the analysis of chromosome 1 from paired-end shotgun sequence data (303) on five individuals. Our results suggest that analysis of read depth is an effective approach for the detection of CNVs, and it captures structural variants that are refractory to established PEM-based methods.

Original languageEnglish (US)
Pages (from-to)1586-1592
Number of pages7
JournalGenome Research
Volume19
Issue number9
DOIs
StatePublished - Sep 1 2009
Externally publishedYes

Fingerprint

Genome
Chromosomes, Human, Pair 1
Firearms
Single Nucleotide Polymorphism

ASJC Scopus subject areas

  • Genetics
  • Genetics(clinical)

Cite this

Sensitive and accurate detection of copy number variants using read depth of coverage. / Yoon, Seungtai; Xuan, Zhenyu; Makarov, Vladimir; Ye, Kenny; Sebat, Jonathan.

In: Genome Research, Vol. 19, No. 9, 01.09.2009, p. 1586-1592.

Research output: Contribution to journalArticle

Yoon, Seungtai ; Xuan, Zhenyu ; Makarov, Vladimir ; Ye, Kenny ; Sebat, Jonathan. / Sensitive and accurate detection of copy number variants using read depth of coverage. In: Genome Research. 2009 ; Vol. 19, No. 9. pp. 1586-1592.
@article{d8e0ee3ebb8146888195a1c1bf3da1f6,
title = "Sensitive and accurate detection of copy number variants using read depth of coverage",
abstract = "Methods for the direct detection of copy number variation (CNV) genome-wide have become effective instruments for identifying genetic risk factors for disease. The application of next-generation sequencing platforms to genetic studies promises to improve sensitivity to detect CNVs as well as inversions, indels, and SNPs. New computational approaches are needed to systematically detect these variants from genome sequence data. Existing sequence-based approaches for CNV detection are primarily based on paired-end read mapping (PEM) as reported previously by Tuzun et al. and Korbel et al. Due to limitations of the PEM approach, some classes of CNVs are difficult to ascertain, including large insertions and variants located within complex genomic regions. To overcome these limitations, we developed a method for CNV detection using read depth of coverage. Event-wise testing (EWT) is a method based on significance testing. In contrast to standard segmentation algorithms that typically operate by performing likelihood evaluation for every point in the genome, EWT works on intervals of data points, rapidly searching for specific classes of events. Overall false-positive rate is controlled by testing the significance of each possible event and adjusting for multiple testing. Deletions and duplications detected in an individual genome by EWT are examined across multiple genomes to identify polymorphism between individuals. We estimated error rates using simulations based on real data, and we applied EWT to the analysis of chromosome 1 from paired-end shotgun sequence data (303) on five individuals. Our results suggest that analysis of read depth is an effective approach for the detection of CNVs, and it captures structural variants that are refractory to established PEM-based methods.",
author = "Seungtai Yoon and Zhenyu Xuan and Vladimir Makarov and Kenny Ye and Jonathan Sebat",
year = "2009",
month = "9",
day = "1",
doi = "10.1101/gr.092981.109",
language = "English (US)",
volume = "19",
pages = "1586--1592",
journal = "Genome Research",
issn = "1088-9051",
publisher = "Cold Spring Harbor Laboratory Press",
number = "9",

}

TY - JOUR

T1 - Sensitive and accurate detection of copy number variants using read depth of coverage

AU - Yoon, Seungtai

AU - Xuan, Zhenyu

AU - Makarov, Vladimir

AU - Ye, Kenny

AU - Sebat, Jonathan

PY - 2009/9/1

Y1 - 2009/9/1

N2 - Methods for the direct detection of copy number variation (CNV) genome-wide have become effective instruments for identifying genetic risk factors for disease. The application of next-generation sequencing platforms to genetic studies promises to improve sensitivity to detect CNVs as well as inversions, indels, and SNPs. New computational approaches are needed to systematically detect these variants from genome sequence data. Existing sequence-based approaches for CNV detection are primarily based on paired-end read mapping (PEM) as reported previously by Tuzun et al. and Korbel et al. Due to limitations of the PEM approach, some classes of CNVs are difficult to ascertain, including large insertions and variants located within complex genomic regions. To overcome these limitations, we developed a method for CNV detection using read depth of coverage. Event-wise testing (EWT) is a method based on significance testing. In contrast to standard segmentation algorithms that typically operate by performing likelihood evaluation for every point in the genome, EWT works on intervals of data points, rapidly searching for specific classes of events. Overall false-positive rate is controlled by testing the significance of each possible event and adjusting for multiple testing. Deletions and duplications detected in an individual genome by EWT are examined across multiple genomes to identify polymorphism between individuals. We estimated error rates using simulations based on real data, and we applied EWT to the analysis of chromosome 1 from paired-end shotgun sequence data (303) on five individuals. Our results suggest that analysis of read depth is an effective approach for the detection of CNVs, and it captures structural variants that are refractory to established PEM-based methods.

AB - Methods for the direct detection of copy number variation (CNV) genome-wide have become effective instruments for identifying genetic risk factors for disease. The application of next-generation sequencing platforms to genetic studies promises to improve sensitivity to detect CNVs as well as inversions, indels, and SNPs. New computational approaches are needed to systematically detect these variants from genome sequence data. Existing sequence-based approaches for CNV detection are primarily based on paired-end read mapping (PEM) as reported previously by Tuzun et al. and Korbel et al. Due to limitations of the PEM approach, some classes of CNVs are difficult to ascertain, including large insertions and variants located within complex genomic regions. To overcome these limitations, we developed a method for CNV detection using read depth of coverage. Event-wise testing (EWT) is a method based on significance testing. In contrast to standard segmentation algorithms that typically operate by performing likelihood evaluation for every point in the genome, EWT works on intervals of data points, rapidly searching for specific classes of events. Overall false-positive rate is controlled by testing the significance of each possible event and adjusting for multiple testing. Deletions and duplications detected in an individual genome by EWT are examined across multiple genomes to identify polymorphism between individuals. We estimated error rates using simulations based on real data, and we applied EWT to the analysis of chromosome 1 from paired-end shotgun sequence data (303) on five individuals. Our results suggest that analysis of read depth is an effective approach for the detection of CNVs, and it captures structural variants that are refractory to established PEM-based methods.

UR - http://www.scopus.com/inward/record.url?scp=69749122557&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=69749122557&partnerID=8YFLogxK

U2 - 10.1101/gr.092981.109

DO - 10.1101/gr.092981.109

M3 - Article

C2 - 19657104

AN - SCOPUS:69749122557

VL - 19

SP - 1586

EP - 1592

JO - Genome Research

JF - Genome Research

SN - 1088-9051

IS - 9

ER -