TY - JOUR
T1 - Sensitive and specific post-call filtering of genetic variants in xenograft and primary tumors
AU - Mannakee, Brian K.
AU - Balaji, Uthra
AU - Witkiewicz, Agnieszka K.
AU - Gutenkunst, Ryan N.
AU - Knudsen, Erik
N1 - Publisher Copyright:
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Copyright:
Copyright 2020 Elsevier B.V., All rights reserved.
PY - 2017/9/12
Y1 - 2017/9/12
N2 - Motivation: Tumor genome sequencing offers great promise for guiding research and therapy, but spurious variant calls can arise from multiple sources. Mouse contamination can generate many spurious calls when sequencing patient-derived xenografts (PDXs). Paralogous genome sequences can also generate spurious calls when sequencing any tumor. We developed a BLAST-based algorithm, MAPEX, to identify and filter out spurious calls from both these sources. Results: When calling variants from xenografts, MAPEX has similar sensitivity and specificity to more complex algorithms. When applied to any tumor, MAPEX also automatically flags calls that potentially arise from paralogous sequences. Our implementation, mapexr, runs quickly and easily on a desktop computer. MAPEX is thus a useful addition to almost any pipeline for calling genetic variants in tumors.
AB - Motivation: Tumor genome sequencing offers great promise for guiding research and therapy, but spurious variant calls can arise from multiple sources. Mouse contamination can generate many spurious calls when sequencing patient-derived xenografts (PDXs). Paralogous genome sequences can also generate spurious calls when sequencing any tumor. We developed a BLAST-based algorithm, MAPEX, to identify and filter out spurious calls from both these sources. Results: When calling variants from xenografts, MAPEX has similar sensitivity and specificity to more complex algorithms. When applied to any tumor, MAPEX also automatically flags calls that potentially arise from paralogous sequences. Our implementation, mapexr, runs quickly and easily on a desktop computer. MAPEX is thus a useful addition to almost any pipeline for calling genetic variants in tumors.
UR - http://www.scopus.com/inward/record.url?scp=85094350854&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85094350854&partnerID=8YFLogxK
U2 - 10.1101/187468
DO - 10.1101/187468
M3 - Article
AN - SCOPUS:85094350854
JO - Seminars in Fetal and Neonatal Medicine
JF - Seminars in Fetal and Neonatal Medicine
SN - 1744-165X
ER -