TY - JOUR
T1 - Machine learning reveals bilateral distribution of somatic L1 insertions in human neurons and glia
AU - Brain Somatic Mosaicism Network
AU - Zhu, Xiaowei
AU - Zhou, Bo
AU - Pattni, Reenal
AU - Gleason, Kelly
AU - Tan, Chunfeng
AU - Kalinowski, Agnieszka
AU - Sloan, Steven
AU - Fiston-Lavier, Anna Sophie
AU - Mariani, Jessica
AU - Abyzov, Alexej
AU - Petrov, Dimitri
AU - Barres, Ben A.
AU - Vogel, Hannes
AU - Moran, John V.
AU - Vaccarino, Flora M.
AU - Tamminga, Carol A.
AU - Levinson, Douglas F.
AU - Urban, Alexander E.
N1 - Publisher Copyright:
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Copyright:
Copyright 2020 Elsevier B.V., All rights reserved.
PY - 2019/6/6
Y1 - 2019/6/6
N2 - Active retrotransposons in the human genome (L1, Alu and SVA elements) can create genomic mobile element insertions (MEIs) in both germline and somatic tissue1. Specific somatic MEIs have been detected at high levels in human cancers2, and at lower to medium levels in human brains3. Dysregulation of somatic retrotransposition in the human brain has been hypothesized to contribute to neuropsychiatric diseases4,5. However, individual somatic MEIs are present in small proportions of cells at a given anatomical location, and thus standard whole-genome sequencing (WGS) presents a difficult signal-to-noise problem, while single-cell approaches suffer from limited scalability and experimental artifacts introduced by enzymatic whole-genome amplification6. Previous studies produced widely differing estimates for the somatic retrotransposition rates in human brain3,6-8. Here, we present a highly precise machine learning method (RetroSom) to directly identify somatic L1 and Alu insertions in <1% cells from 200× deep WGS, which allows circumventing the restrictions of whole-genome amplification. Using RetroSom we confirmed a lower rate of retrotransposition for individual somatic L1 insertions in human neurons. We discovered that anatomical distribution of somatic L1 insertion is as widespread in glia as in neurons, and across both hemispheres of the brain, indicating retrotransposition occurs during early embryogenesis. We characterized two of the detected brain-specific L1 insertions in great detail in neurons and glia from a donor with schizophrenia. Both insertions are within introns of genes active in brain (CNNM2, FRMD4A) in regions with multiple genetic associations with neuropsychiatric disorders9-11. Gene expression was significantly reduced by both somatic insertions in a reporter assay. Our results provide novel insights into the potential for pathological effects of somatic retrotransposition in the human brain, now including the large glial fraction. RetroSom has broad applicability in all disease states where somatic retrotransposition is expected to play a role, such as autoimmune disorders and cancer.
AB - Active retrotransposons in the human genome (L1, Alu and SVA elements) can create genomic mobile element insertions (MEIs) in both germline and somatic tissue1. Specific somatic MEIs have been detected at high levels in human cancers2, and at lower to medium levels in human brains3. Dysregulation of somatic retrotransposition in the human brain has been hypothesized to contribute to neuropsychiatric diseases4,5. However, individual somatic MEIs are present in small proportions of cells at a given anatomical location, and thus standard whole-genome sequencing (WGS) presents a difficult signal-to-noise problem, while single-cell approaches suffer from limited scalability and experimental artifacts introduced by enzymatic whole-genome amplification6. Previous studies produced widely differing estimates for the somatic retrotransposition rates in human brain3,6-8. Here, we present a highly precise machine learning method (RetroSom) to directly identify somatic L1 and Alu insertions in <1% cells from 200× deep WGS, which allows circumventing the restrictions of whole-genome amplification. Using RetroSom we confirmed a lower rate of retrotransposition for individual somatic L1 insertions in human neurons. We discovered that anatomical distribution of somatic L1 insertion is as widespread in glia as in neurons, and across both hemispheres of the brain, indicating retrotransposition occurs during early embryogenesis. We characterized two of the detected brain-specific L1 insertions in great detail in neurons and glia from a donor with schizophrenia. Both insertions are within introns of genes active in brain (CNNM2, FRMD4A) in regions with multiple genetic associations with neuropsychiatric disorders9-11. Gene expression was significantly reduced by both somatic insertions in a reporter assay. Our results provide novel insights into the potential for pathological effects of somatic retrotransposition in the human brain, now including the large glial fraction. RetroSom has broad applicability in all disease states where somatic retrotransposition is expected to play a role, such as autoimmune disorders and cancer.
UR - http://www.scopus.com/inward/record.url?scp=85095618121&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85095618121&partnerID=8YFLogxK
U2 - 10.1101/660779
DO - 10.1101/660779
M3 - Article
AN - SCOPUS:85095618121
JO - Seminars in Fetal and Neonatal Medicine
JF - Seminars in Fetal and Neonatal Medicine
SN - 1744-165X
ER -