TY - JOUR
T1 - MB-GAN
T2 - Microbiome Simulation via Generative Adversarial Network
AU - Rong, Ruichen
AU - Jiang, Shuang
AU - Xu, Lin
AU - Xiao, Guanghua
AU - Xie, Yang
AU - Liu, Dajiang J.
AU - Li, Qiwei
AU - Zhan, Xiaowei
N1 - Publisher Copyright:
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Copyright:
Copyright 2020 Elsevier B.V., All rights reserved.
PY - 2019/12/4
Y1 - 2019/12/4
N2 - Simulation is a critical component of experimental design and evaluation of analysis methods in microbiome association studies. However, statistically modeling the microbiome data is challenging since that the complex structure in the real data is difficult to be fully represented by statistical models. To address this challenge, we designed a novel simulation framework for microbiome data using a generative adversarial network (GAN), called MB-GAN, by utilizing methodology advancements from the deep learning community. MB-GAN can automatically learn from a given dataset and compute simulated datasets that are indistinguishable from it. When MB-GAN was applied to a case-control microbiome study of 396 samples, we demonstrated that the simulated data and the original data had similar first-order and second-order properties, including sparsity, diversities, and taxa-taxa correlations. These advantages are suitable for further microbiome methodology development where high fidelity microbiome data are needed.
AB - Simulation is a critical component of experimental design and evaluation of analysis methods in microbiome association studies. However, statistically modeling the microbiome data is challenging since that the complex structure in the real data is difficult to be fully represented by statistical models. To address this challenge, we designed a novel simulation framework for microbiome data using a generative adversarial network (GAN), called MB-GAN, by utilizing methodology advancements from the deep learning community. MB-GAN can automatically learn from a given dataset and compute simulated datasets that are indistinguishable from it. When MB-GAN was applied to a case-control microbiome study of 396 samples, we demonstrated that the simulated data and the original data had similar first-order and second-order properties, including sparsity, diversities, and taxa-taxa correlations. These advantages are suitable for further microbiome methodology development where high fidelity microbiome data are needed.
KW - Deep learning
KW - Generative adversarial network
KW - Microbiome simulation
UR - http://www.scopus.com/inward/record.url?scp=85094341879&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85094341879&partnerID=8YFLogxK
U2 - 10.1101/863977
DO - 10.1101/863977
M3 - Article
AN - SCOPUS:85094341879
JO - Seminars in Fetal and Neonatal Medicine
JF - Seminars in Fetal and Neonatal Medicine
SN - 1744-165X
ER -