Scina

Semi-supervised analysis of single cells in silico

Ze Zhang, Danni Luo, Xue Zhong, Jin Huk Choi, Yuanqing Ma, Stacy Wang, Elena Mahrt, Wei Guo, Eric W. Stawiski, Zora Modrusan, Somasekar Seshagiri, Payal Kapur, Gary Hon, James B Brugarolas, Tao Wang

Research output: Contribution to journalArticle

Abstract

Advances in single-cell RNA sequencing (scRNA-Seq) have allowed for comprehensive analyses of single cell data. However, current analyses of scRNA-Seq data usually start from unsupervised clustering or visualization. These methods ignore the prior knowledge of transcriptomes and of the probable structures of the data. Moreover, cell identification heavily relies on subjective and inaccurate human inspection afterwards. To address these analytical challenges, we developed the Semi-supervised Category Identification and Assignment (SCINA) algorithm, a semi-supervised model, for analyses of scRNA-Seq and flow cytometry/CyTOF data, and other data of similar format, by automatically exploiting previously established gene signatures using an expectation–maximization (EM) algorithm. We applied SCINA on a wide range of datasets, and showed its accuracy, stableness and efficiency exceeded most popular unsupervised approaches. SCINA discovered an intermediate stage of oligodendrocyte from mouse brain scRNA-Seq data. SCINA also detected immune cell population shifting in Stk4 knock-out-knockoutmouse cytometry data. Finally, SCINA identified a new kidney tumor clade with similarity to FH-deficient tumors from bulk tumor data. Overall, SCINA provides both methodological advances and biological insights from perspectives different from traditional analytical methods.

Original languageEnglish (US)
Article number531
JournalGenes
Volume10
Issue number7
DOIs
StatePublished - Jul 1 2019

Fingerprint

Single-Cell Analysis
RNA Sequence Analysis
Computer Simulation
Neoplasms
Oligodendroglia
Transcriptome
Cluster Analysis
Flow Cytometry
Kidney
Brain
Population
Genes

Keywords

  • CyTOF
  • Fumarase
  • Fumarate hydratase
  • HLRCC
  • RCC
  • Renal cell carcinoma
  • SCINA
  • Single-cell RNA-seq

ASJC Scopus subject areas

  • Genetics
  • Genetics(clinical)

Cite this

Zhang, Z., Luo, D., Zhong, X., Choi, J. H., Ma, Y., Wang, S., ... Wang, T. (2019). Scina: Semi-supervised analysis of single cells in silico. Genes, 10(7), [531]. https://doi.org/10.3390/genes10070531

Scina : Semi-supervised analysis of single cells in silico. / Zhang, Ze; Luo, Danni; Zhong, Xue; Choi, Jin Huk; Ma, Yuanqing; Wang, Stacy; Mahrt, Elena; Guo, Wei; Stawiski, Eric W.; Modrusan, Zora; Seshagiri, Somasekar; Kapur, Payal; Hon, Gary; Brugarolas, James B; Wang, Tao.

In: Genes, Vol. 10, No. 7, 531, 01.07.2019.

Research output: Contribution to journalArticle

Zhang, Z, Luo, D, Zhong, X, Choi, JH, Ma, Y, Wang, S, Mahrt, E, Guo, W, Stawiski, EW, Modrusan, Z, Seshagiri, S, Kapur, P, Hon, G, Brugarolas, JB & Wang, T 2019, 'Scina: Semi-supervised analysis of single cells in silico', Genes, vol. 10, no. 7, 531. https://doi.org/10.3390/genes10070531
Zhang Z, Luo D, Zhong X, Choi JH, Ma Y, Wang S et al. Scina: Semi-supervised analysis of single cells in silico. Genes. 2019 Jul 1;10(7). 531. https://doi.org/10.3390/genes10070531
Zhang, Ze ; Luo, Danni ; Zhong, Xue ; Choi, Jin Huk ; Ma, Yuanqing ; Wang, Stacy ; Mahrt, Elena ; Guo, Wei ; Stawiski, Eric W. ; Modrusan, Zora ; Seshagiri, Somasekar ; Kapur, Payal ; Hon, Gary ; Brugarolas, James B ; Wang, Tao. / Scina : Semi-supervised analysis of single cells in silico. In: Genes. 2019 ; Vol. 10, No. 7.
@article{548e43a4d1b94ccc800fe562e5aa1cdd,
title = "Scina: Semi-supervised analysis of single cells in silico",
abstract = "Advances in single-cell RNA sequencing (scRNA-Seq) have allowed for comprehensive analyses of single cell data. However, current analyses of scRNA-Seq data usually start from unsupervised clustering or visualization. These methods ignore the prior knowledge of transcriptomes and of the probable structures of the data. Moreover, cell identification heavily relies on subjective and inaccurate human inspection afterwards. To address these analytical challenges, we developed the Semi-supervised Category Identification and Assignment (SCINA) algorithm, a semi-supervised model, for analyses of scRNA-Seq and flow cytometry/CyTOF data, and other data of similar format, by automatically exploiting previously established gene signatures using an expectation–maximization (EM) algorithm. We applied SCINA on a wide range of datasets, and showed its accuracy, stableness and efficiency exceeded most popular unsupervised approaches. SCINA discovered an intermediate stage of oligodendrocyte from mouse brain scRNA-Seq data. SCINA also detected immune cell population shifting in Stk4 knock-out-knockoutmouse cytometry data. Finally, SCINA identified a new kidney tumor clade with similarity to FH-deficient tumors from bulk tumor data. Overall, SCINA provides both methodological advances and biological insights from perspectives different from traditional analytical methods.",
keywords = "CyTOF, Fumarase, Fumarate hydratase, HLRCC, RCC, Renal cell carcinoma, SCINA, Single-cell RNA-seq",
author = "Ze Zhang and Danni Luo and Xue Zhong and Choi, {Jin Huk} and Yuanqing Ma and Stacy Wang and Elena Mahrt and Wei Guo and Stawiski, {Eric W.} and Zora Modrusan and Somasekar Seshagiri and Payal Kapur and Gary Hon and Brugarolas, {James B} and Tao Wang",
year = "2019",
month = "7",
day = "1",
doi = "10.3390/genes10070531",
language = "English (US)",
volume = "10",
journal = "Genes",
issn = "2073-4425",
publisher = "Multidisciplinary Digital Publishing Institute (MDPI)",
number = "7",

}

TY - JOUR

T1 - Scina

T2 - Semi-supervised analysis of single cells in silico

AU - Zhang, Ze

AU - Luo, Danni

AU - Zhong, Xue

AU - Choi, Jin Huk

AU - Ma, Yuanqing

AU - Wang, Stacy

AU - Mahrt, Elena

AU - Guo, Wei

AU - Stawiski, Eric W.

AU - Modrusan, Zora

AU - Seshagiri, Somasekar

AU - Kapur, Payal

AU - Hon, Gary

AU - Brugarolas, James B

AU - Wang, Tao

PY - 2019/7/1

Y1 - 2019/7/1

N2 - Advances in single-cell RNA sequencing (scRNA-Seq) have allowed for comprehensive analyses of single cell data. However, current analyses of scRNA-Seq data usually start from unsupervised clustering or visualization. These methods ignore the prior knowledge of transcriptomes and of the probable structures of the data. Moreover, cell identification heavily relies on subjective and inaccurate human inspection afterwards. To address these analytical challenges, we developed the Semi-supervised Category Identification and Assignment (SCINA) algorithm, a semi-supervised model, for analyses of scRNA-Seq and flow cytometry/CyTOF data, and other data of similar format, by automatically exploiting previously established gene signatures using an expectation–maximization (EM) algorithm. We applied SCINA on a wide range of datasets, and showed its accuracy, stableness and efficiency exceeded most popular unsupervised approaches. SCINA discovered an intermediate stage of oligodendrocyte from mouse brain scRNA-Seq data. SCINA also detected immune cell population shifting in Stk4 knock-out-knockoutmouse cytometry data. Finally, SCINA identified a new kidney tumor clade with similarity to FH-deficient tumors from bulk tumor data. Overall, SCINA provides both methodological advances and biological insights from perspectives different from traditional analytical methods.

AB - Advances in single-cell RNA sequencing (scRNA-Seq) have allowed for comprehensive analyses of single cell data. However, current analyses of scRNA-Seq data usually start from unsupervised clustering or visualization. These methods ignore the prior knowledge of transcriptomes and of the probable structures of the data. Moreover, cell identification heavily relies on subjective and inaccurate human inspection afterwards. To address these analytical challenges, we developed the Semi-supervised Category Identification and Assignment (SCINA) algorithm, a semi-supervised model, for analyses of scRNA-Seq and flow cytometry/CyTOF data, and other data of similar format, by automatically exploiting previously established gene signatures using an expectation–maximization (EM) algorithm. We applied SCINA on a wide range of datasets, and showed its accuracy, stableness and efficiency exceeded most popular unsupervised approaches. SCINA discovered an intermediate stage of oligodendrocyte from mouse brain scRNA-Seq data. SCINA also detected immune cell population shifting in Stk4 knock-out-knockoutmouse cytometry data. Finally, SCINA identified a new kidney tumor clade with similarity to FH-deficient tumors from bulk tumor data. Overall, SCINA provides both methodological advances and biological insights from perspectives different from traditional analytical methods.

KW - CyTOF

KW - Fumarase

KW - Fumarate hydratase

KW - HLRCC

KW - RCC

KW - Renal cell carcinoma

KW - SCINA

KW - Single-cell RNA-seq

UR - http://www.scopus.com/inward/record.url?scp=85071381828&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85071381828&partnerID=8YFLogxK

U2 - 10.3390/genes10070531

DO - 10.3390/genes10070531

M3 - Article

VL - 10

JO - Genes

JF - Genes

SN - 2073-4425

IS - 7

M1 - 531

ER -