Identifying genes with tri-modal association with survival and tumor grade in cancer patients

Minzhe Zhang, Tao Wang, Rosa Sirianni, Philip W Shaul, Yang Xie

Research output: Contribution to journalArticle

Abstract

Background: Previous cancer genomics studies focused on searching for novel oncogenes and tumor suppressor genes whose abundance is positively or negatively correlated with end-point observation, such as survival or tumor grade. This approach may potentially miss some truly functional genes if both its low and high modes have associations with end-point observation. Such genes act as both oncogenes and tumor suppressor genes, a scenario that is unlikely but theoretically possible. Results: We invented an Expectation-Maximization (EM) algorithm to divide patients into low-, middle- and high-expressing groups according to the expression level of a certain gene in both tumor and normal patients. We found one gene, ORMDL3, whose low and high modes were both associated with worse survival and higher tumor grade in breast cancer patients in multiple patient cohorts. We speculate that its tumor suppressor gene role may be real, while its high expression correlating with worse end-point outcome is probably due to the passenger event of the nearby ERBB2's amplification. Conclusions: The proposed EM algorithm can effectively detect genes having tri-modal distributed expression in patient groups compared to normal genes, thus rendering a new perspective on dissecting the association between genomic features and end-point observations. Our analysis of breast cancer datasets suggest that the gene ORMDL3 may have an unexploited tumor suppressive function.

Original languageEnglish (US)
Article number13
JournalBMC Bioinformatics
Volume20
Issue number1
DOIs
StatePublished - Jan 8 2019

Fingerprint

Tumors
Tumor
Cancer
Genes
Gene
Survival
Tumor Suppressor Genes
Neoplasms
End point
Oncogenes
Expectation-maximization Algorithm
Observation
Breast Cancer
Breast Neoplasms
Genomics
Feature Point
Amplification
Rendering
Divides
Scenarios

Keywords

  • Breast Cancer
  • Expectation maximization
  • Oncogene
  • Survival
  • Tumor suppressor gene

ASJC Scopus subject areas

  • Structural Biology
  • Biochemistry
  • Molecular Biology
  • Computer Science Applications
  • Applied Mathematics

Cite this

Identifying genes with tri-modal association with survival and tumor grade in cancer patients. / Zhang, Minzhe; Wang, Tao; Sirianni, Rosa; Shaul, Philip W; Xie, Yang.

In: BMC Bioinformatics, Vol. 20, No. 1, 13, 08.01.2019.

Research output: Contribution to journalArticle

@article{cbca748df242454f8d2cbf560248d5e3,
title = "Identifying genes with tri-modal association with survival and tumor grade in cancer patients",
abstract = "Background: Previous cancer genomics studies focused on searching for novel oncogenes and tumor suppressor genes whose abundance is positively or negatively correlated with end-point observation, such as survival or tumor grade. This approach may potentially miss some truly functional genes if both its low and high modes have associations with end-point observation. Such genes act as both oncogenes and tumor suppressor genes, a scenario that is unlikely but theoretically possible. Results: We invented an Expectation-Maximization (EM) algorithm to divide patients into low-, middle- and high-expressing groups according to the expression level of a certain gene in both tumor and normal patients. We found one gene, ORMDL3, whose low and high modes were both associated with worse survival and higher tumor grade in breast cancer patients in multiple patient cohorts. We speculate that its tumor suppressor gene role may be real, while its high expression correlating with worse end-point outcome is probably due to the passenger event of the nearby ERBB2's amplification. Conclusions: The proposed EM algorithm can effectively detect genes having tri-modal distributed expression in patient groups compared to normal genes, thus rendering a new perspective on dissecting the association between genomic features and end-point observations. Our analysis of breast cancer datasets suggest that the gene ORMDL3 may have an unexploited tumor suppressive function.",
keywords = "Breast Cancer, Expectation maximization, Oncogene, Survival, Tumor suppressor gene",
author = "Minzhe Zhang and Tao Wang and Rosa Sirianni and Shaul, {Philip W} and Yang Xie",
year = "2019",
month = "1",
day = "8",
doi = "10.1186/s12859-018-2582-7",
language = "English (US)",
volume = "20",
journal = "BMC Bioinformatics",
issn = "1471-2105",
publisher = "BioMed Central",
number = "1",

}

TY - JOUR

T1 - Identifying genes with tri-modal association with survival and tumor grade in cancer patients

AU - Zhang, Minzhe

AU - Wang, Tao

AU - Sirianni, Rosa

AU - Shaul, Philip W

AU - Xie, Yang

PY - 2019/1/8

Y1 - 2019/1/8

N2 - Background: Previous cancer genomics studies focused on searching for novel oncogenes and tumor suppressor genes whose abundance is positively or negatively correlated with end-point observation, such as survival or tumor grade. This approach may potentially miss some truly functional genes if both its low and high modes have associations with end-point observation. Such genes act as both oncogenes and tumor suppressor genes, a scenario that is unlikely but theoretically possible. Results: We invented an Expectation-Maximization (EM) algorithm to divide patients into low-, middle- and high-expressing groups according to the expression level of a certain gene in both tumor and normal patients. We found one gene, ORMDL3, whose low and high modes were both associated with worse survival and higher tumor grade in breast cancer patients in multiple patient cohorts. We speculate that its tumor suppressor gene role may be real, while its high expression correlating with worse end-point outcome is probably due to the passenger event of the nearby ERBB2's amplification. Conclusions: The proposed EM algorithm can effectively detect genes having tri-modal distributed expression in patient groups compared to normal genes, thus rendering a new perspective on dissecting the association between genomic features and end-point observations. Our analysis of breast cancer datasets suggest that the gene ORMDL3 may have an unexploited tumor suppressive function.

AB - Background: Previous cancer genomics studies focused on searching for novel oncogenes and tumor suppressor genes whose abundance is positively or negatively correlated with end-point observation, such as survival or tumor grade. This approach may potentially miss some truly functional genes if both its low and high modes have associations with end-point observation. Such genes act as both oncogenes and tumor suppressor genes, a scenario that is unlikely but theoretically possible. Results: We invented an Expectation-Maximization (EM) algorithm to divide patients into low-, middle- and high-expressing groups according to the expression level of a certain gene in both tumor and normal patients. We found one gene, ORMDL3, whose low and high modes were both associated with worse survival and higher tumor grade in breast cancer patients in multiple patient cohorts. We speculate that its tumor suppressor gene role may be real, while its high expression correlating with worse end-point outcome is probably due to the passenger event of the nearby ERBB2's amplification. Conclusions: The proposed EM algorithm can effectively detect genes having tri-modal distributed expression in patient groups compared to normal genes, thus rendering a new perspective on dissecting the association between genomic features and end-point observations. Our analysis of breast cancer datasets suggest that the gene ORMDL3 may have an unexploited tumor suppressive function.

KW - Breast Cancer

KW - Expectation maximization

KW - Oncogene

KW - Survival

KW - Tumor suppressor gene

UR - http://www.scopus.com/inward/record.url?scp=85059648791&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85059648791&partnerID=8YFLogxK

U2 - 10.1186/s12859-018-2582-7

DO - 10.1186/s12859-018-2582-7

M3 - Article

C2 - 30621577

AN - SCOPUS:85059648791

VL - 20

JO - BMC Bioinformatics

JF - BMC Bioinformatics

SN - 1471-2105

IS - 1

M1 - 13

ER -