Machine Learning for Cancer Subtype Prediction with FSA Method

Yan Liu; Xu Dong Wang; Meikang Qiu; Hui Zhao

doi:10.1007/978-3-030-34139-8_39

BY-Lab Yu

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

3 Scopus citations

Abstract

Recent research indicates that cancer subtype classification based on gene expression has advantages over traditional classification. However, because such data typically contain thousands of features, classification is impractical for humans without efficient and accurate algorithms. This paper reports an empirical study in search of a highly efficient and accurate machine learning method for human cancer subtype classification based on gene expression data from cancer cells. Several well-developed machine learning algorithms address this kind of problem, including the Naive Bayes classifier, Support Vector Machine (SVM), Random Forest, and neural networks. Here we build two prediction models, using the SVM and Random Forest algorithms together with a feature selection approach (FSA), to predict the subtype of lung cell lines. The two models achieve similar accuracy, both above 90%. However, the running time of SVM is much shorter than that of Random Forest.
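
The kind of comparison the abstract describes can be sketched roughly as follows. This is an illustration only, not the authors' implementation: the abstract does not say which feature selection approach was used, so a univariate ANOVA F-test filter (scikit-learn's SelectKBest with f_classif) is assumed here, and the data are a synthetic two-class stand-in generated with make_classification rather than real expression profiles of lung cell lines. The sample count, feature count, k, linear kernel, and number of trees are likewise arbitrary assumptions.

```python
import time

from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic stand-in for an expression matrix: 200 samples x 2000 "genes",
# of which only 20 carry class signal. This is NOT the paper's data.
X, y = make_classification(
    n_samples=200,
    n_features=2000,
    n_informative=20,
    n_redundant=0,
    n_classes=2,
    class_sep=2.0,
    random_state=0,
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0
)


def evaluate(classifier, k=50):
    """Fit scaling + filter selection + classifier; return (test accuracy, fit seconds)."""
    model = make_pipeline(
        StandardScaler(),
        # Assumed filter: keep the k features with the highest ANOVA F-score.
        # Inside the pipeline it is fitted on the training split only.
        SelectKBest(score_func=f_classif, k=k),
        classifier,
    )
    start = time.perf_counter()
    model.fit(X_train, y_train)
    elapsed = time.perf_counter() - start
    return model.score(X_test, y_test), elapsed


results = {
    "SVM": evaluate(SVC(kernel="linear")),
    "Random Forest": evaluate(
        RandomForestClassifier(n_estimators=500, random_state=0)
    ),
}
for name, (accuracy, seconds) in results.items():
    print(f"{name}: accuracy={accuracy:.3f}, fit time={seconds:.3f}s")
```

Placing the selection step inside the pipeline keeps the held-out samples out of the feature ranking, which matters when features far outnumber samples. Accuracies and timings from this sketch depend on the synthetic data and the machine, and say nothing about the figures reported in the paper.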

Original language	English (US)
Title of host publication	Smart Computing and Communication - 4th International Conference, SmartCom 2019, Proceedings
Editors	Meikang Qiu
Publisher	Springer
Pages	387-397
Number of pages	11
ISBN (Print)	9783030341381
DOIs	https://doi.org/10.1007/978-3-030-34139-8_39
State	Published - 2019
Event	4th International Conference on Smart Computing and Communications, SmartCom 2019 - Birmingham, United Kingdom; Duration: Oct 11 2019 → Oct 13 2019

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	11910 LNCS
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	4th International Conference on Smart Computing and Communications, SmartCom 2019
Country/Territory	United Kingdom
City	Birmingham
Period	10/11/19 → 10/13/19

Keywords

Cancer subtype
Feature selection
Machine learning
Random Forest
Support Vector Machine

ASJC Scopus subject areas

Theoretical Computer Science
General Computer Science

Access to Document

10.1007/978-3-030-34139-8_39

Cite this

Liu, Y., Wang, X. D., Qiu, M., & Zhao, H. (2019). Machine Learning for Cancer Subtype Prediction with FSA Method. In M. Qiu (Ed.), Smart Computing and Communication - 4th International Conference, SmartCom 2019, Proceedings (pp. 387-397). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 11910 LNCS). Springer. https://doi.org/10.1007/978-3-030-34139-8_39

Machine Learning for Cancer Subtype Prediction with FSA Method. / Liu, Yan; Wang, Xu Dong; Qiu, Meikang et al.
Smart Computing and Communication - 4th International Conference, SmartCom 2019, Proceedings. ed. / Meikang Qiu. Springer, 2019. p. 387-397 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 11910 LNCS).

Liu, Y, Wang, XD, Qiu, M & Zhao, H 2019, Machine Learning for Cancer Subtype Prediction with FSA Method. in M Qiu (ed.), Smart Computing and Communication - 4th International Conference, SmartCom 2019, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 11910 LNCS, Springer, pp. 387-397, 4th International Conference on Smart Computing and Communications, SmartCom 2019, Birmingham, United Kingdom, 10/11/19. https://doi.org/10.1007/978-3-030-34139-8_39

Liu Y, Wang XD, Qiu M, Zhao H. Machine Learning for Cancer Subtype Prediction with FSA Method. In Qiu M, editor, Smart Computing and Communication - 4th International Conference, SmartCom 2019, Proceedings. Springer. 2019. p. 387-397. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-030-34139-8_39

Liu, Yan ; Wang, Xu Dong ; Qiu, Meikang et al. / Machine Learning for Cancer Subtype Prediction with FSA Method. Smart Computing and Communication - 4th International Conference, SmartCom 2019, Proceedings. editor / Meikang Qiu. Springer, 2019. pp. 387-397 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{027f3c593a914af4b0d913bb589963eb,
title = "Machine Learning for Cancer Subtype Prediction with FSA Method",
abstract = "Recent research demonstrates that gene expression based cancer subtype classification has more advantages over the traditional classification. However, since this kind of data always has thousands of features, performing classification is impossible by human beings without efficient and accurate algorithms. This paper reports an empirical study that explores the problem of finding a highly-efficient and accurate machine learning method on human cancer subtype classification based on the gene expression data in cancer cells. Several machine learning algorithms are well developed to solve this kind of problems, including Naive Bayes Classifier, Support Vector Machine (SVM), Random Forest, Neural Networks. Here we generate two prediction models using SVM and Random Forest algorithms along with a feature selection approach (FSA) to predict the subtype of lung cell lines. The accuracy of the two prediction models is close with a rate of more than 90%. However, the running time of SVM is much shorter than that of Random Forest.",
keywords = "Cancer subtype, Feature selection, Machine learning, Random Forest, Support Vector Machine",
author = "Yan Liu and Wang, {Xu Dong} and Meikang Qiu and Hui Zhao",
note = "Publisher Copyright: {\textcopyright} 2019, Springer Nature Switzerland AG.; 4th International Conference on Smart Computing and Communications, SmartCom 2019 ; Conference date: 11-10-2019 Through 13-10-2019",
year = "2019",
doi = "10.1007/978-3-030-34139-8_39",
language = "English (US)",
isbn = "9783030341381",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer",
pages = "387--397",
editor = "Meikang Qiu",
booktitle = "Smart Computing and Communication - 4th International Conference, SmartCom 2019, Proceedings",
}

TY - GEN
T1 - Machine Learning for Cancer Subtype Prediction with FSA Method
AU - Liu, Yan
AU - Wang, Xu Dong
AU - Qiu, Meikang
AU - Zhao, Hui
PY - 2019
Y1 - 2019
N2 - Recent research demonstrates that gene expression based cancer subtype classification has more advantages over the traditional classification. However, since this kind of data always has thousands of features, performing classification is impossible by human beings without efficient and accurate algorithms. This paper reports an empirical study that explores the problem of finding a highly-efficient and accurate machine learning method on human cancer subtype classification based on the gene expression data in cancer cells. Several machine learning algorithms are well developed to solve this kind of problems, including Naive Bayes Classifier, Support Vector Machine (SVM), Random Forest, Neural Networks. Here we generate two prediction models using SVM and Random Forest algorithms along with a feature selection approach (FSA) to predict the subtype of lung cell lines. The accuracy of the two prediction models is close with a rate of more than 90%. However, the running time of SVM is much shorter than that of Random Forest.
AB - Recent research demonstrates that gene expression based cancer subtype classification has more advantages over the traditional classification. However, since this kind of data always has thousands of features, performing classification is impossible by human beings without efficient and accurate algorithms. This paper reports an empirical study that explores the problem of finding a highly-efficient and accurate machine learning method on human cancer subtype classification based on the gene expression data in cancer cells. Several machine learning algorithms are well developed to solve this kind of problems, including Naive Bayes Classifier, Support Vector Machine (SVM), Random Forest, Neural Networks. Here we generate two prediction models using SVM and Random Forest algorithms along with a feature selection approach (FSA) to predict the subtype of lung cell lines. The accuracy of the two prediction models is close with a rate of more than 90%. However, the running time of SVM is much shorter than that of Random Forest.
KW - Cancer subtype
KW - Feature selection
KW - Machine learning
KW - Random Forest
KW - Support Vector Machine
UR - http://www.scopus.com/inward/record.url?scp=85076163584&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85076163584&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-34139-8_39
DO - 10.1007/978-3-030-34139-8_39
M3 - Conference contribution
AN - SCOPUS:85076163584
SN - 9783030341381
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 387
EP - 397
BT - Smart Computing and Communication - 4th International Conference, SmartCom 2019, Proceedings
A2 - Qiu, Meikang
PB - Springer
T2 - 4th International Conference on Smart Computing and Communications, SmartCom 2019
Y2 - 11 October 2019 through 13 October 2019
ER -
