Multiple networks modules identification by a multi-dimensional Markov chain method

Chenyang Shen, Junjun Pan, Shuqin Zhang, Michael K. Ng

Research output: Contribution to journalArticle

2 Citations (Scopus)

Abstract

As a general approach to study interactions among small biological molecules such as genes and proteins, network analysis has aroused great interest of people from various research disciplines. However, the construction of network is usually quite sensitive to noise which is unavoidable in real data. Besides, the parameter selections for network construction can also affect the result significantly. These two factors largely decrease the consistency of results generated in network analysis. In particular, we consider detecting closely connected subgraphs named module structure. As an important common property of biological networks, this module structure is often destroyed corrupted by both noise and poor parameter selections in network construction. To conquer these two disadvantages to improve the consistency of module structure identified, we propose to process multiple networks for same set of biological molecules simultaneously for common module structure. More specifically, we combine multiple networks together by building an order 3 tensor data with each layer as one of the multiple networks. Then given any molecule(s) as prior information, a novel tensor-based Markov chain algorithm is proposed to iteratively detect the module that includes the prior node. Moreover, the proposed algorithm is capable of evaluating the contribution scores of each network to the detected module structure. The contribution scores from multiple networks can be not only useful criteria to measure the consistency of module structure, but also valid indicator of corruption in networks. To demonstrate the effectiveness and efficiency of the proposed tensor-based Markov chain algorithm, experimental results on synthetic data set as well as two real gene co-expression data sets of human beings are reported. We also validate that the identified common modules are biologically meaningful.

Original languageEnglish (US)
Article number32
Pages (from-to)1-13
Number of pages13
JournalNetwork Modeling Analysis in Health Informatics and Bioinformatics
Volume4
Issue number1
DOIs
StatePublished - Dec 1 2015
Externally publishedYes

Fingerprint

Markov Chains
Noise
Gene Regulatory Networks
Gene Expression
Research
Proteins
Datasets

ASJC Scopus subject areas

  • Urology

Cite this

Multiple networks modules identification by a multi-dimensional Markov chain method. / Shen, Chenyang; Pan, Junjun; Zhang, Shuqin; Ng, Michael K.

In: Network Modeling Analysis in Health Informatics and Bioinformatics, Vol. 4, No. 1, 32, 01.12.2015, p. 1-13.

Research output: Contribution to journalArticle

@article{5c512cf0b5c049c2a04bd3a1fe0ab29f,
title = "Multiple networks modules identification by a multi-dimensional Markov chain method",
abstract = "As a general approach to study interactions among small biological molecules such as genes and proteins, network analysis has aroused great interest of people from various research disciplines. However, the construction of network is usually quite sensitive to noise which is unavoidable in real data. Besides, the parameter selections for network construction can also affect the result significantly. These two factors largely decrease the consistency of results generated in network analysis. In particular, we consider detecting closely connected subgraphs named module structure. As an important common property of biological networks, this module structure is often destroyed corrupted by both noise and poor parameter selections in network construction. To conquer these two disadvantages to improve the consistency of module structure identified, we propose to process multiple networks for same set of biological molecules simultaneously for common module structure. More specifically, we combine multiple networks together by building an order 3 tensor data with each layer as one of the multiple networks. Then given any molecule(s) as prior information, a novel tensor-based Markov chain algorithm is proposed to iteratively detect the module that includes the prior node. Moreover, the proposed algorithm is capable of evaluating the contribution scores of each network to the detected module structure. The contribution scores from multiple networks can be not only useful criteria to measure the consistency of module structure, but also valid indicator of corruption in networks. To demonstrate the effectiveness and efficiency of the proposed tensor-based Markov chain algorithm, experimental results on synthetic data set as well as two real gene co-expression data sets of human beings are reported. We also validate that the identified common modules are biologically meaningful.",
author = "Chenyang Shen and Junjun Pan and Shuqin Zhang and Ng, {Michael K.}",
year = "2015",
month = "12",
day = "1",
doi = "10.1007/s13721-015-0106-1",
language = "English (US)",
volume = "4",
pages = "1--13",
journal = "Network Modeling and Analysis in Health Informatics and Bioinformatics",
issn = "2192-6662",
publisher = "Springer Nature",
number = "1",

}

TY - JOUR

T1 - Multiple networks modules identification by a multi-dimensional Markov chain method

AU - Shen, Chenyang

AU - Pan, Junjun

AU - Zhang, Shuqin

AU - Ng, Michael K.

PY - 2015/12/1

Y1 - 2015/12/1

N2 - As a general approach to study interactions among small biological molecules such as genes and proteins, network analysis has aroused great interest of people from various research disciplines. However, the construction of network is usually quite sensitive to noise which is unavoidable in real data. Besides, the parameter selections for network construction can also affect the result significantly. These two factors largely decrease the consistency of results generated in network analysis. In particular, we consider detecting closely connected subgraphs named module structure. As an important common property of biological networks, this module structure is often destroyed corrupted by both noise and poor parameter selections in network construction. To conquer these two disadvantages to improve the consistency of module structure identified, we propose to process multiple networks for same set of biological molecules simultaneously for common module structure. More specifically, we combine multiple networks together by building an order 3 tensor data with each layer as one of the multiple networks. Then given any molecule(s) as prior information, a novel tensor-based Markov chain algorithm is proposed to iteratively detect the module that includes the prior node. Moreover, the proposed algorithm is capable of evaluating the contribution scores of each network to the detected module structure. The contribution scores from multiple networks can be not only useful criteria to measure the consistency of module structure, but also valid indicator of corruption in networks. To demonstrate the effectiveness and efficiency of the proposed tensor-based Markov chain algorithm, experimental results on synthetic data set as well as two real gene co-expression data sets of human beings are reported. We also validate that the identified common modules are biologically meaningful.

AB - As a general approach to study interactions among small biological molecules such as genes and proteins, network analysis has aroused great interest of people from various research disciplines. However, the construction of network is usually quite sensitive to noise which is unavoidable in real data. Besides, the parameter selections for network construction can also affect the result significantly. These two factors largely decrease the consistency of results generated in network analysis. In particular, we consider detecting closely connected subgraphs named module structure. As an important common property of biological networks, this module structure is often destroyed corrupted by both noise and poor parameter selections in network construction. To conquer these two disadvantages to improve the consistency of module structure identified, we propose to process multiple networks for same set of biological molecules simultaneously for common module structure. More specifically, we combine multiple networks together by building an order 3 tensor data with each layer as one of the multiple networks. Then given any molecule(s) as prior information, a novel tensor-based Markov chain algorithm is proposed to iteratively detect the module that includes the prior node. Moreover, the proposed algorithm is capable of evaluating the contribution scores of each network to the detected module structure. The contribution scores from multiple networks can be not only useful criteria to measure the consistency of module structure, but also valid indicator of corruption in networks. To demonstrate the effectiveness and efficiency of the proposed tensor-based Markov chain algorithm, experimental results on synthetic data set as well as two real gene co-expression data sets of human beings are reported. We also validate that the identified common modules are biologically meaningful.

UR - http://www.scopus.com/inward/record.url?scp=85033580980&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85033580980&partnerID=8YFLogxK

U2 - 10.1007/s13721-015-0106-1

DO - 10.1007/s13721-015-0106-1

M3 - Article

AN - SCOPUS:85033580980

VL - 4

SP - 1

EP - 13

JO - Network Modeling and Analysis in Health Informatics and Bioinformatics

JF - Network Modeling and Analysis in Health Informatics and Bioinformatics

SN - 2192-6662

IS - 1

M1 - 32

ER -