Modeling nonlinearity in dilution design microarray data

Xiuwen Zheng; Hung Chung Huang; Wenyuan Li; Peng Liu; Quan Zhen Li; Ying Liu

doi:10.1093/bioinformatics/btm002

Modeling nonlinearity in dilution design microarray data

Xiuwen Zheng, Hung Chung Huang, Wenyuan Li, Peng Liu, Quan Zhen Li, Ying Liu

Research output: Contribution to journal › Article › peer-review

3 Scopus citations

Abstract

Motivation: Dilution design (Mixed tissue RNA) has been utilized by some researchers to evaluate and assess the performance of multiple microarray platforms. Current microarray data analysis approaches assume that the quantified signal intensities are linearly related to the expression of the corresponding genes in the sample. However, there are sources of nonlinearity in microarray expression measurements. Such nonlinearity study in the expressions of the RNA mixtures provides a new way to analyze gene expression data, and we argue that the nonlinearity can reveal novel information for microarray data analysis. Therefore, we proposed a statistical model, called proportion model, which is based on the linear regression analysis. To approximately quantify the nonlinearity in the dilution design, a new calibration, beta ratio (BR) was derived from the proportion model. Furthermore, a new adjusted fold change adj-FC) was proposed to predict the true FC without nonlinearity, in particular for large FC. Results: We applied our method to one microarray dilution dataset. The experimental results indicated that, to some extent, there are global biases comparing with the linear assumption for the significant genes. Further analysis of those highly expressed genes with significant nonlinearity revealed some promising results, e.g. poison' effect was discovered for some genes in RNA mixtures. The adj-FCs of those genes with 'poison' effect, indicate that the nonlinearity can be also caused by the inherent feature of the genes besides signal noise and technical variation. Moreover, when percentage of overlapping genes (POG) was used as a crossplatform consistency measure, adj-FC outperformed simple fold change to show that Affymetrix and Illumina platforms are consistent.

Original language	English (US)
Pages (from-to)	1339-1347
Number of pages	9
Journal	Bioinformatics
Volume	23
Issue number	11
DOIs	https://doi.org/10.1093/bioinformatics/btm002
State	Published - Jun 2007

ASJC Scopus subject areas

Statistics and Probability
Biochemistry
Molecular Biology
Computer Science Applications
Computational Theory and Mathematics
Computational Mathematics

Access to Document

10.1093/bioinformatics/btm002

Cite this

@article{ad4db331ed834ab3979554a3738edffa,

title = "Modeling nonlinearity in dilution design microarray data",

abstract = "Motivation: Dilution design (Mixed tissue RNA) has been utilized by some researchers to evaluate and assess the performance of multiple microarray platforms. Current microarray data analysis approaches assume that the quantified signal intensities are linearly related to the expression of the corresponding genes in the sample. However, there are sources of nonlinearity in microarray expression measurements. Such nonlinearity study in the expressions of the RNA mixtures provides a new way to analyze gene expression data, and we argue that the nonlinearity can reveal novel information for microarray data analysis. Therefore, we proposed a statistical model, called proportion model, which is based on the linear regression analysis. To approximately quantify the nonlinearity in the dilution design, a new calibration, beta ratio (BR) was derived from the proportion model. Furthermore, a new adjusted fold change adj-FC) was proposed to predict the true FC without nonlinearity, in particular for large FC. Results: We applied our method to one microarray dilution dataset. The experimental results indicated that, to some extent, there are global biases comparing with the linear assumption for the significant genes. Further analysis of those highly expressed genes with significant nonlinearity revealed some promising results, e.g. poison' effect was discovered for some genes in RNA mixtures. The adj-FCs of those genes with 'poison' effect, indicate that the nonlinearity can be also caused by the inherent feature of the genes besides signal noise and technical variation. Moreover, when percentage of overlapping genes (POG) was used as a crossplatform consistency measure, adj-FC outperformed simple fold change to show that Affymetrix and Illumina platforms are consistent.",

author = "Xiuwen Zheng and Huang, {Hung Chung} and Wenyuan Li and Peng Liu and Li, {Quan Zhen} and Ying Liu",

year = "2007",

month = jun,

doi = "10.1093/bioinformatics/btm002",

language = "English (US)",

volume = "23",

pages = "1339--1347",

journal = "Bioinformatics",

issn = "1367-4803",

publisher = "Oxford University Press",

number = "11",

}

TY - JOUR

T1 - Modeling nonlinearity in dilution design microarray data

AU - Zheng, Xiuwen

AU - Huang, Hung Chung

AU - Li, Wenyuan

AU - Liu, Peng

AU - Li, Quan Zhen

AU - Liu, Ying

PY - 2007/6

Y1 - 2007/6

N2 - Motivation: Dilution design (Mixed tissue RNA) has been utilized by some researchers to evaluate and assess the performance of multiple microarray platforms. Current microarray data analysis approaches assume that the quantified signal intensities are linearly related to the expression of the corresponding genes in the sample. However, there are sources of nonlinearity in microarray expression measurements. Such nonlinearity study in the expressions of the RNA mixtures provides a new way to analyze gene expression data, and we argue that the nonlinearity can reveal novel information for microarray data analysis. Therefore, we proposed a statistical model, called proportion model, which is based on the linear regression analysis. To approximately quantify the nonlinearity in the dilution design, a new calibration, beta ratio (BR) was derived from the proportion model. Furthermore, a new adjusted fold change adj-FC) was proposed to predict the true FC without nonlinearity, in particular for large FC. Results: We applied our method to one microarray dilution dataset. The experimental results indicated that, to some extent, there are global biases comparing with the linear assumption for the significant genes. Further analysis of those highly expressed genes with significant nonlinearity revealed some promising results, e.g. poison' effect was discovered for some genes in RNA mixtures. The adj-FCs of those genes with 'poison' effect, indicate that the nonlinearity can be also caused by the inherent feature of the genes besides signal noise and technical variation. Moreover, when percentage of overlapping genes (POG) was used as a crossplatform consistency measure, adj-FC outperformed simple fold change to show that Affymetrix and Illumina platforms are consistent.

AB - Motivation: Dilution design (Mixed tissue RNA) has been utilized by some researchers to evaluate and assess the performance of multiple microarray platforms. Current microarray data analysis approaches assume that the quantified signal intensities are linearly related to the expression of the corresponding genes in the sample. However, there are sources of nonlinearity in microarray expression measurements. Such nonlinearity study in the expressions of the RNA mixtures provides a new way to analyze gene expression data, and we argue that the nonlinearity can reveal novel information for microarray data analysis. Therefore, we proposed a statistical model, called proportion model, which is based on the linear regression analysis. To approximately quantify the nonlinearity in the dilution design, a new calibration, beta ratio (BR) was derived from the proportion model. Furthermore, a new adjusted fold change adj-FC) was proposed to predict the true FC without nonlinearity, in particular for large FC. Results: We applied our method to one microarray dilution dataset. The experimental results indicated that, to some extent, there are global biases comparing with the linear assumption for the significant genes. Further analysis of those highly expressed genes with significant nonlinearity revealed some promising results, e.g. poison' effect was discovered for some genes in RNA mixtures. The adj-FCs of those genes with 'poison' effect, indicate that the nonlinearity can be also caused by the inherent feature of the genes besides signal noise and technical variation. Moreover, when percentage of overlapping genes (POG) was used as a crossplatform consistency measure, adj-FC outperformed simple fold change to show that Affymetrix and Illumina platforms are consistent.

UR - http://www.scopus.com/inward/record.url?scp=34447321058&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=34447321058&partnerID=8YFLogxK

U2 - 10.1093/bioinformatics/btm002

DO - 10.1093/bioinformatics/btm002

M3 - Article

C2 - 17237040

AN - SCOPUS:34447321058

SN - 1367-4803

VL - 23

SP - 1339

EP - 1347

JO - Bioinformatics

JF - Bioinformatics

IS - 11

ER -

Modeling nonlinearity in dilution design microarray data

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this