Abstract
Motivation: MicroRNAs (miRNAs) are small non-coding RNAs that have been successfully identified to be differentially expressed in various cancers. However, some miRNAs were reported to be up-regulated in one subtype of a cancer but down-regulated in another, making overall associations between these miRNAs and the heterogeneous cancer non-linear. These non-linearly associated miRNAs, if identified, are thus informative for cancer subtyping. Results: Here, we propose mirPLS, a Partial Linear Structure identifier for miRNA data that simultaneously identifies miRNAs of linear or non-linear associations with cancer status when non-linearly associated miRNAs can then be used for subsequent cancer subtyping. Simulation studies showed that mirPLS can identify both non-linearly and linearly outcome-associated miRNAs more accurately than the comparison methods. Using the identified nonlinearly associated miRNAs much improves the cancer subtyping accuracy. Applications to miRNA data of three different cancer types suggest that the cancer subtypes defined by the non-linearly associated miRNAs identified by mirPLS are consistently more predictive of patient survival and more biological meaningful. Availability and implementation: The R package mirPLS is available for downloading from https://github.com/ pfruan/mirPLS.
Original language | English (US) |
---|---|
Pages (from-to) | 4902-4909 |
Number of pages | 8 |
Journal | Bioinformatics |
Volume | 36 |
Issue number | 19 |
DOIs | |
State | Published - Oct 1 2020 |
Externally published | Yes |
ASJC Scopus subject areas
- Statistics and Probability
- Biochemistry
- Molecular Biology
- Computer Science Applications
- Computational Theory and Mathematics
- Computational Mathematics