Title: Negative correlation based gene markers identification in integrative gene expression data
Authors: Tao Zeng; Xuan Guo; Juan Liu
Addresses: School of Computer, Wuhan University, Wuhan 430072, China ' School of Computer, Wuhan University, Wuhan 430072, China ' School of Computer, Wuhan University, Wuhan 430072, China
Abstract: Along with the emergence and development of translational biomedicine, more and more genetic information has been applied in clinical practice. In the recent decade, the discovery of genetic markers for cancer prognosis obtains increasing attention and many methods have been developed. The 'element' methods use one or two independent genes to judge the Boolean status of disease. The 'set' methods use multiple gene markers as a whole to classify patients into different risks. And the advanced 'sets' methods use a group of different sets of biomarkers in an assembling manner. Either the existed 'set' or 'sets' methods only concern positive correlations among genes. However, the negative regulation, negative feedback, and functional repression are actually the relevant clues in cancer studies. Therefore, in this paper, based on the integrative gene expression data organised as gene-time-sample data or gene-sample-source data, we propose to mine Negatively Correlated Gene Sets (NCGSs) from multiple datasets, and use them along with the maximal positively correlated gene sets for prognosis classification. The experiment results suggest the promotions of cancer prognosis accuracy and meaningful pathogen relevance of gene markers by NCGS applications.
Keywords: NCGSs; negatively correlated gene sets; integrative gene expression data; cancer prognosis; response prediction; biological networks; gene markers; gene marker identification.
DOI: 10.1504/IJDMB.2014.062889
International Journal of Data Mining and Bioinformatics, 2014 Vol.10 No.1, pp.1 - 17
Received: 06 Sep 2011
Accepted: 10 Sep 2011
Published online: 21 Oct 2014 *