Title: A new clustering approach for learning transcriptional modules
Authors: Francesco Archetti; Ilaria Giordani; Giancarlo Mauri; Enza Messina
Addresses: DISCO – Department of Computer Science, Systems and Communication, University of Milano Bicocca, Consorzio Milano Ricerche, Milan, Italy ' DISCO – Department of Computer Science, Systems and Communication, University of Milano Bicocca, Consorzio Milano Ricerche, Milan, Italy ' DISCO – Department of Computer Science, Systems and Communication, University of Milano Bicocca, Consorzio Milano Ricerche, Milan, Italy ' DISCO – Department of Computer Science, Systems and Communication, University of Milano Bicocca, Consorzio Milano Ricerche, Milan, Italy
Abstract: In modern biology, we had an explosion of genomic data from multiple sources, like measurements of RNA levels, gene sequences, annotations or interaction data. These heterogeneous data provide important information that should be integrated through suitable learning methods aimed at elucidating regulatory networks. We propose an iterative relational clustering procedure for finding modules of co-regulated genes. This approach integrates information concerning known Transcription Factors (TFs)gene interactions with gene expression data to find clusters of genes that share a common regulatory program. The results obtained on two well-known gene expression data sets from Saccharomyces cerevisiae are shown.
Keywords: gene transcriptional modules; gene clusters; relational clustering; regulatory networks; data mining; bioinformatics; transcription factors; gene expression data; Saccharomyces cerevisiae.
DOI: 10.1504/IJDMB.2012.049248
International Journal of Data Mining and Bioinformatics, 2012 Vol.6 No.3, pp.304 - 323
Accepted: 02 Oct 2010
Published online: 17 Dec 2014 *