Title: Minimum redundancy maximum relevance and VNS based gene selection for cancer classification in high-dimensional data

Authors: Ahmed Bir-Jmel; Sidi Mohamed Douiri; Souad Elbernoussi

Addresses: Department of Mathematics, Faculty of Sciences, Laboratory of Mathematics, Computer Science and Applications-Security of Information, Mohammed V University in Rabat, Morocco ' Department of Mathematics, Faculty of Sciences, Laboratory of Mathematics, Computer Science and Applications-Security of Information, Mohammed V University in Rabat, Morocco ' Department of Mathematics, Faculty of Sciences, Laboratory of Mathematics, Computer Science and Applications-Security of Information, Mohammed V University in Rabat, Morocco

Abstract: DNA microarray is a technique for measuring simultaneously the expression levels of a huge number of genes, these levels have a significant impact on cancer classification tasks. In DNA datasets, the number of genes exceeds the number of samples that make the presence of irrelevant or redundant genes possible. In this paper, two hybrid multivariate filters for gene selection, named VNSMI and VNSCor, are presented. These methods surpass the univariate filters by considering the possible interaction between genes through the search for a subset of genes that contains the minimum redundancy and the maximum relevance (MRMR). In the first stage of our approaches, we use a univariate filter by selecting the best-ranked genes. Then, we apply the variable neighbourhood search (VNS) metaheuristic coupled with an innovative stochastic local search (SLS) algorithm to find the final subset of genes. The experiments performed show that the proposed approaches are feasible and effective.

Keywords: gene selection; feature selection; cancer classification; VNS; stochastic local search; SLS; normalised mutual information; MRMR; DNA microarray.

DOI: 10.1504/IJCSE.2024.136254

International Journal of Computational Science and Engineering, 2024 Vol.27 No.1, pp.78 - 89

Received: 23 May 2022
Received in revised form: 01 Sep 2022
Accepted: 20 Sep 2022

Published online: 25 Jan 2024 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article