Usage of ensemble model and genetic algorithm in pipeline for feature selection from cancer microarray data Online publication date: Thu, 20-Aug-2020
by Sahu Barnali; Dehuri Satchidananda; Jagadev Alok Kumar
International Journal of Bioinformatics Research and Applications (IJBRA), Vol. 16, No. 3, 2020
Abstract: This paper proposes an ensemble of feature selection techniques with genetic algorithm (GA) in pipeline for selecting features from microarray data. The ensemble is a combination of filter and wrapper-based feature selection methods. In addition, GA in pipeline has been used for refinement of ensemble output to produce a non-local set of robust feature subset. An extensive computational experiment has been carried out on a prostate cancer dataset for validation of the method and comparison with group genetic algorithm (GGA). Finally, the resultant feature subsets of GA, GGA, and other constituents of the ensemble in standalone mode have been used for uncovering frequent patterns based on Apriori and FP-growth. The experimental study confirms that the proposed method gives classification accuracy of 100%, 98.34%, 98.02%, and 97% based on an ensemble of classifiers w. r. t. 5, 10, 15, and 20 features, respectively, vis-à-vis 92.34%, 90.34%, 86.54%, and 87.21% of GGA.
Existing subscribers:
Go to Inderscience Online Journals to access the Full Text of this article.
If you are not a subscriber and you just want to read the full contents of this article, buy online access here.Complimentary Subscribers, Editors or Members of the Editorial Board of the International Journal of Bioinformatics Research and Applications (IJBRA):
Login with your Inderscience username and password:
Want to subscribe?
A subscription gives you complete access to all articles in the current issue, as well as to all articles in the previous three years (where applicable). See our Orders page to subscribe.
If you still need assistance, please email subs@inderscience.com