An evolutionary approach for high dimensional attribute selection Online publication date: Sat, 23-Aug-2014
by Lydia Boudjeloud-Assala
International Journal of Intelligent Information and Database Systems (IJIIDS), Vol. 6, No. 6, 2012
Abstract: We present a method to select a relevant dimension subset (with few or no loss of information) for clustering and outlier detection in high dimensional datasets. We use a heuristic search for relevant dimension subset selection based on genetic algorithm. The genetic algorithm fitness function for clustering uses the validity indexes of classification algorithms. We first use these validity indexes to select a dimension subset and then, to evaluate the clustering quality in this subspace. For outlier detection, the genetic algorithm fitness function is an individual distance-based function. The performances of our new approach of dimension selection are evaluated on simulations with different high dimensional datasets for the two applications (clustering and outlier detection). Furthermore, as the number of dimensions is low, it is possible to display the datasets in order to visually evaluate and interpret the obtained results.
Existing subscribers:
Go to Inderscience Online Journals to access the Full Text of this article.
If you are not a subscriber and you just want to read the full contents of this article, buy online access here.Complimentary Subscribers, Editors or Members of the Editorial Board of the International Journal of Intelligent Information and Database Systems (IJIIDS):
Login with your Inderscience username and password:
Want to subscribe?
A subscription gives you complete access to all articles in the current issue, as well as to all articles in the previous three years (where applicable). See our Orders page to subscribe.
If you still need assistance, please email subs@inderscience.com