Title: An incremental clustering using bat-spotted hyena optimiser with spark framework

Authors: Ch. Vidyadhari; N. Sandhya; N. Ramakrishnaiah

Addresses: IT Department, Department of Computer Science and Engineering, JNTUK, Kakinada, India ' VNR Vignana Jyothi Institute of Engineering and Technology, Telangana, 500090, India ' CSE Department, University College of Engineering, JNTUK, Kakinada, India

Abstract: Recently, clustering techniques gained more importance due to huge range of applications in the field of data mining, pattern recognition, data clustering, bio informatics and many other applications. In this paper, a new approach called spotted hyena bat algorithm (SHBA)-based incremental clustering with spark framework is proposed. The SHBA algorithm is derived by integrating the spotted hyena optimiser (SHO) and bat algorithm (BA), that is highly desirable for handling high dimensional data and provides a unique solution with high satisfactory results. The process of incremental clustering is performed in a spark framework by considering the master and the slave nodes. The proposed approach effectively clusters the data, especially high dimensional data and is more robust against various attacks and provides more unified solution. Moreover, the proposed SHBA achieves higher performance by considering the evaluation metrics, such as Jaccard coefficient, rand coefficient, and clustering accuracy of 0.950, 0.943, and 0.962.

Keywords: spark architecture; master and slave nodes; incremental clustering; entropy function; spotted hyena optimisation; SHO.

DOI: 10.1504/IJIIDS.2023.131414

International Journal of Intelligent Information and Database Systems, 2023 Vol.16 No.2, pp.167 - 195

Received: 20 Oct 2021
Accepted: 30 Oct 2022

Published online: 09 Jun 2023 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article