Title: Aggregated clustering for grouping of users based on web page navigation behaviour
Authors: R. GeethaRamani; P. Revathy; B. Lakshmi
Addresses: Department of Information Science and Technology, CEG, Anna University, Chennai, India ' Department of Computer Science and Engineering, Rajalakshmi Engineering College, Chennai, India ' Department of Information Science and Technology, CEG, Anna University, Chennai, India
Abstract: In this epoch, a significant amount of patterns are retrieved using data mining techniques. Clustering is one of the technique that plays an vital role in web mining. This paper works on MSNBC dataset with the average access length of 6. It aims to cluster the users based on their navigation behaviour. An iterative aggregated clustering is proposed, in which various clustering algorithms like EM clustering, farthest first, K-means clustering, density based cluster, filtered cluster are applied on the dataset. The resultant clusters from various algorithms are aggregated correspondingly and the frequency of instances in each cluster is determined. Then the instance with two-third majority is grouped in that cluster. The work revealed that 91% of users clustered in the first iteration under 17 clusters and 99% of users in subsequent iterations in another 17 clusters and rest of the users are grouped as one cluster, resulting 35 hard clusters.
Keywords: data mining; MSNBC; web usage mining; hard clusters; aggregated clustering.
DOI: 10.1504/IJRIS.2019.099853
International Journal of Reasoning-based Intelligent Systems, 2019 Vol.11 No.2, pp.161 - 169
Received: 09 Sep 2017
Accepted: 16 Mar 2018
Published online: 24 May 2019 *