Title: A comparative investigation of approaches for web search results clustering
Authors: Zaher Salah; Abdel-Rahman Al-Ghuwairi; Ahmad Aloqaily; Aladdin Baarah; Ayoub Alsarhan
Addresses: Prince Al Hussein Bin Abdullah II, Faculty for Information Technology, Hashemite University, P.O. Box 150459, Zarqa 13115, Jordan ' Prince Al Hussein Bin Abdullah II, Faculty for Information Technology, Hashemite University, P.O. Box 150459, Zarqa 13115, Jordan ' Prince Al Hussein Bin Abdullah II, Faculty for Information Technology, Hashemite University, P.O. Box 150459, Zarqa 13115, Jordan ' Prince Al Hussein Bin Abdullah II, Faculty for Information Technology, Hashemite University, P.O. Box 150459, Zarqa 13115, Jordan ' Prince Al Hussein Bin Abdullah II, Faculty for Information Technology, Hashemite University, P.O. Box 150459, Zarqa 13115, Jordan
Abstract: The huge diversity of topics covered by the online textual documents is expected as these documents are originated from various resources and cover various topics. Looking to all of these aspects, how to search and find specific documents relative to a specific topic in the user mind and how to facilitate the browsing process? And how to reflect properly the user intention to the information retrieval system to perform the searching and delivering task in precise and fast process? This paper investigates various techniques used for clustering the web search results in order to meet that user's information needs. The goal is not to facilitate finding specific documents only but also to make it easier to preview the general structure and distribution of the topics and to reveal hidden topics in the corpus. The aim of this paper is to provide the reader with the relevant background in much more detail.
Keywords: information retrieval; machine learning; text mining; web search results clustering; WSRC.
DOI: 10.1504/IJAIP.2020.109516
International Journal of Advanced Intelligence Paradigms, 2020 Vol.17 No.3/4, pp.342 - 366
Received: 20 Apr 2017
Accepted: 30 Jan 2018
Published online: 11 Sep 2020 *