Title: A novel machine extraction algorithm for implicit and explicit keywords based on dynamic web metadata of scientific scholars' corpus
Authors: Mawloud Mosbah
Addresses: LRES Laboratory, Informatics Department, Faculty of Sciences, University 20 Août 1955, Skikda, Algeria
Abstract: Keywords extraction, as an operation to construct metadata, is an important pre-processing task considered by many natural language processing applications such as text summarisation, information retrieval, and clustering of documents. In this paper, we introduce a novel machine extraction algorithm for implicit and explicit keywords. The algorithm relies on a dynamic corpus of similar documents built by information retrieval engines. In addition to the direct utilisation of the keywords for similar documents, our algorithm combines some basic techniques. The given results, compared with some basic methods of the literature, seem to be very promising and we claim also the efficiency of our solution.
Keywords: natural language processing; keywords extraction; automatic construction of metadata; implicit keywords; explicit keywords.
DOI: 10.1504/IJWET.2023.131136
International Journal of Web Engineering and Technology, 2023 Vol.18 No.1, pp.29 - 44
Received: 17 Mar 2022
Received in revised form: 22 Aug 2022
Accepted: 08 Jan 2023
Published online: 31 May 2023 *