Title: A hybrid optimal weighting scheme and machine learning for rendering sentiments in tweets
Authors: Walid Cherif; Abdellah Madani; Mohamed Kissi
Addresses: Laboratory LIMA, Department of Computer Science, University Chouaib Doukkali, Faculty of Sciences, B.P. 20, 24000, El Jadida, Morocco ' Laboratory LAROSERI, Department of Computer Science, University Chouaib Doukkali, Faculty of Sciences, B.P. 20, 24000, El Jadida, Morocco ' Laboratory LIM, Department of Computer Science, University Hassan II Casablanca, Faculty of Sciences and Technology, B.P. 146, 20650, Mohammedia, Morocco
Abstract: Over recent years, the world has experienced an explosive growth in the volume of shared web texts. Everyday, a huge volume of opinions expressed in various forms such as articles, reviews and tweets is generated. In general, opinion mining refers to the task of extracting opinions, and sentiment analysis is the technique that extracts subjectivity and polarity; in other words, it determines whether a text is positive or negative (Taboada et al., 2011). Arabic sentiment analysis is conducted in this study using a publically available data set written in both modern standard Arabic and the Jordanian dialect. A new mathematical approach is introduced to determine the polarity of the tweet by using four functions whose parameters are the solutions of a linear program. These functions are then classified using support vector machines and K-nearest neighbours. The results show that the proposed approach is considerably reliable in Arabic sentiment analysis.
Keywords: automatic language processing; low level light stemming; sentiment analysis; support vector machines; SVM; k-nearest neighbour; KNN; hybrid weighting; optimal weighting; machine learning; sentiments; tweets; Twitter; Arabic.
DOI: 10.1504/IJIEI.2016.080527
International Journal of Intelligent Engineering Informatics, 2016 Vol.4 No.3/4, pp.322 - 339
Received: 25 Sep 2015
Accepted: 13 May 2016
Published online: 28 Nov 2016 *