Title: Enhancing the accuracy of transformer-based embeddings for sentiment analysis in social big data

Authors: Wiem Zemzem; Moncef Tagina

Addresses: LARIA, ENSI, University of Manouba, Manouba, Tunisia ' LARIA, ENSI, University of Manouba, Manouba, Tunisia

Abstract: Social media have opened a venue for online users to post and share their opinions in different life aspects, which leads to big data. As a result, sentiment analysis has become a fast-growing field of research in Natural Language Processing (NLP) owing to its central role in analysing the public's opinion in many areas, including advertising, business and marketing. This study proposes a transformer-based approach, which integrates contextualised words with Part-of-Speech (POS) embedding. Then, the enhanced word vector is forwarded to a hybrid deep learning architecture combining a Convolutional Neural Network (CNN) and a Bidirectional Long Short-Term-Memory (BiLSTM) to discover the post's sentiment. Extensive experiments on four review data sets from diverse domains demonstrate that the proposed method outperforms other machine learning approaches in terms of accuracy.

Keywords: deep learning; sentiment analysis; word embedding; big data; natural language processing.

DOI: 10.1504/IJCAT.2023.135525

International Journal of Computer Applications in Technology, 2023 Vol.73 No.3, pp.169 - 177

Received: 28 Dec 2021
Accepted: 19 Mar 2022

Published online: 18 Dec 2023 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article