Evaluating Machine Learning Methods for Bangla Text Emotion AnalysisShow others and affiliations
2024 (English)In: 2024 Asia Pacific Conference on Innovation in Technology (APCIT), Institute of Electrical and Electronics Engineers Inc. , 2024Conference paper, Published paper (Refereed)
Abstract [en]
Text-based emotion identification goes beyond simple sentiment analysis by capturing emotions in a more nuanced way, akin to shades of gray rather than just positive or negative sentiments. This paper details our experiments with emotion analysis on Bangla text. We collected a corpus of user comments from various social media groups discussing socioeconomic and political topics to identify six emotions: sadness, disgust, surprise, fear, anger, and joy. We evaluated the performance of four widely used machine learning algorithms—RF, DT, k-NN, and SVM—alongside three popular deep learning algorithms—CNNs, LSTM, and Transformer Learning—using TF-IDF feature extraction and word embedding techniques. The results showed that among the machine learning algorithms, DT, RF, k-NN, and SVM achieved accuracy scores of 82%, 84%, 73%, and 83%, respectively. In contrast, the deep learning models CNN and LSTM both achieved higher performance with an accuracy of 85% and 86% respectively. These findings highlight the effectiveness of traditional ML and DL approaches in detecting emotions from Bangla social media texts, indicating significant potential for further advancements in this area.
Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers Inc. , 2024.
Keywords [en]
Emotion Analysis, RF, DT, KNN, SVM, CNN, LSTM and Transformer Model
National Category
Computer Sciences Natural Language Processing
Research subject
Cyber Security
Identifiers
URN: urn:nbn:se:ltu:diva-110399DOI: 10.1109/APCIT62007.2024.10673544Scopus ID: 2-s2.0-85205532210OAI: oai:DiVA.org:ltu-110399DiVA, id: diva2:1906193
Conference
Asia Pacific Conference on Innovation in Technology (APCIT 2024), Mysuru, India, July 26-27, 2024
Note
ISBN for host publication: 979-8-3503-6153-7
2024-10-162024-10-162025-02-01Bibliographically approved