Enhancing Low-Resource Bangla Fake News Detection through Deep Convolutional Neural NetworksShow others and affiliations
2024 (English)In: Intelligent Computing and Optimization: Proceedings of the 7th International Conference on Intelligent Computing and Optimization 2023 (ICO2023), Volume 2 / [ed] Pandian Vasant; Vladimir Panchenko; Elias Munapo; Gerhard-Wilhelm Weber; J. Joshua Thomas; Rolly Intan; Mohammad Shamsul Arefin, Springer Science and Business Media Deutschland GmbH , 2024, p. 104-114Chapter in book (Refereed)
Abstract [en]
In today’s digital landscape, the detection of false information is of utmost importance, especially in languages like Bangla, which lack abundant natural language processing (NLP) resources. The rapid spread of misinformation through online platforms, particularly within Bangla-speaking communities, has become a pressing concern. However, the limited availability of NLP tools for Bangla has posed significant challenges in developing reliable models for identifying deceptive content. In response to these challenges, researchers have made notable progress in classifying Bangla news using deep learning techniques and language models like BERT. This study presents a detailed exploration of a deep convolutional neural network (CNN) model tailored for categorizing Bangla news articles as authentic or counterfeit. By integrating BERT (Bangla Electra) into the model’s architecture, an impressive accuracy rate of 94.33% was achieved, with our proprietary model surpassing this at 94.5%.To ensure the reliability of results, a range of NLP techniques were applied during data preprocessing, including data cleansing, tokenization, stop word and punctuation removal, and stemming. Feature extraction involved the combined use of TF-IDF and Bag of Words techniques. The dataset, obtained from Kaggle, comprised 7,000 genuine news texts and 1,000 counterfeit news texts. In summary, this research significantly contributes to Bangla news classification by showcasing the effectiveness of deep CNN models in accurately discerning between legitimate and fabricated news articles.
Place, publisher, year, edition, pages
Springer Science and Business Media Deutschland GmbH , 2024. p. 104-114
Series
Lecture Notes in Networks and Systems, ISSN 2367-3370, E-ISSN 2367-3389 ; 1167
National Category
Computer Sciences
Research subject
Cyber Security
Identifiers
URN: urn:nbn:se:ltu:diva-111488DOI: 10.1007/978-3-031-73318-5_11Scopus ID: 2-s2.0-85215692547OAI: oai:DiVA.org:ltu-111488DiVA, id: diva2:1934361
Note
ISBN for host publication: 978-3-031-73317-8 (Print), 978-3-031-73318-5 (Online)
2025-02-042025-02-042025-10-21Bibliographically approved