Explainable Bengali Multiclass News ClassificationShow others and affiliations
2023 (English)In: 2023 26th International Conference on Computer and Information Technology (ICCIT), Institute of Electrical and Electronics Engineers (IEEE), 2023Conference paper, Published paper (Refereed)
Abstract [en]
The automatic classification of news articles is crucial in the era of information overflow as it assists readers in accessing relevant information in a timely manner. Even though text classification is not a new area of study, there is potential for advancement concerning the Bengali language. Unlike other languages, Bengali is a complex language, and most of the datasets available online are imbalanced in terms of class label distribution. To increase the performance of classification methods and make them robust to handle imbalanced data, in this work, we propose a model consisting of pre-trained BERT architecture. We use a publicly available dataset of Bengali news articles with nine classes and achieve 92% accuracy. Along with the classification, explaining the model and the result is necessary for the application of trustworthy Artificial Intelligence. From this motivation, we use Integrated Gradient, an explainable AI technique, to explain the outcome of our model. We show which words in a news article affect the model to choose a particular class.
Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2023.
Keywords [en]
BERT, Text Classification, Bengali News, Trustworthy AI, Explainability
National Category
Natural Language Processing
Identifiers
URN: urn:nbn:se:liu:diva-201246DOI: 10.1109/ICCIT60459.2023.10441218ISBN: 9798350359015 (electronic)ISBN: 9798350359022 (print)OAI: oai:DiVA.org:liu-201246DiVA, id: diva2:1841508
Conference
26th International Conference on Computer and Information Technology (ICCIT), Cox's Bazar, Bangladesh, 13-15 December 2023.
2024-02-282024-02-282025-02-07Bibliographically approved