Autonomous Email Categorization using Machine Learning Models in Thunderbird Client
2025 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE credits
Student thesis
Abstract [en]
Email categorization is a significant challenge in today's digital workplace, where sorting through messages may become complicated. This thesis addresses this problem by developing and evaluating the Thunderbird email client's autonomous email categorization system.
Specifically, the study explores several machine learning models: Traditional algorithms (Logistic Regression, Support Vector Machines (SVM), Random Forest, XGBoost) and a Deep Learning model, Distil-BERT.
The study also showcases the development of a custom Thunderbird extension that integrates fine-tuned machine learning models through a FastAPI backend service, providing empirical verification of the proposed approach. This implementation supports single and ensemble model predictions, offering flexibility in dealing with the trade-off between accuracy and computational resource usage.
The findings of this research provide insights and lessons learned regarding the implementation, practical challenges, and future opportunities of automated email categorization, enriching the general understanding of using machine learning solutions for everyday productivity problems.
Place, publisher, year, edition, pages
2025. , p. 44
Keywords [en]
Distil-BERT, Transformer-Based Models, Machine Learning, Email Classification
National Category
Artificial Intelligence Computer Sciences
Identifiers
URN: urn:nbn:se:liu:diva-212896ISRN: LIU-IDA/LITH-EX-A--25/009--SEOAI: oai:DiVA.org:liu-212896DiVA, id: diva2:1950722
Subject / course
Computer science
Presentation
2025-03-28, Alan Turing, IDA, Linköpings universitet, Linköping, 10:30 (English)
Supervisors
Examiners
2025-05-062025-04-082025-05-06Bibliographically approved