liu.seSearch for publications in DiVA
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Machine Learning for Social Sciences: Stance Classification of User Messages on a Migrant-Critical Discussion Forum
Linnaeus University, Sweden.
Linköping University, Department of Science and Technology, Media and Information Technology. Linköping University, Faculty of Science & Engineering. Linnaeus University, Sweden. (iVis, INV)ORCID iD: 0000-0002-1907-7820
2021 (English)In: Proceedings of the 2021 Swedish Workshop on Data Science (SweDS) / [ed] Rafael M. Martins, Morgan Ericsson, Danny Weyns, Kostiantyn Kucher, IEEE , 2021Conference paper, Published paper (Refereed)
Abstract [en]

In this paper, we present our methodology for supervised stance classification of sparse and imbalanced social media data. We test our framework on a manually labeled dataset of 5700 messages about immigration in the Swedish language posted on the Flashback forum, a controversial online discussion platform. Our proposed approach currently achieves a macro- averaged F1-score of 0.72 for test data on a two-class problem compared against 0.27 for a baseline four-class model. Since effective classification of imbalanced and sparse textual data in under-resourced languages presents certain methodological challenges, our study contributes to a discussion on the best pathways to achieve highest model performance given the character of the data and unavailability of large training datasets for this task. Moreover, this work exemplifies the application of ML methodology to social media data, which can be particularly relevant for social scientists working in this area and interested in leveraging the possibilities of machine learning in their research field. This methodology and the obtained results provide a foundation for further in-depth analyses of social media texts in the Swedish language following a data-driven approach.

Place, publisher, year, edition, pages
IEEE , 2021.
Keywords [en]
social media, sentiment classification, stance classification, supervised learning, Swedish text data classification
National Category
Natural Language Processing Peace and Conflict Studies Other Social Sciences not elsewhere specified
Research subject
Social Sciences; Computer and Information Sciences Computer Science, Computer Science
Identifiers
URN: urn:nbn:se:liu:diva-181839DOI: 10.1109/SweDS53855.2021.9637718ISI: 000833296400001ISBN: 9781665418300 (electronic)ISBN: 9781665418317 (print)OAI: oai:DiVA.org:liu-181839DiVA, id: diva2:1620077
Conference
2021 Swedish Workshop on Data Science (SweDS), Växjö, Sweden, December 2-3, 2021
Projects
DISAAvailable from: 2021-12-14 Created: 2021-12-14 Last updated: 2025-02-20Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full text

Authority records

Yantseva, VictoriaKucher, Kostiantyn

Search in DiVA

By author/editor
Yantseva, VictoriaKucher, Kostiantyn
By organisation
Media and Information TechnologyFaculty of Science & Engineering
Natural Language ProcessingPeace and Conflict StudiesOther Social Sciences not elsewhere specified

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 181 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf