liu.seSearch for publications in DiVA
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Semi-automated classification for multi-label open-ended questions
Western Univ, Canada.
Univ Waterloo, Canada.
Linköping University, Department of Health, Medicine and Caring Sciences, Division of Society and Health. Linköping University, Faculty of Medicine and Health Sciences. Region Östergötland, Regionledningskontoret, Enheten för folkhälsa.ORCID iD: 0000-0002-6281-7783
2020 (English)In: Survey Methodology, ISSN 0714-0045, E-ISSN 1492-0921, Vol. 46, no 2, p. 265-282Article in journal (Refereed) Published
Abstract [en]

In surveys, text answers from open-ended questions are important because they allow respondents to provide more information without constraints. When classifying open-ended questions automatically using supervised learning, often the accuracy is not high enough. Alternatively, a semi-automated classification strategy can be considered: answers in the easy-to-classify group are classified automatically, answers in the hard-to-classify group are classified manually. This paper presents a semi-automated classification method for multi-label open-ended questions where text answers may be associated with multiple classes simultaneously. The proposed method effectively combines multiple probabilistic classifier chains while avoiding prohibitive computational costs. The performance evaluation on three different data sets demonstrates the effectiveness of the proposed method.

Place, publisher, year, edition, pages
Ottawa, ON, Canada: STATISTICS CANADA , 2020. Vol. 46, no 2, p. 265-282
Keywords [en]
semi-automated classification; Open-ended questions; Multi-label data
National Category
Probability Theory and Statistics
Identifiers
URN: urn:nbn:se:liu:diva-172426ISI: 000599497900005OAI: oai:DiVA.org:liu-172426DiVA, id: diva2:1515581
Available from: 2021-01-10 Created: 2021-01-10 Last updated: 2021-01-13Bibliographically approved

Open Access in DiVA

No full text in DiVA

Search in DiVA

By author/editor
Wenemark, Marika
By organisation
Division of Society and HealthFaculty of Medicine and Health SciencesEnheten för folkhälsa
In the same journal
Survey Methodology
Probability Theory and Statistics

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 97 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf