liu.seSearch for publications in DiVA
Change search
ReferencesLink to record
Permanent link

Direct link
Using the probability of readability to order Swedish texts
Linköping University, Department of Computer and Information Science, Human-Centered systems. Linköping University, The Institute of Technology. Santa Anna IT Research Institute AB, Linköping, Sweden.ORCID iD: 0000-0002-6357-4461
Språkbanken, University of Gothenburg, Gothenburg.
2012 (English)In: Proceedings of the Fourth Swedish Language Technology Conference, 2012, 27-28 p.Conference paper, Abstract (Refereed)
Abstract [en]

In this study we present a new approach to rank readability in Swedish texts based on lexical, morpho-syntactic and syntactic analysis of text as well as machine learning. The basic premise and theory is presented as well as a small experiment testing the feasibility, but not actual performance, of the approach. The experiment shows that it is possible to implement a system based on the approach, however, the actual performance of such a system has not been evaluated as the necessary resources for such an evaluation does not yet exist for Swedish. The experiment also shows that a classifier based on the aforementioned linguistic analysis, on our limited test set, outperforms classifiers based on established metrics used to assess readability such as LIX, OVIX and Nominal Ratio.

Place, publisher, year, edition, pages
2012. 27-28 p.
National Category
Language Technology (Computational Linguistics)
URN: urn:nbn:se:liu:diva-93371OAI: diva2:624361
The Fourth Swedish Language Technology Conference, October 24-26, Lund 2012
Available from: 2013-05-31 Created: 2013-05-31 Last updated: 2013-05-31Bibliographically approved

Open Access in DiVA

falkenjackmuhlenbocksltc2012(85 kB)411 downloads
File information
File name FULLTEXT01.pdfFile size 85 kBChecksum SHA-512
Type fulltextMimetype application/pdf

Search in DiVA

By author/editor
Falkenjack, Johan
By organisation
Human-Centered systemsThe Institute of Technology
Language Technology (Computational Linguistics)

Search outside of DiVA

GoogleGoogle Scholar
Total: 411 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 490 hits
ReferencesLink to record
Permanent link

Direct link