Can a graded reader of authentic material be generated?
Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesis
The thesis investigates if a graded reader for English leveled to the CEFR levels by using the English Vocabulary Profile (EVP) dictionary can be generated from a corpus of authentic material. It was tested on Wikipedia and the ukWaC corpus. There were some problems in making correctmatches between the words in the EVP word lists with the tagged words of the corpora. The results show it might be possible to find enough suitable texts to generate a graded reader for at least the higher CEFR levels if only lemmas are considered. If also the POS tags should be matched between the word list and the corpora the errors were too big to be able to give a conclusive answer.
Place, publisher, year, edition, pages
2013. , 69 p.
IdentifiersURN: urn:nbn:se:liu:diva-100131ISRN: LIU-IDA/LITH-EX-A--13/050--SEOAI: oai:DiVA.org:liu-100131DiVA: diva2:660072
Subject / course
Computer and information science at the Institute of Technology