The Impact of Cohesion Errors in Extraction Based Summaries
2014 (English). In: LREC 2014 - Ninth International Conference on Language Resources and Evaluation, European Language Resources Association, 2014, 1575-1582 p. Conference paper (Refereed)
We present results from an eye-tracking study of automatic text summarization. Automatic text summarization is a growing field in today's Internet-based society, but automatically creating perfect summaries is challenging. One problem is that extraction-based summaries often contain cohesion errors. Using an eye-tracking camera, we studied the nature of four different types of cohesion errors that occur in extraction-based summaries. A total of 23 participants read and rated four different texts and marked the most difficult areas of each text. Statistical analysis of the data revealed that absent cohesion or context and broken anaphoric references (pronouns) caused some disturbance in reading, but that the impact is restricted to the effort required to read the text rather than to its comprehension. However, erroneous anaphoric references (pronouns) were not always detected by the participants, which poses a problem for automatic text summarizers. The study also revealed other potentially disturbing factors.
Place, publisher, year, edition, pages
European Language Resources Association, 2014. 1575-1582 p.
Keywords
Automatic summarization, eye-tracking studies, cohesion
Language Technology (Computational Linguistics)
Identifiers
URN: urn:nbn:se:liu:diva-117528
ISI: 000355611003030
ISBN: 978-2-9517408-8-4
OAI: oai:DiVA.org:liu-117528
DiVA: diva2:809198
9th International Conference on Language Resources and Evaluation (LREC)