A Comparison of Merging Strategies for Translation of German Compounds
2009 (English). In: Proceedings of the Student Research Workshop at the 12th Conference of the European Chapter of the ACL (EACL 2009), Association for Computational Linguistics, 2009, 61-69 p. Conference paper (Refereed)
In this article, compound processing for translation into German in a factored statistical MT system is investigated. Compounds are handled by splitting them prior to training, and merging the parts after translation. I have explored eight merging strategies using different combinations of external knowledge sources, such as word lists, and internal sources that are carried through the translation process, such as symbols or parts-of-speech. I show that for merging to be successful, some internal knowledge source is needed. I also show that an extra sequence model for parts-of-speech is useful for improving the order of compound parts in the output. The best merging results are achieved by a matching scheme for part-of-speech tags.
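The contrast between internal and external knowledge sources can be illustrated with a minimal sketch. It is not the paper's implementation: the `#` marker, the function names, and the example tokens are all assumptions made for illustration. The first function merges on a symbol carried through translation; the second relies only on an external word list.

```python
def merge_marked(tokens, marker="#"):
    """Merge each token ending in `marker` with the token that follows it.

    The marker stands in for an internal knowledge source: it is assumed
    to have been attached to non-final compound parts at splitting time.
    """
    merged = []
    pending = ""  # compound parts collected so far
    for tok in tokens:
        if tok.endswith(marker) and len(tok) > len(marker):
            pending += tok[: -len(marker)]
        else:
            merged.append(pending + tok)
            pending = ""
    if pending:  # a marked part with nothing after it is kept as a word
        merged.append(pending)
    return merged


def merge_with_word_list(tokens, known_words):
    """Merge adjacent token pairs whose concatenation is a known word.

    The word list stands in for an external knowledge source. Only pairs
    are considered here, and the second part is lowercased before joining.
    """
    merged = []
    i = 0
    while i < len(tokens):
        if i + 1 < len(tokens):
            candidate = tokens[i] + tokens[i + 1].lower()
            if candidate in known_words:
                merged.append(candidate)
                i += 2
                continue
        merged.append(tokens[i])
        i += 1
    return merged


# Hypothetical translation output with marked compound parts.
print(merge_marked(["die", "Fremd#", "sprachen#", "kenntnisse"]))
print(merge_with_word_list(["gute", "Fremd", "sprachen"], {"Fremdsprachen"}))
```

The word-list variant merges any adjacent pair that happens to form a known word, whether or not the pair came from a split compound, which is one way to see why the abstract argues that some internal knowledge source is needed.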
Place, publisher, year, edition, pages
Association for Computational Linguistics, 2009. 61-69 p.
Keywords: Natural language processing, machine translation, compounds
National Category: Computer Science; Language Technology (Computational Linguistics)
Identifiers: URN: urn:nbn:se:liu:diva-20318; OAI: oai:DiVA.org:liu-20318; DiVA: diva2:233949