Exploring termhood using language models
2011 (English) In: Proceedings of the Workshop CHAT 2011: Creation, Harmonization and Application of Terminology Resources / [ed] Tatiana Gornostay, Andrejs Vasiljevs, Tartu University Library (Estonia): Northern European Association for Language Technology (NEALT), 2011, 32-35 p. Conference paper (Refereed)
Term extraction metrics are mostly based on frequency counts, which can be a problem when trying to extract previously unseen multi-word terms. This paper explores whether smoothed language models can be used instead. Although the paper examines only a simple use of language models, the results indicate that, with more refinement, smoothed language models could replace unsmoothed, frequency-count-based termhood metrics.
Place, publisher, year, edition, pages
Tartu University Library (Estonia): Northern European Association for Language Technology (NEALT), 2011. 32-35 p.
NEALT Proceedings Series, Vol. 12
automatic term extraction, computational terminology, machine learning
Language Technology (Computational Linguistics)
Identifiers
URN: urn:nbn:se:liu:diva-75238
OAI: oai:DiVA.org:liu-75238
DiVA: diva2:505124
NODALIDA 2011 Workshop Creation, Harmonization and Application of Terminology Resources, May 11, 2011, Riga, Latvia