liu.seSearch for publications in DiVA
Change search
ReferencesLink to record
Permanent link

Direct link
Translating medical terminologies through word alignment in parallel text corpora
Linköping University, Department of Computer and Information Science, NLPLAB - Natural Language Processing Laboratory. Linköping University, The Institute of Technology.
2009 (English)In: JOURNAL OF BIOMEDICAL INFORMATICS, ISSN 1532-0464, Vol. 42, no 4, 692-701 p.Article in journal (Refereed) Published
Abstract [en]

Developing international multilingual terminologies is a time-consuming process. We present a methodology which aims to ease this process by automatically acquiring new translations of medical terms based on word alignment in parallel text corpora, and test it on English and French. After collecting a parallel, English-French corpus, we detected French translations of English terms from three terminologies-MeSH, SNOMED CT and the MedlinePlus Health Topics. We obtained respectively for each terminology 74.8%, 77.8% and 76.3% of linguistically correct new translations. A sample of the MeSH translations was submitted to expert review and 61.5% were deemed desirable additions to the French MeSH. In conclusion, we successfully obtained good quality new translations, which underlines the suitability of using alignment in text corpora to help translating terminologies. Our method may be applied to different European languages and provides a methodological framework that may be used with different processing tools.

Place, publisher, year, edition, pages
2009. Vol. 42, no 4, 692-701 p.
Keyword [en]
Natural Language Processing; Medical terminology; Multilinguality; Parallel corpora; Word alignment
National Category
Engineering and Technology
URN: urn:nbn:se:liu:diva-20170DOI: 10.1016/j.jbi.2009.03.002OAI: diva2:233680
Available from: 2009-09-01 Created: 2009-08-31 Last updated: 2009-09-01

Open Access in DiVA

No full text

Other links

Publisher's full text

Search in DiVA

By author/editor
Merkel, Magnus
By organisation
NLPLAB - Natural Language Processing LaboratoryThe Institute of Technology
Engineering and Technology

Search outside of DiVA

GoogleGoogle Scholar
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Altmetric score

Total: 24 hits
ReferencesLink to record
Permanent link

Direct link