Classifying Implant-Bearing Patients via their Medical Histories: a Pre-Study on Swedish EMRs with Semi-Supervised GAN-BERTShow others and affiliations
2022 (English)In: LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, EUROPEAN LANGUAGE RESOURCES ASSOC-ELRA , 2022, p. 5428-5435Conference paper, Published paper (Refereed)
Abstract [en]
In this paper, we compare the performance of two BERT-based text classifiers whose task is to classify patients (more precisely, their medical histories) as having or not having implant(s) in their body. One classifier is a fully-supervised BERT classifier. The other one is a semi-supervised GAN-BERT classifier. Both models are compared against a fully-supervised SVM classifier. Since fully-supervised classification is expensive in terms of data annotation, with the experiments presented in this paper, we investigate whether we can achieve a competitive performance with a semi-supervised classifier based only on a small amount of annotated data. Results are promising and show that the semi-supervised classifier has a competitive performance when compared with the fully-supervised classifier.
Place, publisher, year, edition, pages
EUROPEAN LANGUAGE RESOURCES ASSOC-ELRA , 2022. p. 5428-5435
Keywords [en]
text classification; BERT; GAN-BERT; electronic medical records; EMR; clinical text mining
National Category
Other Computer and Information Science
Identifiers
URN: urn:nbn:se:liu:diva-192025ISI: 000889371705059ISBN: 9791095546726 (print)OAI: oai:DiVA.org:liu-192025DiVA, id: diva2:1740911
Conference
13th International Conference on Language Resources and Evaluation (LREC), Marseille, FRANCE, jun 20-25, 2022
Note
Funding Agencies|Vinnova (Swedens innovation agency) [2021-01699]
2023-03-022023-03-022023-03-02