liu.seSearch for publications in DiVA
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Natural language processing in pathology: a scoping review
Symbiant Pathol Expert Centre, Netherlands; University of Amsterdam, Netherlands.
University of Amsterdam, Netherlands.
University of Amsterdam, Netherlands.
Linköping University, Department of Biomedical Engineering, Medical Informatics. Linköping University, Faculty of Science & Engineering. University of Amsterdam, Netherlands.
2016 (English)In: Journal of Clinical Pathology, ISSN 0021-9746, E-ISSN 1472-4146, Vol. 69, no 11, 949-955 p.Article, review/survey (Refereed) Published
Abstract [en]

Background Encoded pathology data are key for medical registries and analyses, but pathology information is often expressed as free text. Objective We reviewed and assessed the use of NLP (natural language processing) for encoding pathology documents. Materials and methods Papers addressing NLP in pathology were retrieved from PubMed, Association for Computing Machinery (ACM) Digital Library and Association for Computational Linguistics (ACL) Anthology. We reviewed and summarised the study objectives; NLP methods used and their validation; software implementations; the performance on the dataset used and any reported use in practice. Results The main objectives of the 38 included papers were encoding and extraction of clinically relevant information from pathology reports. Common approaches were word/phrase matching, probabilistic machine learning and rule-based systems. Five papers (13%) compared different methods on the same dataset. Four papers did not specify the method(s) used. 18 of the 26 studies that reported F-measure, recall or precision reported values of over 0.9. Proprietary software was the most frequently mentioned category (14 studies); General Architecture for Text Engineering (GATE) was the most applied architecture overall. Practical system use was reported in four papers. Most papers used expert annotation validation. Conclusions Different methods are used in NLP research in pathology, and good performances, that is, high precision and recall, high retrieval/removal rates, are reported for all of these. Lack of validation and of shared datasets precludes performance comparison. More comparative analysis and validation are needed to provide better insight into the performance and merits of these methods.

Place, publisher, year, edition, pages
BMJ PUBLISHING GROUP , 2016. Vol. 69, no 11, 949-955 p.
Keyword [en]
COMPUTER SYSTEMS; SURGICAL PATHOLOGY; REPORTS
National Category
Cancer and Oncology
Identifiers
URN: urn:nbn:se:liu:diva-133120DOI: 10.1136/jclinpath-2016-203872ISI: 000388006000002PubMedID: 27451435OAI: oai:DiVA.org:liu-133120DiVA: diva2:1055255
Available from: 2016-12-12 Created: 2016-12-09 Last updated: 2016-12-12

Open Access in DiVA

No full text

Other links

Publisher's full textPubMed

Search in DiVA

By author/editor
Cornet, Ronald
By organisation
Medical InformaticsFaculty of Science & Engineering
In the same journal
Journal of Clinical Pathology
Cancer and Oncology

Search outside of DiVA

GoogleGoogle Scholar

Altmetric score

Total: 66 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf