liu.seSearch for publications in DiVA
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Generalisation and reliability of deep learning for digital pathology in a clinical setting
Linköping University, Department of Science and Technology, Media and Information Technology. Linköping University, Faculty of Science & Engineering.ORCID iD: 0000-0002-8734-6500
2023 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Deep learning (DL) is a subfield of artificial intelligence (AI) focused on developing algorithms that learn from data to perform some tasks that can aid humans in their daily life or work assignments. Research demonstrates the potential of DL in supporting pathologists with routine tasks like detecting breast cancer metastases and grading prostate cancer. However, a widespread adoption of DL technology in pathology labs has been slow for several reasons. DL models often exhibit performance variations across medical centres, patient subgroups, and even within the same centre over time. While collecting more data and retraining the algorithms seems like a straightforward solution, it is a costly and time-consuming process. Moreover, retraining DL systems with regulatory approvals is complex due to existing regulations. Another limitation of DL models is their inability to provide confidence estimates for predictions, leaving users in the dark about their reliability. Finally, establishing a close collaboration between the research community, vendors, and pathology labs is crucial for producing effective DL systems for patient care. However, this collaboration faces challenges like miscommunication, misalignment of goals, and misunderstanding priorities.

This thesis presents various approaches that could tackle the generalisation and reliability challenges faced by diagnostic DL systems for digital pathology with a strong emphasis on the clinical needs. To address the generalisation issues, an unsupervised approach to quantify expected changes in a model’s performance between two datasets is proposed. This approach can serve as an initial validation step before deploying diagnostic DL systems in clinical practice, reducing annotation costs. Additionally, an unsupervised framework based on generative models is proposed to identify substantially different inputs, known as out-of-distribution (OOD) samples. Detecting OOD samples plays a crucial role in enhancing the reliability of DL algorithms. Furthermore, several studies are conducted to explore what benefits uncertainty estimation could bring. Firstly, various uncertainty estimation approaches are extensively evaluated, focusing on identifying incorrect predictions and generalisability issues between medical centres and specific patient groups. In addition, the results reveal that combining uncertainty estimation methods with DL outputs leads to a more robust classification score, enhancing the overall performance and reliability of the classification process. Another study demonstrates that spatial uncertainty aggregation improves the effectiveness of uncertainty estimation in tumour segmentation tasks. This is evaluated on the detection of false negatives which may reduce the risk of missing tumour cells. Finally, the clinical prerequisites for developing and validating diagnostic DL systems for digital pathology are discussed, along with an overview of explainable AI techniques.

In conclusion, multiple approaches to facilitate the adoption of DL systems in clinical practice, addressing reliability, generalisability, and clinical needs aspects are discussed in this thesis. I believe that the extensive efforts in the research community will have a positive impact on the development, validation, and deployment of DL systems in digital pathology labs, empowering pathologists with trustworthy AI tools.

Place, publisher, year, edition, pages
Linköping: Linköping University Electronic Press, 2023. , p. 55
Series
Linköping Studies in Science and Technology. Dissertations, ISSN 0345-7524 ; 2337
Keywords [en]
Deep learning, Digital pathology, Generalisation, Uncertainty estimation, Anomaly detection, Data distribution shift
National Category
Biomedical Laboratory Science/Technology
Identifiers
URN: urn:nbn:se:liu:diva-198154DOI: 10.3384/9789180753005ISBN: 9789180752992 (print)ISBN: 9789180753005 (electronic)OAI: oai:DiVA.org:liu-198154DiVA, id: diva2:1800798
Public defence
2023-11-03, Wrannesalen, Center for Medical Image Science and Visualization, Linköping University Hospital, Linköping, 09:15 (English)
Opponent
Supervisors
Available from: 2023-09-28 Created: 2023-09-28 Last updated: 2023-09-28Bibliographically approved
List of papers
1. Survey of XAI in digital pathology
Open this publication in new window or tab >>Survey of XAI in digital pathology
2020 (English)In: Artificial intelligence and machine learning for digital pathology, Cham: Springer, 2020, p. 56-88Chapter in book (Refereed)
Abstract [en]

Artificial intelligence (AI) has shown great promise for diagnostic imaging assessments. However, the application of AI to support medical diagnostics in clinical routine comes with many challenges. The algorithms should have high prediction accuracy but also be transparent, understandable and reliable. Thus, explainable artificial intelligence (XAI) is highly relevant for this domain. We present a survey on XAI within digital pathology, a medical imaging sub-discipline with particular characteristics and needs. The review includes several contributions. Firstly, we give a thorough overview of current XAI techniques of potential relevance for deep learning methods in pathology imaging, and categorise them from three different aspects. In doing so, we incorporate uncertainty estimation methods as an integral part of the XAI landscape. We also connect the technical methods to the specific prerequisites in digital pathology and present findings to guide future research efforts. The survey is intended for both technical researchers and medical professionals, one of the objectives being to establish a common ground for cross-disciplinary discussions.

Place, publisher, year, edition, pages
Cham: Springer, 2020
Keywords
XAI, Digital pathology, AI, Medical imaging
National Category
Other Engineering and Technologies
Identifiers
urn:nbn:se:liu:diva-168439 (URN)10.1007/978-3-030-50402-1_4 (DOI)9783030504021 (ISBN)
Note

Forskningsfinansiär: Swedish e-Science Research Center

Available from: 2020-08-24 Created: 2020-08-24 Last updated: 2025-02-10Bibliographically approved
2. Unsupervised Anomaly Detection In Digital Pathology Using GANs
Open this publication in new window or tab >>Unsupervised Anomaly Detection In Digital Pathology Using GANs
2021 (English)In: 2021 IEEE 18th International Symposium On Biomedical Imaging (ISBI), Institute of Electrical and Electronics Engineers (IEEE), 2021, p. 1878-1882Conference paper, Published paper (Refereed)
Abstract [en]

Machine learning (ML) algorithms are optimized for the distribution represented by the training data. For outlier data, they often deliver predictions with equal confidence, even though these should not be trusted. In order to deploy ML-based digital pathology solutions in clinical practice, effective methods for detecting anomalous data are crucial to avoid incorrect decisions in the outlier scenario. We propose a new unsupervised learning approach for anomaly detection in histopathology data based on generative adversarial networks (GANs). Compared to the existing GAN-based methods that have been used in medical imaging, the proposed approach improves significantly on performance for pathology data. Our results indicate that histopathology imagery is substantially more complex than the data targeted by the previous methods. This complexity requires not only a more advanced GAN architecture but also an appropriate anomaly metric to capture the quality of the reconstructed images.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2021
Series
IEEE International Symposium on Biomedical Imaging, ISSN 1945-7928, E-ISSN 1945-8452
Keywords
digital pathology, deep learning, GAN, anomaly detection
National Category
Computer Sciences
Identifiers
urn:nbn:se:liu:diva-178631 (URN)10.1109/ISBI48211.2021.9434141 (DOI)000786144100399 ()9781665429474 (ISBN)9781665412469 (ISBN)
Conference
18th IEEE International Symposium on Biomedical Imaging (ISBI), Nice, FRANCE, apr 13-16, 2021
Note

Funding: Swedish e-Science Research Center; VINNOVAVinnova [2017-02447]

Available from: 2021-08-25 Created: 2021-08-25 Last updated: 2023-09-28Bibliographically approved
3. Generalisation effects of predictive uncertainty estimation in deep learning for digital pathology
Open this publication in new window or tab >>Generalisation effects of predictive uncertainty estimation in deep learning for digital pathology
2022 (English)In: Scientific Reports, E-ISSN 2045-2322, Vol. 12, no 1, article id 8329Article in journal (Refereed) Published
Abstract [en]

Deep learning (DL) has shown great potential in digital pathology applications. The robustness of a diagnostic DL-based solution is essential for safe clinical deployment. In this work we evaluate if adding uncertainty estimates for DL predictions in digital pathology could result in increased value for the clinical applications, by boosting the general predictive performance or by detecting mispredictions. We compare the effectiveness of model-integrated methods (MC dropout and Deep ensembles) with a model-agnostic approach (Test time augmentation, TTA). Moreover, four uncertainty metrics are compared. Our experiments focus on two domain shift scenarios: a shift to a different medical center and to an underrepresented subtype of cancer. Our results show that uncertainty estimates increase reliability by reducing a models sensitivity to classification threshold selection as well as by detecting between 70 and 90% of the mispredictions done by the model. Overall, the deep ensembles method achieved the best performance closely followed by TTA.

Place, publisher, year, edition, pages
Nature Portfolio, 2022
National Category
Bioinformatics (Computational Biology)
Identifiers
urn:nbn:se:liu:diva-185374 (URN)10.1038/s41598-022-11826-0 (DOI)000797636300082 ()35585087 (PubMedID)
Note

Funding Agencies|Swedish e-Science Research Center; VINNOVA [2017-02447]

Available from: 2022-05-31 Created: 2022-05-31 Last updated: 2023-09-28
4. Generalization of Deep Learning in Digital Pathology: Experience in Breast Cancer Metastasis Detection
Open this publication in new window or tab >>Generalization of Deep Learning in Digital Pathology: Experience in Breast Cancer Metastasis Detection
Show others...
2022 (English)In: Cancers, ISSN 2072-6694, Vol. 14, no 21, article id 5424Article in journal (Refereed) Published
Abstract [en]

Simple Summary Pathology is a cornerstone in cancer diagnostics, and digital pathology and artificial intelligence-driven image analysis could potentially save time and enhance diagnostic accuracy. For clinical implementation of artificial intelligence, a major question is whether the computer models maintain high performance when applied to new settings. We tested the generalizability of a highly accurate deep learning model for breast cancer metastasis detection in sentinel lymph nodes from, firstly, unseen sentinel node data and, secondly, data with a small change in surgical indication, in this case lymph nodes from axillary dissections. Model performance dropped in both settings, particularly on axillary dissection nodes. Retraining of the model was needed to mitigate the performance drop. The study highlights the generalization challenge of clinical implementation of AI models, and the possibility that retraining might be necessary. Poor generalizability is a major barrier to clinical implementation of artificial intelligence in digital pathology. The aim of this study was to test the generalizability of a pretrained deep learning model to a new diagnostic setting and to a small change in surgical indication. A deep learning model for breast cancer metastases detection in sentinel lymph nodes, trained on CAMELYON multicenter data, was used as a base model, and achieved an AUC of 0.969 (95% CI 0.926-0.998) and FROC of 0.838 (95% CI 0.757-0.913) on CAMELYON16 test data. On local sentinel node data, the base model performance dropped to AUC 0.929 (95% CI 0.800-0.998) and FROC 0.744 (95% CI 0.566-0.912). On data with a change in surgical indication (axillary dissections) the base model performance indicated an even larger drop with a FROC of 0.503 (95%CI 0.201-0.911). The model was retrained with addition of local data, resulting in about a 4% increase for both AUC and FROC for sentinel nodes, and an increase of 11% in AUC and 49% in FROC for axillary nodes. Pathologist qualitative evaluation of the retrained model s output showed no missed positive slides. False positives, false negatives and one previously undetected micro-metastasis were observed. The study highlights the generalization challenge even when using a multicenter trained model, and that a small change in indication can considerably impact the model s performance.

Place, publisher, year, edition, pages
MDPI, 2022
Keywords
digital pathology; artificial intelligence; computational pathology; deep learning; generalization; lymph node metastases; breast cancer
National Category
Cancer and Oncology
Identifiers
urn:nbn:se:liu:diva-190225 (URN)10.3390/cancers14215424 (DOI)000883894700001 ()36358842 (PubMedID)
Note

Funding Agencies|Vinnova [2017-02447]

Available from: 2022-11-30 Created: 2022-11-30 Last updated: 2025-09-05

Open Access in DiVA

fulltext(20790 kB)1507 downloads
File information
File name FULLTEXT02.pdfFile size 20790 kBChecksum SHA-512
7a4aa43cb3c7004f483d5fae80115f094bfc987fe3c940fbc56d9cc081e2a4b375e44f276eb5e86525c5c998b6b68554c31b94faf24e9549c10ed0561b830d27
Type fulltextMimetype application/pdf

Other links

Publisher's full text

Authority records

Pocevičiūtė, Milda

Search in DiVA

By author/editor
Pocevičiūtė, Milda
By organisation
Media and Information TechnologyFaculty of Science & Engineering
Biomedical Laboratory Science/Technology

Search outside of DiVA

GoogleGoogle Scholar
Total: 1510 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 1783 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf