liu.seSearch for publications in DiVA
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Managing imbalanced training data by sequential segmentation in machine learning
Linköping University, Department of Biomedical Engineering, Division of Biomedical Engineering.
2019 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesis
Abstract [en]

Imbalanced training data is a common problem in machine learning applications. Thisproblem refers to datasets in which the foreground pixels are significantly fewer thanthe background pixels. By training a machine learning model with imbalanced data, theresult is typically a model that classifies all pixels as the background class. A result thatindicates no presence of a specific condition when it is actually present is particularlyundesired in medical imaging applications. This project proposes a sequential system oftwo fully convolutional neural networks to tackle the problem. Semantic segmentation oflung nodules in thoracic computed tomography images has been performed to evaluate theperformance of the system. The imbalanced data problem is present in the training datasetused in this project, where the average percentage of pixels belonging to the foregroundclass is 0.0038 %. The sequential system achieved a sensitivity of 83.1 % representing anincrease of 34 % compared to the single system. The system only missed 16.83% of thenodules but had a Dice score of 21.6 % due to the detection of multiple false positives. Thismethod shows considerable potential to be a solution to the imbalanced data problem withcontinued development.

Place, publisher, year, edition, pages
2019. , p. 68
National Category
Medical Image Processing
Identifiers
URN: urn:nbn:se:liu:diva-155091ISRN: LIU-IMT-TFK-A–19/560–SEOAI: oai:DiVA.org:liu-155091DiVA, id: diva2:1296713
Subject / course
Biomedical Laboratory Science
Available from: 2019-03-19 Created: 2019-03-17 Last updated: 2019-03-19Bibliographically approved

Open Access in DiVA

fulltext(9398 kB)136 downloads
File information
File name FULLTEXT02.pdfFile size 9398 kBChecksum SHA-512
0f3ddbb0aa72607ac4ec0405a9416555d1f1ff4a5a7c332fb17c355ebaa6e4461860031246d347cb081afcc88ce6fba64032980039369c0fb1a6a82764ecbeed
Type fulltextMimetype application/pdf

By organisation
Division of Biomedical Engineering
Medical Image Processing

Search outside of DiVA

GoogleGoogle Scholar
Total: 136 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 1247 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf