liu.seSearch for publications in DiVA
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Investigation of multivariate prediction methods for the analysis of biomarker data
Linköping University, The Department of Physics, Chemistry and Biology.
2006 (English)Independent thesis Basic level (professional degree), 20 points / 30 hpStudent thesis
Abstract [en]

The paper describes predictive modelling of biomarker data stemming from patients suffering from multiple sclerosis. Improvements of multivariate analyses of the data are investigated with the goal of increasing the capability to assign samples to correct subgroups from the data alone.

The effects of different preceding scalings of the data are investigated and combinations of multivariate modelling methods and variable selection methods are evaluated. Attempts at merging the predictive capabilities of the method combinations through voting-procedures are made. A technique for improving the result of PLS-modelling, called bagging, is evaluated.

The best methods of multivariate analysis of the ones tried are found to be Partial least squares (PLS) and Support vector machines (SVM). It is concluded that the scaling have little effect on the prediction performance for most methods. The method combinations have interesting properties – the default variable selections of the multivariate methods are not always the best. Bagging improves performance, but at a high cost. No reasons for drastically changing the work flows of the biomarker data analysis are found, but slight improvements are possible. Further research is needed.

Place, publisher, year, edition, pages
Institutionen för fysik, kemi och biologi , 2006. , 56 p.
Keyword [en]
Multivariate analysis, multiple sclerosis, biomarker, predictive modeling, partial least squares, support vector machines, variable selection, bagging, neural networks
National Category
Bioinformatics (Computational Biology)
Identifiers
URN: urn:nbn:se:liu:diva-5889ISRN: LITH-IFM-EX--06/1556–-SEOAI: oai:DiVA.org:liu-5889DiVA: diva2:21549
Uppsok
fysik/kemi/matematik
Supervisors
Examiners
Available from: 2006-05-05 Created: 2006-05-05

Open Access in DiVA

fulltext(2602 kB)942 downloads
File information
File name FULLTEXT01.pdfFile size 2602 kBChecksum MD5
62a12da3618bf17cdd23cf68a9b3f48dec49c8fbd9714889b11dd9e7c0cbae6d04ab447d
Type fulltextMimetype application/pdf

By organisation
The Department of Physics, Chemistry and Biology
Bioinformatics (Computational Biology)

Search outside of DiVA

GoogleGoogle Scholar
Total: 942 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 378 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf