liu.seSearch for publications in DiVA
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Proteus: A new predictor for protean segments
Linköping University, Department of Physics, Chemistry and Biology, Biotechnology. Linköping University, Faculty of Science & Engineering.
2015 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesis
Abstract [en]

The discovery of intrinsically disordered proteins has led to a paradigm shift in protein science. Many disordered proteins have regions that can transform from a disordered state to an ordered. Those regions are called protean segments.

Many intrinsically disordered proteins are involved in diseases, including Alzheimer's disease, Parkinson's disease and Down's syndrome, which makes them prime targets for medical research. As protean segments often are the functional part of the proteins, it is of great importance to identify those regions.

This report presents Proteus, a new predictor for protean segments. The predictor uses Random Forest (a decision tree ensemble classifier) and is trained on features derived from amino acid sequence and conservation data.

Proteus compares favourably to state of the art predictors and performs better than the competition on all four metrics: precision, recall, F1 and MCC.

The report also looks at the differences between protean and non-protean regions and how they differ between the two datasets that were used to train the predictor.

Place, publisher, year, edition, pages
2015. , 49 p.
Keyword [en]
bioinformatics, protein, machine learning, predictor, protean segments, molecular recognition feature, intrinsically disordered proteins, proteus
National Category
Bioinformatics and Systems Biology Bioinformatics (Computational Biology)
Identifiers
URN: urn:nbn:se:liu:diva-121260ISRN: LITH-IFM-A-EX--15/3118--SEOAI: oai:DiVA.org:liu-121260DiVA: diva2:852903
Subject / course
Biotechnology
Supervisors
Examiners
Available from: 2015-11-27 Created: 2015-09-10 Last updated: 2015-11-27Bibliographically approved

Open Access in DiVA

fulltext(683 kB)94 downloads
File information
File name FULLTEXT01.pdfFile size 683 kBChecksum SHA-512
8ccf189be0ce64fe3cf8572b010f2db61282666fc789097611a33cd3635a57d0b7d67723b12db4ad72caeb10deda8625e7cfcead3153d162b9b8e9b98be1e424
Type fulltextMimetype application/pdf

Search in DiVA

By author/editor
Söderquist, Fredrik
By organisation
BiotechnologyFaculty of Science & Engineering
Bioinformatics and Systems BiologyBioinformatics (Computational Biology)

Search outside of DiVA

GoogleGoogle Scholar
Total: 94 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 207 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf