liu.seSearch for publications in DiVA
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Efficient Characterization of Short Anelloviruses Fragments Found in Metagenomic Samples
Linköping University, Department of Physics, Chemistry and Biology, Bioinformatics. Linköping University, The Institute of Technology.
2012 (English)Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE creditsStudent thesis
Abstract [en]

Some viral metagenomic serum samples contain a huge amount of Anellovirus, which is a genetically diverse family with a few conserved regions making it hard to efficiently characterize. Multiple sequence alignment of the Anelloviruses found in the sample must be constructed to get a clear picture of Anellovirus diversity and to identify stable regions. Using available multiple sequence alignment software directly on these fragments results in an MSA of a very poor quality due to their diversity, misaligned regions and low-quality regions present in the sequence.

An efficient MSA must be constructed in order to characterize these Anellovirus present in the samples. Pairwise alignment is used to align one fragment to the database sequences at a time. The fragments are then aligned to the database sequences using the start and end position from the pairwise alignment results. The algorithm will also exclude non-aligned portions of the fragments, as these are very hard to handle properly and are often products of misassembly or chimeric sequenced fragments. Other tools to aid further analysis were developed, such as finding a non-overlapping window that contains the most fragments, find consensus of the alignment and extract any regions from the MSA for further analysis.

An MSA was constructed with a high percent of correctly aligned bases compared to an MSA constructed using MSA softwares. The minimal number of genomes found in the sampled sequence was found as well as a distribution of the fragments along the database sequence. Moreover, highly conserved region and the window containing most fragments were extracted from the MSA and phylogenetic trees were constructed for these regions. 

Place, publisher, year, edition, pages
2012. , 31 p.
Keyword [en]
Anellovirus, TTV, torque teno virus, MSA construction, multiple sequence alignment construction, anellovirus characterisation
National Category
Other Medical Engineering
Identifiers
URN: urn:nbn:se:liu:diva-85813ISRN: LITH-IFM-A-EX--12/2711--SEOAI: oai:DiVA.org:liu-85813DiVA: diva2:572806
Subject / course
Master's Program Biomedical Engineering
Presentation
2012-11-12, 19:42 (English)
Uppsok
Technology
Supervisors
Examiners
Available from: 2012-12-04 Created: 2012-11-28 Last updated: 2012-12-04Bibliographically approved

Open Access in DiVA

fulltext(1059 kB)215 downloads
File information
File name FULLTEXT01.pdfFile size 1059 kBChecksum SHA-512
e482743c9454521b36dda77bba5495b7b82c0744010a604d0fe187a3033a480458c9962286349b186a2273e3dbba834a754107af31080a6cf1b0b9abc7833e3e
Type fulltextMimetype application/pdf

By organisation
BioinformaticsThe Institute of Technology
Other Medical Engineering

Search outside of DiVA

GoogleGoogle Scholar
Total: 215 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 168 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf