Similarity searching in DNA sequences by spectral distortion measures
2006 (English)In: Advances in Data Mining. Applications in Medicine, Web Mining, Marketing, Image and Signal Mining: 6th Industrial Conference on Data Mining, ICDM 2006, Leipzig, Germany, July 14-15, 2006. Proceedings / [ed] Petra Perner, Springer Berlin/Heidelberg, 2006, 24-37 p.Chapter in book (Refereed)Text
Searching for similarity among biological sequences is an important research area of bioinformatics because it can provide insight into the evolutionary and genetic relationships between species that open doors to new scientific discoveries such as drug design and treament. In this paper, we introduce a novel measure of similarity between two biological sequences without the need of alignment. The method is based on the concept of spectral distortion measures developed for signal processing. The proposed method was tested using a set of six DNA sequences taken from Escherichia coli K-12 and Shigella flexneri, and one random sequence. It was further tested with a complex dataset of 40 DNA sequences taken from the GenBank sequence database. The results obtained from the proposed method are found superior to some existing methods for similarity measure of DNA sequences.
Place, publisher, year, edition, pages
Springer Berlin/Heidelberg, 2006. 24-37 p.
Lecture Notes in Computer Science, ISSN 0302-9743 (print), 1611-3349 (online) ; 4065
IdentifiersURN: urn:nbn:se:liu:diva-125012DOI: 10.1007/11790853_3ISBN: 978-3-540-36036-0 (Print)ISBN: 978-3-540-36037-7 (Online)OAI: oai:DiVA.org:liu-125012DiVA: diva2:902788