liu.seSearch for publications in DiVA
Change search
ReferencesLink to record
Permanent link

Direct link
An XML-based Database of Molecular Pathways
Linköping University, Department of Computer and Information Science.
2005 (English)Independent thesis Basic level (professional degree), 20 points / 30 hpStudent thesisAlternative title
En XML-baserad databas för molekylära reaktioner (Swedish)
Abstract [en]

Research of protein-protein interactions produce vast quantities of data and there exists a large number of databases with data from this research. Many of these databases offers the data for download on the web in a number of different formats, many of them XML-based.

With the arrival of these XML-based formats, and especially the standardized formats such as PSI-MI, SBML and BioPAX, there is a need for searching in data represented in XML. We wanted to investigate the capabilities of XML query tools when it comes to searching in this data. Due to the large datasets we concentrated on native XML database systems that in addition to search in XML data also offers storage and indexing specially suited for XML documents.

A number of queries were tested on data exported from the databases IntAct and Reactome using the XQuery language. There were both simple and advanced queries performed. The simpler queries consisted of queries such as listing information on a specified protein or counting the number of reactions.

One central issue with protein-protein interactions is to find pathways, i.e. series of interconnected chemical reactions between proteins. This problem involve graph searches and since we suspected that the complex queries it required would be slow we also developed a C++ program using a graph toolkit.

The simpler queries were performed relatively fast. Pathway searches in the native XML databases took long time even for short searches while the C++ program achieved much faster pathway searches.

Place, publisher, year, edition, pages
Institutionen för datavetenskap , 2005. , 118 p.
Keyword [en]
XML, native XML databases, XQuery, protein-protein interactions, pathway search
National Category
Computer Science
URN: urn:nbn:se:liu:diva-3717ISRN: LITH-IDA-EX--05/051--SEOAI: diva2:20435
Available from: 2005-09-07 Created: 2005-09-07

Open Access in DiVA

fulltext(904 kB)4200 downloads
File information
File name FULLTEXT01.pdfFile size 904 kBChecksum MD5
Type fulltextMimetype application/pdf

By organisation
Department of Computer and Information Science
Computer Science

Search outside of DiVA

GoogleGoogle Scholar
Total: 4200 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 5738 hits
ReferencesLink to record
Permanent link

Direct link