An evaluation of the use of XML for representation, querying, and analysis of molecular interactions
2006 (English)In: Current Trends in Database Technology – EDBT 2006: EDBT 2006 Workshops PhD, DataX, IIDB, IIHA, ICSNW, QLQP, PIM, PaRMA, and Reactivity on the Web, Munich, Germany, March 26-31, 2006, Revised Selected Papers / [ed] Torsten Grust, Hagen Höpfner, Arantza Illarramendi, Stefan Jablonski, Marco Mesiti, Sascha Müller, Paula-Lavinia Patranjan, Kai-Uwe Sattler, Myra Spiliopoulou and Jef Wijsen, Springer Berlin/Heidelberg, 2006, Vol. 4254, 220-233 p.Chapter in book (Refereed)
Currently, biology researchers rapidly generate new information on how genes, proteins and other molecules interact in living organisms. To completely understand the machinery underlying life it is necessary to integrate and analyze these large quantities of data. As one step in this direction, new standards for describing molecular interactions have been defined based on XML. This work evaluates the usage of the XML Query language XQuery for molecular interactions, as it would be of great benefit to the user to work directly on data represented in the new standards. We use and compare a set of available XQuery implementations, eXist, X-Hive, Sedna and QizX/open for querying and analysis on data exported from available databases. Our conclusion is that XQuery can easily be used for the most common queries in this domain but is not feasible for more complex analyses. In particular, for queries containing path analysis the available XQuery implementations have poor performance and an extension of the GTL package clearly outperforms XQuery. The paper ends with a discussion regarding the usability of XQuery in this domain. In particular we point out the need for more efficient graph handling and that XQuery also requires the user to understand the exact XML format of each dataset.
Place, publisher, year, edition, pages
Springer Berlin/Heidelberg, 2006. Vol. 4254, 220-233 p.
Lecture Notes in Computer Science, ISSN 0302-9743 (print), 1611-3349 (online) ; 4254
Engineering and Technology
IdentifiersURN: urn:nbn:se:liu:diva-48058DOI: 10.1007/11896548_20ISBN: 978-3-540-46788-5ISBN: e-978-3-540-46790-8ISBN: 3-540-46788-2OAI: oai:DiVA.org:liu-48058DiVA: diva2:268954