liu.seSearch for publications in DiVA
Change search
ReferencesLink to record
Permanent link

Direct link
Comparing the Performance of NoSQL Approaches for Managing Archetype-Based Electronic Health Record Data
Linköping University, Department of Biomedical Engineering. Linköping University, Faculty of Science & Engineering. Departamento de Tecnologia da Informação e Educação em Saúde, Universidade do Estado do Rio de Janeiro, Rio de Janeiro, RJ, Brazil.
Departamento de Tecnologia da Informação e Educação em Saúde, Universidade do Estado do Rio de Janeiro, Rio de Janeiro, RJ, Brazil .
Linköping University, Department of Computer and Information Science, Database and information techniques. Linköping University, The Institute of Technology.
Linköping University, Department of Biomedical Engineering, Medical Informatics. Linköping University, Faculty of Science & Engineering. Region Östergötland.
Show others and affiliations
2016 (English)In: PLoS ONE, ISSN 1932-6203, E-ISSN 1932-6203, Vol. 11, no 3, e0150069Article in journal (Refereed) Published
Abstract [en]

This study provides an experimental performance evaluation on population-based queries of NoSQL databases storing archetype-based Electronic Health Record (EHR) data. There are few published studies regarding the performance of persistence mechanisms for systems that use multilevel modelling approaches, especially when the focus is on population-based queries. A healthcare dataset with 4.2 million records stored in a relational database (MySQL) was used to generate XML and JSON documents based on the openEHR reference model. Six datasets with different sizes were created from these documents and imported into three single machine XML databases (BaseX, eXistdb and Berkeley DB XML) and into a distributed NoSQL database system based on the MapReduce approach, Couchbase, deployed in different cluster configurations of 1, 2, 4, 8 and 12 machines. Population-based queries were submitted to those databases and to the original relational database. Database size and query response times are presented. The XML databases were considerably slower and required much more space than Couchbase. Overall, Couchbase had better response times than MySQL, especially for larger datasets. However, Couchbase requires indexing for each differently formulated query and the indexing time increases with the size of the datasets. The performances of the clusters with 2, 4, 8 and 12 nodes were not better than the single node cluster in relation to the query response time, but the indexing time was reduced proportionally to the number of nodes. The tested XML databases had acceptable performance for openEHR-based data in some querying use cases and small datasets, but were generally much slower than Couchbase. Couchbase also outperformed the response times of the relational database, but required more disk space and had a much longer indexing time. Systems like Couchbase are thus interesting research targets for scalable storage and querying of archetype-based EHR data when population-based use cases are of interest.

Place, publisher, year, edition, pages
Public Library Science , 2016. Vol. 11, no 3, e0150069
National Category
Computer Science
URN: urn:nbn:se:liu:diva-125961DOI: 10.1371/journal.pone.0150069ISI: 000371992300032PubMedID: 26958859OAI: diva2:910694
Swedish e‐Science Research Center

Funding agencies: Coordenacao de Aperfeicoamento de Pessoal de Nivel Superior (CAPES Foundation - Brazil) [4055/11]; Conselho Brasileiro de Desenvolvimento Cientifico e Tecnologico (CNPq) [150916/2013-2]

Available from: 2016-03-09 Created: 2016-03-09 Last updated: 2016-04-13

Open Access in DiVA

fulltext(1665 kB)101 downloads
File information
File name FULLTEXT01.pdfFile size 1665 kBChecksum SHA-512
Type fulltextMimetype application/pdf

Other links

Publisher's full textPubMed

Search in DiVA

By author/editor
Freire, Sergio MirandaWei-Kleiner, FangSundvall, ErikKarlsson, DanielLambrix, Patrick
By organisation
Department of Biomedical EngineeringFaculty of Science & EngineeringDatabase and information techniquesThe Institute of TechnologyMedical InformaticsRegion Östergötland
In the same journal
Computer Science

Search outside of DiVA

GoogleGoogle Scholar
Total: 101 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Altmetric score

Total: 520 hits
ReferencesLink to record
Permanent link

Direct link