liu.seSearch for publications in DiVA
Change search
Link to record
Permanent link

Direct link
Biography [eng]

I am interested in problems related to the management of databases and knowledge. My focus in this broad context is on data on the Web (Semantic Web and Linked Data, in particular) and on graph data, as well as on problems in which the data is distributed over multiple, autonomous and/or heterogeneous sources (keywords: federated data management, federated knowledge graphs). Other keywords that appear in some of my work are data provenance and data quality. Regarding all of these topics, my interests range from theoretical foundations (e.g., complexity and expressive power of query languages) all the way to systems-building related research (e.g., efficient storage of datasets and databases, query processing, and query optimization).

Publications (10 of 42) Show all publications
Elhalawati, A., Dimou, A., Hartig, O. & Hernández, D. (2025). Flexible RML-Based Mapping of Property Graphs to RDF. In: Proceedings of the Workshops of the EDBT/ICDT 2025 Joint Conference: . Paper presented at 28th International Conference on Extending Database Technology (EDBT) and 28th International Conference on Database Theory (ICDT).
Open this publication in new window or tab >>Flexible RML-Based Mapping of Property Graphs to RDF
2025 (English)In: Proceedings of the Workshops of the EDBT/ICDT 2025 Joint Conference, 2025Conference paper, Published paper (Refereed)
Abstract [en]

RDF graphs and (Labeled) Property Graphs (PGs) have emerged as data models for representing graph databases. Given the differences between the two models, ensuring interoperability between them has become essential, to leverage the strengths of both models. Various approaches have been proposed to map PGs to RDF graphs. However, these approaches differ in terms of structure, representation, size of the generated RDF graph, and degree of configuration provided to the user, making direct comparisons challenging. While declarative methods prevailed to construct RDF graphs from other data formats, the mapping languages proposed for such transformations have not been considered so far for mapping PGs to RDF graphs. In this work, we provide a representation of PG-to-RDF approaches through templates described using RML, a mapping language to construct RDF graphs from heterogeneous data. We show that all considered PG-to-RDF approaches can be represented in RML and, by having a uniform representation of them, we can compare them showcasing their differences. Finally, we show that not only can RML be used to capture PG-to-RDF mappings, but it actually offers more expressive power than the considered PG-to-RDF approaches.

National Category
Computer Sciences
Identifiers
urn:nbn:se:liu:diva-212262 (URN)
Conference
28th International Conference on Extending Database Technology (EDBT) and 28th International Conference on Database Theory (ICDT)
Funder
Swedish Research Council, 2019-05655
Available from: 2025-03-17 Created: 2025-03-17 Last updated: 2025-04-03
Khayatbashi, S., Hartig, O. & Jalali, A. (2025). Transforming Object-Centric Event Logs to Temporal Event Knowledge Graphs. In: Katarzyna Gdowska; María Teresa Gómez-López; Jana-Rebecca Rehse (Ed.), Business Process Management Workshops: BPM 2024 International Workshops, Krakow, Poland, September 1–6, 2024, Revised Selected Papers. Paper presented at 22nd International Conference on Business Process Management (BPM), Krakow, Poland, September 1–6, 2024 (pp. 300-313). Cham: Springer
Open this publication in new window or tab >>Transforming Object-Centric Event Logs to Temporal Event Knowledge Graphs
2025 (English)In: Business Process Management Workshops: BPM 2024 International Workshops, Krakow, Poland, September 1–6, 2024, Revised Selected Papers / [ed] Katarzyna Gdowska; María Teresa Gómez-López; Jana-Rebecca Rehse, Cham: Springer, 2025, p. 300-313Conference paper, Published paper (Refereed)
Abstract [en]

Event logs play a fundamental role in enabling data-driven business process analysis. Traditionally, these logs track events related to a single object, known as the case, limiting the scope of analysis. Recent advancements, such as Object-Centric Event Log (OCEL) and Event Knowledge Graph (EKG), capture better how events relate to multiple objects. However, attributes of objects can change over time, which was not initially considered in OCEL or EKG. While OCEL 2.0 has addressed some of these limitations, there remains a research gap concerning how attribute changes should be accommodated in EKG and how OCEL 2.0 logs can be transformed into EKG. This paper fills this gap by introducing Temporal Event Knowledge Graph (tEKG) and defining an algorithm to convert an OCEL 2.0 log to a tEKG.

Place, publisher, year, edition, pages
Cham: Springer, 2025
Series
Lecture Notes in Business Information Processing, ISSN 1865-1348, E-ISSN 1865-1356 ; 534
Keywords
event knowledge graphs, object-centric event data, object-centric process mining
National Category
Computer and Information Sciences
Identifiers
urn:nbn:se:liu:diva-212264 (URN)10.1007/978-3-031-78666-2_23 (DOI)001467337000023 ()2-s2.0-86000449927 (Scopus ID)9783031786655 (ISBN)9783031786662 (ISBN)
Conference
22nd International Conference on Business Process Management (BPM), Krakow, Poland, September 1–6, 2024
Funder
Swedish Research Council, 2019-05655
Note

Funding Agencies|Vetenskapsradet (the Swedish Research Council) [2019-05655]

Available from: 2025-03-17 Created: 2025-03-17 Last updated: 2025-05-14
Li, H., Hartig, O., Armiento, R. & Lambrix, P. (2024). Ontology-based GraphQL server generation for data access and data integration. Semantic Web, 15(5), 1639-1675
Open this publication in new window or tab >>Ontology-based GraphQL server generation for data access and data integration
2024 (English)In: Semantic Web, ISSN 1570-0844, E-ISSN 2210-4968, Vol. 15, no 5, p. 1639-1675Article in journal (Refereed) Published
Abstract [en]

In a GraphQL Web API, a so-called GraphQL schema defines the types of data objects that can be queried, and so-called resolver functions are responsible for fetching the relevant data from underlying data sources. Thus, we can expect to use GraphQL not only for data access but also for data integration, if the GraphQL schema reflects the semantics of data from multiple data sources, and the resolver functions can obtain data from these data sources and structure the data according to the schema. However, there does not exist a semantics-aware approach to employ GraphQL for data integration. Furthermore, there are no formal methods for defining a GraphQL API based on an ontology.In this work, we introduce a framework for using GraphQL in which a global domain ontology informs the generation of a GraphQL server that answers requests by querying heterogeneous data sources.The core of this framework consists of an algorithm to generate a GraphQL schema based on an ontology and a generic resolver function based on semantic mappings. We provide a prototype, OBG-gen, of this framework, and we evaluate our approach over a real-world data integration scenario in the materials design domain and two synthetic benchmark scenarios (Linköping GraphQL Benchmark and GTFS-Madrid-Bench). The experimental results of our evaluation indicate that: (i) our approach is feasible to generate GraphQL servers for data access and integration over heterogeneous data sources, thus avoiding a manual construction of GraphQL servers, and (ii) our data access and integration approach is general and applicable to different domains where data is shared or queried via different ways.

Place, publisher, year, edition, pages
IOS Press, 2024
Keywords
Data Integration, Ontology, GraphQL
National Category
Computer Sciences
Identifiers
urn:nbn:se:liu:diva-200165 (URN)10.3233/sw-233550 (DOI)001358071100007 ()
Funder
CUGS (National Graduate School in Computer Science)Swedish e‐Science Research CenterSwedish Research Council, 2018-04147Swedish Research Council, 2019-05655Swedish Agency for Economic and Regional Growth
Note

Funding Agencies|Swedish e-Science Research Centre (SeRC); Swedish Na-tional Graduate School in Computer Science (CUGS); Swedish Research Council (Vetenskapsradet) [2018-04147, 2019-05655]; Swedish Agency for Economic and Regional and Growth (Tillvaextverket)

Available from: 2024-01-12 Created: 2024-01-12 Last updated: 2024-12-04
Lambrix, P., Armiento, R., Li, H., Hartig, O., Abd Nikooie Pour, M. & Li, Y. (2024). The materials design ontology. Semantic Web, 15(2), 481-515
Open this publication in new window or tab >>The materials design ontology
Show others...
2024 (English)In: Semantic Web, ISSN 1570-0844, E-ISSN 2210-4968, Vol. 15, no 2, p. 481-515Article in journal (Refereed) Published
Abstract [en]

In the materials design domain, much of the data from materials calculations is stored in different heterogeneous databases with different data and access models. Therefore, accessing and integrating data from different sources is challenging. As ontology-based access and integration alleviates these issues, in this paper we address data access and interoperability for computational materials databases by developing the Materials Design Ontology. This ontology is inspired by and guided by the OPTIMADE effort that aims to make materials databases interoperable and includes many of the data providers in computational materials science. In this paper, first, we describe the development and the content of the Materials Design Ontology. Then, we use a topic model-based approach to propose additional candidate concepts for the ontology. Finally, we show the use of the Materials Design Ontology by a proof-of-concept implementation of a data access and integration system for materials databases based on the ontology.

Place, publisher, year, edition, pages
IOS Press, 2024
Keywords
Ontology, Ontology Development, Data Access, Data Integration, Materials Science, Materials Design Ontology
National Category
Computer Sciences
Identifiers
urn:nbn:se:liu:diva-198433 (URN)10.3233/sw-233340 (DOI)001212197500009 ()
Funder
CUGS (National Graduate School in Computer Science)Swedish Research Council, 2018-04147Swedish Agency for Economic and Regional GrowthSwedish e‐Science Research Center
Note

Funding Agencies|Swedish e-Science Research Centre (SeRC); Swedish National Graduate School in Computer Science (CUGS); Swedish Research Council (Vetenskapsradet) [2018-04147]; Swedish Agency for Economic and Regional and Growth (Tillvaxtverket)

Available from: 2023-10-12 Created: 2023-10-12 Last updated: 2024-05-24Bibliographically approved
Cheng, S. & Hartig, O. (2023). A Cost Model to Optimize Queries over Heterogeneous Federations of RDF Data Sources. In: Joint Proceedings of the ESWC 2023 Workshops and Tutorials co-located with 20th European Semantic Web Conference (ESWC 2023): . Paper presented at 1st International Workshop on Data Management for Knowledge Graphs (DMKG), colocated with ESWC, Hersonissos, Greece, May 28-29, 2023.
Open this publication in new window or tab >>A Cost Model to Optimize Queries over Heterogeneous Federations of RDF Data Sources
2023 (English)In: Joint Proceedings of the ESWC 2023 Workshops and Tutorials co-located with 20th European Semantic Web Conference (ESWC 2023), 2023Conference paper, Published paper (Refereed)
Abstract [en]

Federated processing of queries over RDF data sources offers significant potential when a SPARQL query cannot be answered by a single data source alone. However, finding efficient plans to execute a queryover a federation is challenging, especially if different federation members provide different types of data access interfaces. Different interfaces imply different request types, different forms of responses, and different physical algorithms that can be used, each of which consumes varying amounts of resources during query execution. This heterogeneity poses additional obstacles to the task of planning query executions, in addition to the inherent complexity arising from numerous possible join orderings andvarious physical algorithms. As a first step to address these challenges, we propose a cost model that captures the resource requirements of different operators depending on the type of federation member,allowing us to estimate cost of a given query execution plan without actually executing it. To evaluate our approach, we conduct experiments on FedBench with our cost model and compare it to the current state-of-the-art approach to query planning for heterogeneous federations of RDF data sources.

Series
CEUR Workshop Proceedings, E-ISSN 1613-0073 ; 3443
National Category
Computer Sciences
Identifiers
urn:nbn:se:liu:diva-198001 (URN)
Conference
1st International Workshop on Data Management for Knowledge Graphs (DMKG), colocated with ESWC, Hersonissos, Greece, May 28-29, 2023
Funder
Swedish Research Council, 2019-05655CUGS (National Graduate School in Computer Science)
Available from: 2023-09-21 Created: 2023-09-21 Last updated: 2024-11-25Bibliographically approved
Cheng, S., Ferrada, S. & Hartig, O. (2023). Considering Vocabulary Mappings in Query Plans for Federations of RDF Data Sources. In: Mohamed Sellami, Maria-Esther Vidal, Boudewijn van Dongen, Walid Gaaloul, Hervé Panetto (Ed.), Cooperative Information Systems: 29th International Conference, CoopIS 2023, Groningen, The Netherlands, October 30–November 3, 2023, Proceedings. Paper presented at 29th International Conference on Cooperative Information Systems (CoopIS), Groningen, The Netherlands, October 30 - November 3, 2023 (pp. 21-40). Springer
Open this publication in new window or tab >>Considering Vocabulary Mappings in Query Plans for Federations of RDF Data Sources
2023 (English)In: Cooperative Information Systems: 29th International Conference, CoopIS 2023, Groningen, The Netherlands, October 30–November 3, 2023, Proceedings / [ed] Mohamed Sellami, Maria-Esther Vidal, Boudewijn van Dongen, Walid Gaaloul, Hervé Panetto, Springer, 2023, p. 21-40Conference paper, Published paper (Refereed)
Abstract [en]

Federations of RDF data sources oer great potential for queries that cannot be answered by a single data source. However, querying such federations poses several challenges, one of which is that different but semantically-overlapping vocabularies may be used for the respective RDF data. Since the federation members usually retain their autonomy, this heterogeneity cannot simply be homogenized by modifying the data in the data sources. Therefore, handling this heterogeneity becomes a critical aspect of query planning and execution. We introduce an approach to address this challenge by leveraging vocabulary mappings for the processing of queries over federations with heterogeneous vocabularies. This approach not only translates SPARQL queries but also preserves the correctness of results during query execution. We demonstrate the effectiveness of the approach and measure how the application of vocabulary mappings affects on the performance of federated query processing.

Place, publisher, year, edition, pages
Springer, 2023
Series
Lecture Notes in Computer Science, ISSN 0302-9743, E-ISSN 1611-3349 ; 14353
National Category
Computer Sciences
Identifiers
urn:nbn:se:liu:diva-198003 (URN)10.1007/978-3-031-46846-9_2 (DOI)001560834900002 ()2-s2.0-85176015280 (Scopus ID)9783031468469 (ISBN)9783031468452 (ISBN)
Conference
29th International Conference on Cooperative Information Systems (CoopIS), Groningen, The Netherlands, October 30 - November 3, 2023
Funder
Swedish Research Council, 2019-05655CUGS (National Graduate School in Computer Science)
Note

Funding Agencies|National Graduate School in Computer Science, Sweden (CUGS); Vetenskapsradet (the Swedish Research Council) [2019-05655]

Available from: 2023-09-21 Created: 2023-09-21 Last updated: 2025-10-02Bibliographically approved
Dang, M.-H., Aimonier-Dava, J., Molli, P., Hartig, O., Skaf-Molli, H. & Le Crom, Y. (2023). FedShop: A Benchmark for Testing the Scalability of SPARQL Federation Engines. In: Terry R. Payne, Valentina Presutti, Guilin Qi, María Poveda-Villalón, Giorgos Stoilos, Laura Hollink, Zoi Kaoudi, Gong Cheng, Juanzi Li (Ed.), The Semantic Web – ISWC 2023: 22nd International Semantic Web Conference, Athens, Greece, November 6–10, 2023, Proceedings, Part II. Paper presented at 22nd International Semantic Web Conference (ISWC) (pp. 285-301). Springer
Open this publication in new window or tab >>FedShop: A Benchmark for Testing the Scalability of SPARQL Federation Engines
Show others...
2023 (English)In: The Semantic Web – ISWC 2023: 22nd International Semantic Web Conference, Athens, Greece, November 6–10, 2023, Proceedings, Part II / [ed] Terry R. Payne, Valentina Presutti, Guilin Qi, María Poveda-Villalón, Giorgos Stoilos, Laura Hollink, Zoi Kaoudi, Gong Cheng, Juanzi Li, Springer, 2023, p. 285-301Conference paper, Published paper (Refereed)
Abstract [en]

While several approaches to query a federation of SPARQL endpoints have been proposed in the literature, very little is known about the effectiveness of these approaches and the behavior of the resulting query engines for cases in which the number of federation members increases. The existing benchmarks that are typically used to evaluate SPARQL federation engines do not consider such a form of scalability. In this paper, we set out to close this knowledge gap by investigating the behavior of 4 state-of-the-art SPARQL federation engines using a novel benchmark designed for scalability experiments. Based on the benchmark, we show that scalability is a challenge for each of these engines, especially with respect to the effectiveness of their source selection & query decomposition approaches. FedShop is freely available online at:https://github.com/GDD-Nantes/FedShop

Place, publisher, year, edition, pages
Springer, 2023
Series
Lecture Notes in Computer Science, ISSN 0302-9743, E-ISSN 1611-3349 ; 14266
National Category
Computer Sciences
Identifiers
urn:nbn:se:liu:diva-198002 (URN)10.1007/978-3-031-47243-5_16 (DOI)001160741700016 ()978-3-031-47243-5 (ISBN)978-3-031-47242-8 (ISBN)
Conference
22nd International Semantic Web Conference (ISWC)
Funder
Swedish Research Council, 2019-05655
Available from: 2023-09-21 Created: 2023-09-21 Last updated: 2024-11-28
Hartig, O. & Yoshida, Y. (2023). GRADES-NDA'23: 6th Joint Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA). In: Companion of the 2023 International Conference on Management of Data, SIGMOD/PODS 2023: . Paper presented at 2023 International Conference on Management of Data (SIGMOD/PODS), Seattle, WA, USA, June 18-23, 2023 (pp. 307-308). Association for Computing Machinery (ACM)
Open this publication in new window or tab >>GRADES-NDA'23: 6th Joint Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA)
2023 (English)In: Companion of the 2023 International Conference on Management of Data, SIGMOD/PODS 2023, Association for Computing Machinery (ACM), 2023, p. 307-308Conference paper, Published paper (Other academic)
Abstract [en]

GRADES-NDA is the premier workshop series on graph data management and analytics that aims to bring together researchers from academia, industry, and government. GRADES-NDA'23 is a forum for discussing recent advances in (large-scale) graph data management and analytics systems, as well as proposing and discussing novel methods and techniques for addressing domain-specific challenges or handling noise in real-world graphs. In 2023, GRADES-NDA is in its sixth edition.

Place, publisher, year, edition, pages
Association for Computing Machinery (ACM), 2023
National Category
Information Systems Computer Sciences
Identifiers
urn:nbn:se:liu:diva-198022 (URN)10.1145/3555041.3590820 (DOI)2-s2.0-85162882075 (Scopus ID)
Conference
2023 International Conference on Management of Data (SIGMOD/PODS), Seattle, WA, USA, June 18-23, 2023
Available from: 2023-09-21 Created: 2023-09-21 Last updated: 2023-09-28Bibliographically approved
Alam, M., Trojahn, C., Hertling, S., Pesquita, C., Aebeloe, C., Aras, H., . . . Türker, R. (Eds.). (2023). Joint Proceedings of the ESWC 2023 Workshops and Tutorials co-located with 20th European Semantic Web Conference (ESWC). Paper presented at 20th European Semantic Web Conference (ESWC), Hersonissos, Greece, May 28-29, 2023. CEUR-WS.org, 3443
Open this publication in new window or tab >>Joint Proceedings of the ESWC 2023 Workshops and Tutorials co-located with 20th European Semantic Web Conference (ESWC)
Show others...
2023 (English)Conference proceedings (editor) (Other academic)
Place, publisher, year, edition, pages
CEUR-WS.org, 2023
Series
CEUR Workshop Proceedings ; 3443
National Category
Computer Sciences Information Systems
Identifiers
urn:nbn:se:liu:diva-198020 (URN)
Conference
20th European Semantic Web Conference (ESWC), Hersonissos, Greece, May 28-29, 2023
Available from: 2023-09-21 Created: 2023-09-21 Last updated: 2023-09-26Bibliographically approved
Li, H., Hartig, O., Armiento, R. & Lambrix, P. (2023). OBG-gen: Ontology-Based GraphQL Server Generation for Data Integration. In: Irini Fundulaki, Kouji Kozaki, Daniel Garijo, Jose Manuel Gomez-Perez (Ed.), Proceedings of the ISWC 2023 Posters, Demos and Industry Tracks: From Novel Ideas to Industrial Practice: co-located with 22nd International Semantic Web Conference (ISWC 2023). Paper presented at 22nd International Semantic Web Conference, November 6–10, 2023, Athens, Greece.
Open this publication in new window or tab >>OBG-gen: Ontology-Based GraphQL Server Generation for Data Integration
2023 (English)In: Proceedings of the ISWC 2023 Posters, Demos and Industry Tracks: From Novel Ideas to Industrial Practice: co-located with 22nd International Semantic Web Conference (ISWC 2023) / [ed] Irini Fundulaki, Kouji Kozaki, Daniel Garijo, Jose Manuel Gomez-Perez, 2023Conference paper, Published paper (Refereed)
Abstract [en]

A GraphQL server contains two building blocks: (1) a GraphQL schema defining the types of data objects that can be requested; (2) resolver functions fetching the relevant data from underlying data sources. GraphQL can be used for data integration if the GraphQL schema provides an integrated view of data from multiple data sources, and the resolver functions are implemented accordingly.However, there does not exist a semantics-aware approach to use GraphQL for data integration.We proposed a framework using GraphQL for data integration in which a global domain ontology informs the generation of a GraphQL server. Furthermore, we implemented a prototype of this framework, OBG-gen. In this paper, we demonstrate OBG-gen in a real-world data integration scenario in the materials design domain and in  a synthetic benchmark scenario.

Series
CEUR Workshop Proceedings, ISSN 1613-0073 ; 3632
National Category
Computer Sciences
Identifiers
urn:nbn:se:liu:diva-198287 (URN)
Conference
22nd International Semantic Web Conference, November 6–10, 2023, Athens, Greece
Funder
Swedish e‐Science Research CenterSwedish Agency for Economic and Regional GrowthCUGS (National Graduate School in Computer Science)
Available from: 2023-10-03 Created: 2023-10-03 Last updated: 2024-02-02
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0002-1741-2090

Search in DiVA

Show all publications

Profile pages

Homepage