liu.seSearch for publications in DiVA
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Similarity joins and clustering for SPARQL
Linköping University, Department of Computer and Information Science, Database and information techniques. Linköping University, Faculty of Science & Engineering. Univ Chile, Chile.ORCID iD: 0000-0002-9834-8376
Univ Chile, Chile.
Univ Chile, Chile.
2024 (English)In: Semantic Web, ISSN 1570-0844, E-ISSN 2210-4968, Vol. 15, no 5, p. 1701-1732Article in journal (Refereed) Published
Abstract [en]

The SPARQL standard provides operators to retrieve exact matches on data, such as graph patterns, filters and grouping. This work proposes and evaluates two new algebraic operators for SPARQL 1.1 that return similarity-based results instead of exact results. First, a similarity join operator is presented, which brings together similar mappings from two sets of solution mappings. Second, a clustering solution modifier is introduced, which instead of grouping solution mappings according to exact values, brings them together by using similarity criteria. For both cases, a variety of algorithms are proposed and analysed, and use-case queries that showcase the relevance and usefulness of the novel operators are presented. For similarity joins, experimental results are provided by comparing different physical operators over a set of real world queries, as well as comparing our implementation to the closest work found in the literature, DBSimJoin, a PostgreSQL extension that supports similarity joins. For clustering, synthetic queries are designed in order to measure the performance of the different algorithms implemented.

Place, publisher, year, edition, pages
IOS PRESS , 2024. Vol. 15, no 5, p. 1701-1732
Keywords [en]
Similarity joins; clustering; SPARQL
National Category
Information Systems
Identifiers
URN: urn:nbn:se:liu:diva-210194DOI: 10.3233/SW-243540ISI: 001358071100009OAI: oai:DiVA.org:liu-210194DiVA, id: diva2:1917783
Note

Funding Agencies|ANID - Millennium Science Initiative Program [ICN17_002]; Swedish Research Council [2019-05655]; CENIIT program at Linkoeping University [17.05]; Fondecyt [1221926]

Available from: 2024-12-03 Created: 2024-12-03 Last updated: 2024-12-03

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full text

Search in DiVA

By author/editor
Ferrada, Sebastian
By organisation
Database and information techniquesFaculty of Science & Engineering
In the same journal
Semantic Web
Information Systems

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 31 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • oxford
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf