liu.seSearch for publications in DiVA
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
An Unsupervised Data-driven Method to Discover Equivalent Relations in Large Linked Datasets
University of Sheffield, England.
University of Sheffield, England.
Linköping University, Department of Computer and Information Science, Human-Centered systems. Linköping University, Faculty of Science & Engineering.ORCID iD: 0000-0003-0036-6662
University of Sheffield, England.
Show others and affiliations
2017 (English)In: Semantic Web, ISSN 1570-0844, E-ISSN 2210-4968, Vol. 8, no 2Article in journal (Refereed) Published
Abstract [en]

This article addresses a number of limitations of state-of-the-art methods of Ontology Alignment: 1) they primarily address concepts and entities while relations are less well-studied; 2) many build on the assumption of the well-formedness of ontologies which is unnecessarily true in the domain of Linked Open Data; 3) few have looked at schema heterogeneity from a single source, which is also a common issue particularly in very large Linked Dataset created automatically from heterogeneous resources, or integrated from multiple datasets. We propose a domain-and language-independent and completely unsupervised method to align equivalent relations across schemata based on their shared instances. We introduce a novel similarity measure able to cope with unbalanced population of schema elements, an unsupervised technique to automatically decide similarity threshold to assert equivalence for a pair of relations, and an unsupervised clustering process to discover groups of equivalent relations across different schemata. Although the method is designed for aligning relations within a single dataset, it can also be adapted for cross-dataset alignment where sameAs links between datasets have been established. Using three gold standards created based on DBpedia, we obtain encouraging results from a thorough evaluation involving four baseline similarity measures and over 15 comparative models based on variants of the proposed method. The proposed method makes significant improvement over baseline models in terms of F1 measure (mostly between 7% and 40%), and it always scores the highest precision and is also among the top performers in terms of recall. We also make public the datasets used in this work, which we believe make the largest collection of gold standards for evaluating relation alignment in the LOD context.

Place, publisher, year, edition, pages
IOS PRESS , 2017. Vol. 8, no 2
Keyword [en]
ontology alignment; ontology mapping; Linked Data; DBpedia; similarity measure
National Category
Information Systems
Identifiers
URN: urn:nbn:se:liu:diva-136222DOI: 10.3233/SW-150193ISI: 000396859500003OAI: oai:DiVA.org:liu-136222DiVA: diva2:1086233
Note

Funding Agencies|EPSRC [EP/J019488/1]

Available from: 2017-03-31 Created: 2017-03-31 Last updated: 2017-04-25

Open Access in DiVA

fulltext(712 kB)1 downloads
File information
File name FULLTEXT01.pdfFile size 712 kBChecksum SHA-512
8879997ab83c214bac72227b0e229eea293d7af75edfb5eb6b21f73506144468f403564878352d9bde9040e19ed1d27061af9335a4eb7268f1fafb08eb067367
Type fulltextMimetype application/pdf

Other links

Publisher's full text

Search in DiVA

By author/editor
Blomqvist, Eva
By organisation
Human-Centered systemsFaculty of Science & Engineering
In the same journal
Semantic Web
Information Systems

Search outside of DiVA

GoogleGoogle Scholar
Total: 1 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Altmetric score

Total: 22 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf