Description

Title: Near sameness is somewhat the same as sameness
Abstract: The lack of a central naming authority is a key characteristic of the World Wide Web and the Semantic Web alike. This means that data publishers are free to introduce different IRIs that refer to the same instance or concept. In order to still be able to meaningfully interchange data on the web, it is necessary to introduce links between these IRIs. `owl:sameAs' is by far the most common property for this sort of interlinking. I will present https://sameAs.cc, a resource that computes the equivalence closure for 558M explicit `owl:sameAs' statements obtained from the web. The closure consists of 35B implicit statements. As is to be expected, some of these 35B implicit statements are incorrect, e.g., stating that The Netherlands is the same as Germany. There are also more subtle errors, e.g., equating the person Barack Obama with the 44th president of the U.S. I will present an approach for automatically detecting identity communities, i.e., subgraphs of identity graphs that have relatively few errors and are therefore a better starting point for interlinking.
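
To illustrate what an equivalence closure over `owl:sameAs' statements involves, here is a minimal sketch using a union-find structure. It is not the sameAs.cc implementation; the function names and the `ex:' IRIs are hypothetical, and counting every ordered pair within a class (including reflexive ones) is only one assumed way of counting implicit statements.

```python
from collections import defaultdict


def identity_sets(same_as_pairs):
    """Group IRIs into equivalence classes with union-find (illustrative only)."""
    parent = {}

    def find(x):
        # Register unseen IRIs as their own root, then walk up to the root.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving keeps trees shallow
            x = parent[x]
        return x

    # Each explicit statement merges the classes of its two IRIs.
    for a, b in same_as_pairs:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_a] = root_b

    classes = defaultdict(set)
    for iri in list(parent):
        classes[find(iri)].add(iri)
    return list(classes.values())


def closure_size(classes):
    # Assumption: count every ordered pair (x, y) inside a class,
    # i.e. the reflexive, symmetric and transitive closure.
    return sum(len(c) ** 2 for c in classes)


# Hypothetical explicit statements: three links, five IRIs.
pairs = [("ex:a", "ex:b"), ("ex:b", "ex:c"), ("ex:d", "ex:e")]
classes = identity_sets(pairs)
total = closure_size(classes)
```

Because a class of n IRIs contributes on the order of n squared pairs, a single erroneous link that merges two large classes adds many incorrect implicit statements at once, which is why a closure can be far larger than its explicit input.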

Other presentations by Wouter Beek

Date Title
03 December 2012
03 June 2013
25 March 2013
10 February 2014
03 March 2014
12 January 2015
18 May 2015
18 January 2016
14 November 2016
06 November 2017 The Semantic Frontier: Going Where No Triple Store Has Gone Before
11 June 2018 Near sameness is somewhat the same as sameness