Paper 3

Reducing the semantic heterogeneity of unstructured P2P systems: a contribution based on a dissemination protocol

Authors: Thomas Cerqueus, Sylvie Cazalens, and Philippe Lamarre

Volume 7 (2012)

Abstract

In resource sharing P2P systems with autonomous participants, each peer is free to use the ontology with which it annotates its resources. Semantic heterogeneity occurs when the peers do not use the same ontology. For example, a contributing peer A (e.g. a doctor) may annotate its photos, diagrams, data sets with some ontology of its own, while peer B (e.g. a genetician) uses another one. In order to answer a query issued in the system, peers need to know alignments that state correspondences between entities of two ontologies. Assuming that each peer has some partial initial knowledge of some alignments, we focus on correspondences sharing between the peers as a means to learn additional correspondences. We rst provide several measures of semantic heterogeneity that enable to draw a semantic picture of the system and to evaluate the eciency of protocols independently of query evaluation. We propose CorDis, a gossip-based protocol that disseminates the correspondences that the peers want to share in the system. To overcome the peers’ storage limitations, we propose to consider a history of past queries and to favor the correspondences involving frequently used entities. We study several policies that a peer may adopt in case of inconsistency i.e. when shared correspondences confict with its own knowledge. We conduct experiments with a set of 93 ontologies actively used in the biomedical domain. We evaluate the CorDis protocol with respect to the proposed measures of semantic heterogeneity and show its good behavior for decreasing them in several contexts.