Find a copy in the library
Finding libraries that hold this item...
Details
Genre/Form: | Thèses et écrits académiques |
---|---|
Material Type: | Thesis/dissertation |
Document Type: | Book |
All Authors / Contributors: |
Fatiha Saïs; Marie-Christine Rousset; Nathalie Pernelle; Université de Paris-Sud. Faculté des sciences d'Orsay (Essonne).; Université Paris-Sud (1970-2019). |
OCLC Number: | 1164717392 |
Description: | Microfiches. ; 105 x 148 mm. |
Series Title: | Lille thèses. |
Responsibility: | Fatiha Saïs ; sous la direction de [Marie-Christine Rousset et Nathalie Pernelle]. |
Abstract:
This thesis deals with semantic data integration guided by an ontology. Data integration aims at combining autonomous and heterogonous data sources. To this end, all the data should be represented according to the same schema and according to a unified semantics. This thesis is divided into two parts. In the first one, we present an automatic and flexible method for data reconciliation with an ontology. We consider the case where data are represented in tables. The reconciliation result is represented in the SML format which we have defined. Its originality stems from the fact that it allows representing all the established mappings but also information that is imperfectly identified. In the second part, we present two methods of reference reconciliation. This problem consists in deciding whether different data descriptions refer to the same real world entity. We have considered this problem when data is described according to the same schema. The first method, called L2R, is logical: it translates the schema and the data semantics into a set of logical rules which allow inferring correct decisions both of reconciliation and no reconciliation. The second method, called N2R, is numerical. It translates the schema semantics into an informed similarity measure used by a numerical computation of the similarity of the reference pairs. This computation is expressed in a non linear equation system solved by using an iterative method. Our experiments on real datasets demonstrated the robustness and the feasibility of our approaches. The solutions that we bring to the two problems of reconciliation are completely automatic and guided only by an ontology.
Reviews

