An effective and time-efficient approach for Linked Data fusion using genetic algorithms
Online publication date: Wed, 16-Nov-2016
by Khayra Bencherif; Mimoun Malki
International Journal of Metadata, Semantics and Ontologies (IJMSO), Vol. 11, No. 2, 2016
Abstract: The Linked Open Data (LOD) Cloud is a project that uses the RDF formalism to publish data on the web as triples under an open licence. With the ever-increasing number of data sets available in the LOD Cloud, integrating heterogeneous data manually is already beyond human capability. Linked Data fusion has so far required a significant amount of time owing to the large number of instances in the LOD Cloud data sets. In this paper, we propose a new system to efficiently combine heterogeneous data from the LOD Cloud. First, we extract similar instances from the LOD Cloud to identify identical or related information. Then, our system collects all predicates and objects of the similar instances to construct a set of trees. Finally, we propose a genetic algorithm to merge the data in the constructed trees. We give an overview of our system architecture and detail our genetic algorithm. We also evaluate our system on real data sets, showing that it can increase the completeness and conciseness of the fused data. Moreover, we show that our system is faster when fusing large data sets from the LOD Cloud.
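The three steps named in the abstract (group similar instances, build predicate/object trees, merge with a genetic algorithm) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the tree structure, the chromosome encoding, or the fitness function, so everything below is an assumption made for illustration. The sample data, the names `build_tree`, `candidates`, `fitness`, and `fuse`, the bit-vector encoding, and the toy completeness and conciseness measures are all hypothetical.

```python
import random
from collections import defaultdict

# Hypothetical (predicate, object) pairs describing one entity in two sources.
SOURCE_A = [("name", "Oran"), ("country", "Algeria"), ("population", "609940")]
SOURCE_B = [("name", "Oran"), ("country", "DZ"), ("area", "2121")]


def build_tree(*sources):
    """Group objects under their predicate: predicate -> sorted distinct objects."""
    tree = defaultdict(set)
    for pairs in sources:
        for predicate, obj in pairs:
            tree[predicate].add(obj)
    return {p: sorted(objs) for p, objs in sorted(tree.items())}


def candidates(tree):
    """Flatten the tree into the list of (predicate, object) pairs a chromosome selects from."""
    return [(p, o) for p, objs in tree.items() for o in objs]


def fitness(bits, cands, tree):
    """Assumed toy fitness: completeness times conciseness of the selected pairs."""
    chosen = [c for bit, c in zip(bits, cands) if bit]
    if not chosen:
        return 0.0
    covered = {p for p, _ in chosen}
    completeness = len(covered) / len(tree)    # share of predicates that keep a value
    conciseness = len(covered) / len(chosen)   # drops when a predicate keeps several values
    return completeness * conciseness


def fuse(tree, generations=50, pop_size=20, mutation_rate=0.1, seed=0):
    """Evolve bit vectors over the candidate pairs and return the best selection found."""
    rng = random.Random(seed)
    cands = candidates(tree)
    n = len(cands)

    def score(bits):
        return fitness(bits, cands, tree)

    population = [[rng.randint(0, 1) for _ in range(n)] for _ in range(pop_size)]
    for _ in range(generations):
        population.sort(key=score, reverse=True)
        elite = population[: pop_size // 2]        # keep the better half unchanged
        children = []
        while len(elite) + len(children) < pop_size:
            mum, dad = rng.sample(elite, 2)
            cut = rng.randint(1, n - 1)            # one-point crossover
            child = mum[:cut] + dad[cut:]
            child = [1 - b if rng.random() < mutation_rate else b for b in child]
            children.append(child)
        population = elite + children
    best = max(population, key=score)
    return [c for bit, c in zip(best, cands) if bit]


if __name__ == "__main__":
    tree = build_tree(SOURCE_A, SOURCE_B)
    for predicate, obj in fuse(tree):
        print(predicate, obj)
```

Under this toy fitness, a selection that keeps exactly one object per predicate scores 1.0, while keeping both `country` values scores 0.8. The sketch does not decide which of two conflicting values is correct; that would need a quality or provenance signal the abstract does not describe.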