Title: Natural language processing driven ontology modelling to address semantic interoperability in information extraction
Authors: B. Damayanthi Jesudas; B. Gurumoorthy
Addresses: Product Lifecycle Management Lab, CPDM, Indian Institute of Science, Bangalore, India ' Product Lifecycle Management Lab, CPDM, Indian Institute of Science, Bangalore, India
Abstract: The aim of this research is to investigate the use of natural language processing techniques for ontology modelling that would address semantic interoperability problems arising due to terminological and semantic differences during information extraction from documents. These documents contain terminologies that are specific to the group and experts involved in creating them. As manual extraction is a tedious process, natural language processing techniques are suggested in this work. Text from different documents describing new terms are taken as input. These texts are processed to establish either semantic equivalence of the new term with concepts in reference ontology or add the term as a new concept if appropriate. Semantic role labelling is used for concept modelling and semantic role labelling tool is used in deriving the concept instances from the text, which are matched against concepts in the reference ontology to establish equivalence. If instances of attributes are equivalent, then concepts are said to be equivalent and if equivalence is limited to attributes, then ontology is populated with new concept. Results on technical text are presented with discussion on how the limitations observed can be removed.
Keywords: natural language processing; ontology modelling; semantic interoperability; information extraction; domain ontology; semantic role labelling; SRL; SENNA; semantic roles; semantic equivalence; mapping.
DOI: 10.1504/IJPLM.2018.094720
International Journal of Product Lifecycle Management, 2018 Vol.11 No.3, pp.268 - 293
Received: 08 Feb 2017
Accepted: 18 May 2018
Published online: 12 Sep 2018 *