Analytics-as-a-service framework for terms association mining in unstructured data Online publication date: Thu, 31-Jul-2014
by Richard K. Lomotey; Ralph Deters
International Journal of Business Process Integration and Management (IJBPIM), Vol. 7, No. 1, 2014
Abstract: Today's high-dimensional data, which is mostly unstructured, makes data patterns discovery (a.k.a. data mining) challenging and difficult for services engineers. Unstructured data mining deviates from existing information extraction methodologies that have been previously put forward due to the fact that recent data formation and storage has no standard schema; and the data is heterogeneous. While the topic is receiving significant attention recently from both the industry and academia, in this work, we aim at performing term association mining from distributed unstructured data storages. To achieve this goal, an analytics-as-a-service (AaaS) framework is proposed that theoretically relies on the Bernoulli algorithm to ensure the accurate determination association between terms. Specifically, the tool is applied to document-oriented data storages where the CouchDB data storage is employed for testing. The pilot evaluation of the proposed AaaS framework for the extraction of mining medical terms shows high accuracy and reliability regarding association maps.
Existing subscribers:
Go to Inderscience Online Journals to access the Full Text of this article.
If you are not a subscriber and you just want to read the full contents of this article, buy online access here.Complimentary Subscribers, Editors or Members of the Editorial Board of the International Journal of Business Process Integration and Management (IJBPIM):
Login with your Inderscience username and password:
Want to subscribe?
A subscription gives you complete access to all articles in the current issue, as well as to all articles in the previous three years (where applicable). See our Orders page to subscribe.
If you still need assistance, please email subs@inderscience.com