Automated validation of structured large databases: an illustration of material code bulk validation Online publication date: Tue, 29-Dec-2015
by Ravindra Patankar; Sandeep Dulluri
International Journal of Big Data Intelligence (IJBDI), Vol. 3, No. 1, 2016
Abstract: The accumulation of data and henceforth the storage is growing at an exponential phase owing to the decrease in the memory costs and increasingly complex business processes. With the increased data, typically there would be an increase in the complexity of validating the data. Often, the complexity and effort in validation of large scale databases grows nonlinearly with the increase in database size (Lee et al., 1999). In this paper, we discuss a novel methodology for bulk validation of large scale structured databases. The approach we propose is generic and has been tested in a real time environment. We present an illustration of validation on a material codes validation problem faced by a Fortune 100 enterprise. The demonstration would highlight the heterogeneity, and scale-scope of data validation related problems and henceforth tackling these problems effectively via application of machine learning techniques on Big Data.
Existing subscribers:
Go to Inderscience Online Journals to access the Full Text of this article.
If you are not a subscriber and you just want to read the full contents of this article, buy online access here.Complimentary Subscribers, Editors or Members of the Editorial Board of the International Journal of Big Data Intelligence (IJBDI):
Login with your Inderscience username and password:
Want to subscribe?
A subscription gives you complete access to all articles in the current issue, as well as to all articles in the previous three years (where applicable). See our Orders page to subscribe.
If you still need assistance, please email subs@inderscience.com