Title: Using knowledge management to create a Data Hub and leverage the usage of a Data Lake
Authors: Matheus Correia Ferreira; Fernanda Bruno Dos Santos; Carlos Eduardo Barbosa; Jano Moreira De Souza
Addresses: COPPE – Graduate School and Research in Engineering, Universidade Federal do Rio de Janeiro (UFRJ), P.O. Box 68511, Rio de Janeiro, 21941-972, Brazil ' Postgraduate Program in Computer Science (PPGI), Universidade Federal do Rio de Janeiro (UFRJ), P.O. Box 68511, Rio de Janeiro, 21941-972, Brazil ' COPPE – Graduate School and Research in Engineering, Universidade Federal do Rio de Janeiro (UFRJ), P.O. Box 68511, Rio de Janeiro, 21941-972, Brazil ' COPPE – Graduate School and Research in Engineering, Universidade Federal do Rio de Janeiro (UFRJ), P.O. Box 68511, Rio de Janeiro, 21941-972, Brazil
Abstract: As repositories into which different types of data without pre-processing and modelling can be added, Data Lakes have attracted much interest. They speed up the delivery of data to users and preserve its highest granularity level. That same flexibility can be dangerous. If not managed, it is easy to lose control of the repository because of the volume it holds and its growth. As Data Lakes do not carry the semantics of a regular database, understanding its contents can be cumbersome, which undermines its widespread use within a company, dampening the perception that it has helped data science efforts. This work uses knowledge management models as a basis to solve these issues by enriching the data in a Data Lake with information that enhances its usability. Concomitantly, with the use of a data portal platform and suggested metadata, we propose a portal that provides easy access to the Data Lake.
Keywords: Data Lake; data science; metadata; knowledge management; CKAN; comprehensive knowledge archive network.
DOI: 10.1504/IJKMS.2018.094214
International Journal of Knowledge Management Studies, 2018 Vol.9 No.3, pp.260 - 277
Received: 25 Mar 2017
Accepted: 10 Nov 2017
Published online: 22 Aug 2018 *