Title: Privacy preserving data mining - past and present
Authors: G. Sathish Kumar; K. Premalatha
Addresses: Department of Computer Science and Engineering, Sri Krishna College of Engineering and Technology, Coimbatore, Tamil Nadu, India; Department of Computer Science and Engineering, Bannari Amman Institute of Technology, Erode, Tamil Nadu, India
Abstract: Data mining is the process of discovering patterns and correlations within large volumes of data in order to forecast outcomes. Data mining techniques face serious challenges from privacy violations and the disclosure of sensitive information when a dataset is provided to third parties. When useful information is extracted and patterns are revealed from a dataset, users' private and sensitive data must be protected from exposure without the authorisation of the data holders or providers. Internet phishing further increases the threat of private information spreading widely over the web. Privacy preserving data mining (PPDM) is essential for exchanging confidential information for data analysis, validation, and publishing. A number of algorithms have been designed in the data mining field to achieve data privacy. This article presents a broad survey of privacy preserving data mining algorithms and of the datasets used in the research, and analyses the techniques against a set of parameters. For each study, the survey identifies the outcome along with its advantages and disadvantages. It is intended to guide future PPDM researchers in choosing appropriate techniques for their work.
Keywords: data mining; privacy preserving data mining; PPDM; privacy preserving techniques; sensitive attributes; privacy threats.
DOI: 10.1504/IJBIDM.2022.124844
International Journal of Business Intelligence and Data Mining, 2022 Vol.21 No.2, pp.149 - 170
Received: 15 Jun 2020
Accepted: 17 Feb 2021
Published online: 11 Aug 2022