Title: HighPU: a high privacy-utility approach to mining frequent itemset with differential privacy
Authors: Yabin Wang; Yi Qiao; Zhaobin Liu; Zhiyi Huang
Addresses: Department of Computer Science, Dalian Maritime University, Dalian, China ' Department of Computer Science, Dalian Maritime University, Dalian, China ' Department of Computer Science, Dalian Maritime University, Dalian, China ' Department of Computer Science, Otago University, Otago, New Zealand
Abstract: In the field of data mining, frequent itemset mining (FIM) is a popular technique for analysing transaction datasets and establishing the foundation of association rules. Publishing frequent itemsets, however presents privacy challenges. Differential privacy provides strong privacy assurance to users. In this paper, we study the problem of mining frequent itemsets under the rigorous differential privacy model. We propose an approach, called HighPU, which achieves both high data utility and high degree of privacy in FIM. HighPU begins by truncating transactions over the original dataset. Then HighPU directly searches for maximal frequent itemsets. And we use a consistent approach to improve the accuracy of the results. Extensive experiments using several real datasets illustrate that HighPU significantly outperforms the current state of the art.
Keywords: differential privacy; frequent itemset mining; FIM; top-k itemsets; privacy protection.
International Journal of Embedded Systems, 2019 Vol.11 No.5, pp.624 - 633
Received: 21 Sep 2018
Accepted: 09 Nov 2018
Published online: 24 Sep 2019 *