Title: HighPU: a high privacy-utility approach to mining frequent itemset with differential privacy

Authors: Yabin Wang; Yi Qiao; Zhaobin Liu; Zhiyi Huang

Addresses: Department of Computer Science, Dalian Maritime University, Dalian, China ' Department of Computer Science, Dalian Maritime University, Dalian, China ' Department of Computer Science, Dalian Maritime University, Dalian, China ' Department of Computer Science, Otago University, Otago, New Zealand

Abstract: In the field of data mining, frequent itemset mining (FIM) is a popular technique for analysing transaction datasets and establishing the foundation of association rules. Publishing frequent itemsets, however presents privacy challenges. Differential privacy provides strong privacy assurance to users. In this paper, we study the problem of mining frequent itemsets under the rigorous differential privacy model. We propose an approach, called HighPU, which achieves both high data utility and high degree of privacy in FIM. HighPU begins by truncating transactions over the original dataset. Then HighPU directly searches for maximal frequent itemsets. And we use a consistent approach to improve the accuracy of the results. Extensive experiments using several real datasets illustrate that HighPU significantly outperforms the current state of the art.

Keywords: differential privacy; frequent itemset mining; FIM; top-k itemsets; privacy protection.

DOI: 10.1504/IJES.2019.102436

International Journal of Embedded Systems, 2019 Vol.11 No.5, pp.624 - 633

Received: 21 Sep 2018
Accepted: 09 Nov 2018

Published online: 24 Sep 2019 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article