Title: Privacy-preserving global user profile construction through federated learning
Authors: Zheng Huo; Teng Wang; Yilin Fan; Ping He
Addresses: Information Technology School, Hebei University of Economics and Business, Shijiazhuang, Hebei Province, China ' CECT Network Communication Research Institute, Shijiazhuang, Hebei Province, China ' Information Technology School, Hebei University of Economics and Business, Shijiazhuang, Hebei Province, China ' Information Technology School, Hebei University of Economics and Business, Shijiazhuang, Hebei Province, China
Abstract: User profiles are derived from big data left on the internet through machine learning algorithms. However, threats of data privacy leakages restrict access to the data in centralised machine learning. Federated learning (FL) can avoid privacy leakage during the data collection phase. Herein, we propose an algorithm for constructing a privacy-preserving global user profile (PPGUP) through FL in a vertical data-segmentation scenario. Participants train local clusters on their data using the CLIQUE algorithm, and carefully encrypt cluster parameters using Paillier encryption to protect parameters from an untrusted aggregator. The aggregator then makes intersections over the cluster parameters without decryption, to construct a GUP. The experiment results show that precision of PPGUP reaches 80% when is set to 1.5, which is improved by 50% compared with DP-UserPro. The runtime exhibits a linear growth with the growth of the dataset size and the increase in the number of participants.
Keywords: differential privacy; federated learning; CLIQUE algorithm; Paillier encryption; user profile; private-set-intersection; participants; aggregate server; privacy budget; vertical data segmentation.
DOI: 10.1504/IJCSE.2023.129739
International Journal of Computational Science and Engineering, 2023 Vol.26 No.2, pp.199 - 209
Received: 13 Aug 2021
Accepted: 24 Nov 2021
Published online: 22 Mar 2023 *