Title: Weighted edge sampling for static graphs
Authors: Muhammad Irfan Yousuf; Raheel Anwar
Addresses: Department of Computer Science, University of Engineering and Technology, Lahore, Pakistan ' Institute for Mathematics and Scientific Computing, Karl Franzens University, Graz, Austria
Abstract: Graph sampling provides an efficient yet inexpensive solution for analysing large graphs. The purpose of sampling a graph is to extract a small representative subgraph from a big graph so that the sample can be used in place of the big graph for studying and analysing it. In this paper, we propose a new sampling method called weighted edge sampling. In this method, we give equal weight to all the edges in the beginning. During the sampling process, we sample an edge with the probability proportional to its weight. When an edge is sampled, we increase the weight of its neighbouring edges and this increases their probability to be sampled. Our method extracts the neighbourhood of a sampled edge more efficiently than previous approaches. We evaluate the efficacy of our sampling approach empirically using several real-world datasets. We find that our method produces better samples than the previous approaches. Our results show that our samples better estimate the degree and path length of the original graphs whereas our samples are less efficient in estimating the clustering coefficient of a graph.
Keywords: graph sampling; edge sampling; edge weight; graph induction.
DOI: 10.1504/IJDMMM.2023.134612
International Journal of Data Mining, Modelling and Management, 2023 Vol.15 No.4, pp.355 - 368
Received: 04 Jun 2022
Accepted: 03 Jan 2023
Published online: 30 Oct 2023 *