Title: Mining recent frequent cooccurring items from transaction stream
Authors: Sumit Misra; Soumyadeep Thakur; Manosij Ghosh; Sanjoy Kumar Saha
Addresses: Department of Computer Science and Engineering, Jadavpur University, Kolkata, 700032, India ' Department of Computer Science and Engineering, Jadavpur University, Kolkata, 700032, India ' Department of Computer Science and Engineering, Jadavpur University, Kolkata, 700032, India ' Department of Computer Science and Engineering, Jadavpur University, Kolkata, 700032, India
Abstract: The work presents a simple but novel scheme to determine the recent frequent cooccurring items from transaction streams. It enables to follow the current trend, pattern of occurrence of the items and also help in recommendation. Proposed methodology works on a sequence of transaction forming a time-window. Based on the items present in a transaction, item-sets are formed and encoded. An in-memory structure maintains the frequency of the item-sets. Frequency is time-decayed count that prioritises the recent occurrences. To limit the size of in-memory structure, a pruning operation is carried out to remove the less frequent item-sets. Threshold for pruning is chosen dynamically. Impacts of decaying parameter and window size are also analysed. Experiment has been carried out on two datasets and performance has been compared with two recent works. Proposed methodology provides a better accuracy. Low time and space complexity make it suitable for stream analytic.
Keywords: recent frequent item-set; time decaying count; item pruning; stream analytic.
International Journal of Data Science, 2019 Vol.4 No.4, pp.288 - 304
Received: 12 Oct 2018
Accepted: 14 Aug 2019
Published online: 22 Feb 2020 *