Title: Structures of frequent itemsets and classifying structures of association rule set by order relations
Authors: Anh Tran; Tin Truong; Bac Le
Addresses: Department of Mathematics and Computer Science, University of Dalat, 01 Phu Dong Thien Vuong, Dalat, Vietnam ' Department of Mathematics and Computer Science, University of Dalat, 01 Phu Dong Thien Vuong, Dalat, Vietnam ' Department of Computer Science, University of Science, VNU – Ho Chi Minh, 227 Nguyen Van Cu, District 5, Ho Chi Minh city, Vietnam
Abstract: This paper shows a mathematical foundation for almost important features in the problem of discovering knowledge by association rules. The sets of itemsets and association rules are partitioned into disjoint classes by two appropriate equivalence relations based on closures. The structure and unique representation of frequent itemsets are figured out through their generators and corresponding eliminable itemsets. Due to this structure, each equivalence rule class is split into different sets of basic and consequence rules according to an order relation. Indeed, the basic set comprises minimal elements (basic rules) whose forms are explicitly showed. Then, we propose operators to non-repeatedly deduce all consequence rules by adding, deleting or moving appropriate eliminable itemsets in both sides of basic rules. Further, we show that mining association rules based on a new order relation, min relation, is better than four other ones in terms of reductions in the time to extract basic rules, their cardinalities and rule lengths. These theoretical results are proven to be reliable. Experimental study on many benchmark databases shows the efficiency of the corresponding algorithms. Our approach (e.g., the partitions of the frequent itemset and association rule sets) is suitable to deal with big data because it can be exploited in parallel and distributed environment.
Keywords: association rules mining; basic rules; rule generators; equivalence relations; order relations; frequent itemsets; intelligent information; knowledge discovery; partitioning; big data.
DOI: 10.1504/IJIIDS.2014.068339
International Journal of Intelligent Information and Database Systems, 2014 Vol.8 No.4, pp.295 - 323
Received: 01 Apr 2013
Accepted: 08 Jan 2014
Published online: 10 Apr 2015 *