Page 314 -
P. 314

3:20 Page 277
                                                            2011/6/1
                               13-ch06-243-278-9780123814791
                         HAN
                                                                                    #35
                                                                           6.6 Bibliographic Notes  277


                               [MTV94]. A joint publication combining these works later appeared in Agrawal,
                                                       +
                               Mannila, Srikant et al. [AMS 96]. A method for generating association rules from
                               frequent itemsets is described in Agrawal and Srikant [AS94a].
                                 References for the variations of Apriori described in Section 6.2.3 include the
                               following. The use of hash tables to improve association mining efficiency was stud-
                               ied by Park, Chen, and Yu [PCY95a]. The partitioning technique was proposed by
                               Savasere, Omiecinski, and Navathe [SON95]. The sampling approach is discussed in
                               Toivonen [Toi96]. A dynamic itemset counting approach is given in Brin, Motwani,
                               Ullman, and Tsur [BMUT97]. An efficient incremental updating of mined association
                               rules was proposed by Cheung, Han, Ng, and Wong [CHNW96]. Parallel and dis-
                               tributed association data mining under the Apriori framework was studied by Park,
                               Chen, and Yu [PCY95b]; Agrawal and Shafer [AS96]; and Cheung, Han, Ng, et al.
                                    +
                               [CHN 96]. Another parallel association mining method, which explores itemset clus-
                               tering using a vertical database layout, was proposed in Zaki, Parthasarathy, Ogihara,
                               and Li [ZPOL97].
                                 Other scalable frequent itemset mining methods have been proposed as alterna-
                               tives to the Apriori-based approach. FP-growth, a pattern-growth approach for mining
                               frequent itemsets without candidate generation, was proposed by Han, Pei, and Yin
                               [HPY00] (Section 6.2.4). An exploration of hyper structure mining of frequent patterns,
                                                                            +
                               called H-Mine, was proposed by Pei, Han, Lu, et al. [PHL 01]. A method that integrates
                               top-down and bottom-up traversal of FP-trees in pattern-growth mining was proposed
                               by Liu, Pan, Wang, and Han [LPWH02]. An array-based implementation of prefix-
                               tree structure for efficient pattern growth mining was proposed by Grahne and Zhu
                               [GZ03b]. Eclat, an approach for mining frequent itemsets by exploring the vertical data
                               format, was proposed by Zaki [Zak00]. A depth-first generation of frequent itemsets by
                               a tree projection technique was proposed by Agarwal, Aggarwal, and Prasad [AAP01].
                               An integration of association mining with relational database systems was studied by
                               Sarawagi, Thomas, and Agrawal [STA98].
                                 The mining of frequent closed itemsets was proposed in Pasquier, Bastide, Taouil,
                               and Lakhal [PBTL99], where an Apriori-based algorithm called A-Close for such min-
                               ing was presented. CLOSET, an efficient closed itemset mining algorithm based on
                               the frequent pattern growth method, was proposed by Pei, Han, and Mao [PHM00].
                               CHARM by Zaki and Hsiao [ZH02] developed a compact vertical TID list structure
                               called diffset, which records only the difference in the TID list of a candidate pattern
                               from its prefix pattern. A fast hash-based approach is also used in CHARM to prune
                               nonclosed patterns. CLOSET+ by Wang, Han, and Pei [WHP03] integrates previously
                               proposed effective strategies as well as newly developed techniques such as hybrid tree-
                               projection and item skipping. AFOPT, a method that explores a right push operation on
                               FP-trees during the mining process, was proposed by Liu, Lu, Lou, and Yu [LLLY03].
                               Grahne and Zhu [GZ03b] proposed a prefix-tree–based algorithm integrated with
                               array representation, called FPClose, for mining closed itemsets using a pattern-growth
                               approach.
                                                         +
                                 Pan, Cong, Tung, et al. [PCT 03] proposed CARPENTER, a method for finding
                               closed patterns in long biological data sets, which integrates the advantages of vertical
   309   310   311   312   313   314   315   316   317   318   319