• 沒有找到結果。

Perfect Hashing Schemes for Mining Association Rules

N/A
N/A
Protected

Academic year: 2022

Share "Perfect Hashing Schemes for Mining Association Rules"

Copied!
1
0
0

加載中.... (立即查看全文)

全文

(1)
(2)

題名: Perfect Hashing Schemes for Mining Association Rules 作者: Chang, C. C.;Lin, C. Y

日期: 2005-03

上傳時間: 2009-12-17T06:58:37Z 出版者: Asia University

摘要: Hashing schemes are widely used to improve the performance of data mining association rules, as in the DHP algorithm that utilizes the hash table in identifying the validity of candidate itemsets according to the number of the table's bucket accesses. However, since the hash table used in DHP is plagued by the collision problem, the process of

generating large itemsets at each level requires two database scans, which leads to poor performance. In this paper we propose perfect hashing schemes to avoid collisions in the hash table. The main idea is to employ a refined encoding scheme, which transforms large itemsets into large 2-itemsets and thereby makes the application of perfect hashing feasible. Our experimental results demonstrate that the new method is also efficient (about three times faster than DHP), and scalable when the database size increases. We also propose another variant of the perfect hash scheme with reduced memory requirements.

The properties and performances of several perfect hashing schemes are also investigated and compared

參考文獻

相關文件

6 《中論·觀因緣品》,《佛藏要籍選刊》第 9 冊,上海古籍出版社 1994 年版,第 1

The first row shows the eyespot with white inner ring, black middle ring, and yellow outer ring in Bicyclus anynana.. The second row provides the eyespot with black inner ring

"Extensions to the k-Means Algorithm for Clustering Large Data Sets with Categorical Values," Data Mining and Knowledge Discovery, Vol. “Density-Based Clustering in

We try to explore category and association rules of customer questions by applying customer analysis and the combination of data mining and rough set theory.. We use customer

Furthermore, in order to achieve the best utilization of the budget of individual department/institute, this study also performs data mining on the book borrowing data

It is concluded that the proposed computer aided text mining method for patent function model analysis is able improve the efficiency and consistency of the result with

MPHP, a revised version of DHP, employs minimal perfect hashing functions (non-collision hashing functions) to generate frequent item sets directly without the need of

According to the related researches the methods to mine association rules, they need too much time to implement their algorithms; therefore, this thesis proposes an efficient