Mining Association Rules in Text

Share "Mining Association Rules in Text"

N/A

Protected

學年: 2021

Info

下載

Protected

Academic year: 2021

Share "Mining Association Rules in Text"

Copied!

加載中.... (立即查看全文)

立即下載 ( 1 頁 )

全文

(1)

Mining Association Rules in Text

蔣以仁

Pi-Chin Fan;I-Jen Chiang;Ya Wen Tan;Te Chang Huang

Abstract

In this paper, we propose a new algorithm named Multipass with Inverted Hashing and Pruning (MIHP) for mining association rules between words in text databases. The characteristics of text databases are quite different from those of retail

transaction databases, and existing mining algorithms cannot handle text

databases efficiently because of the large number of itemsets (i.e., words) that need to be counted. Two well-known mining algorithms, the Apriori algorithm [1] and the Direct Hashing and Pruning (DHP) algorithm [8], are evaluated in the context of mining text databases, and are compared with the proposed MIHP algorithm. It has been shown that the MIHP algorithm has better performance for large text

參考文獻

立即下載 ( PDF - 1 頁 - 33.35 KB )

相關文件

2 A smoothing-type algorithm

In summary, the main contribution of this paper is to propose a new family of smoothing functions and correct a flaw in an algorithm studied in [13], which is used to guarantee

A Novel Algorithm for Volume-Preserving Parameterizations of 3-Manifolds

In this paper, we develop a novel volumetric stretch energy minimization algorithm for volume-preserving parameterizations of simply connected 3-manifolds with a single boundary

Introduction In this paper, we study the solvabilities of three optimization problems associated with second-order cone

Optim. Humes, The symmetric eigenvalue complementarity problem, Math. Rohn, An algorithm for solving the absolute value equation, Eletron. Seeger and Torki, On eigenvalues induced by

Topic Hierarchy Generation for Text Segments: A Practical Web-based Approach

Additional Key Words and Phrases: Topic Hierarchy Generation, Text Segment, Hierarchical Clustering, Partitioning, Search-Result Snippet, Text Data

Toward the Design of a Bioinformatic Machine V]p@Tqt h j

This bioinformatic machine is a PC cluster structure using special hardware to accelerate dynamic programming, genetic algorithm and data mining algorithm.. In this machine,

We try to explore category and association rules of customer questions by applying customer analysis and the combination of data mining and rough set theory

We try to explore category and association rules of customer questions by applying customer analysis and the combination of data mining and rough set theory.. We use customer

中華大學

So, we develop a tool of collaborative learning in this research, utilize the structure of server / client, and combine the functions of text and voice communication via

中華大學

It is concluded that the proposed computer aided text mining method for patent function model analysis is able improve the efficiency and consistency of the result with

上傳您的學習材料以下載所有文件。

您的文件將被豐富，在 9lib TW 上共享以幫助學習。

相關文件

Electronic Health Records in Ambulatory Care

目錄基金會網站設計與簡易系統設計

319

二、華嚴與法華平等相成的判教論

Transmission eigenvalues for the electromagnetic scattering problem in pseudo-chiral media and a practical reconstruction method

Table of Content

188

This document is originally written in Chinese. In case of discrepancy between the text of this translated version and that of the Chinese version, the Chinese text shall prevail.

In the text?

jt The Analysis and Research of Clustering Algorithms and Clusters Parameters sEtksERPQ