Treffer: 2004 ACM Symposium on Applied Computing POSTER ABSTRACT SQL Based Frequent Pattern Mining without Candidate Generation

Title:
2004 ACM Symposium on Applied Computing POSTER ABSTRACT SQL Based Frequent Pattern Mining without Candidate Generation
Contributors:
The Pennsylvania State University CiteSeerX Archives
Collection:
CiteSeerX
Subject Terms:
Document Type:
Fachzeitschrift text
File Description:
application/pdf
Language:
English
Rights:
Metadata may be used without restrictions as long as the oai identifier remains attached to it.
Accession Number:
edsbas.F960E043
Database:
BASE

Weitere Informationen

Scalable data mining in large databases is one of today’s real challenges to database research area. The integration of data mining with database systems is an essential component for any successful large-scale data mining application. A fundamental component in data mining tasks is finding frequent patterns in a given dataset. Most of the previous studies adopt an Apriori-like candidate set generation-andtest approach. However, candidate set generation is still costly, especially when there exist prolific patterns and/or long patterns. In this study we present an evaluation of SQL based frequent pattern mining with a novel frequent pattern growth (FP-growth) method, which is efficient and scalable for mining both long and short patterns without candidate generation. We examine some techniques to improve performance. In addition, we have made performance evaluation on commercial DBMS (IBM DB2 UDB EEE V8).