120 likes | 453 Views
Summarization of Frequent Pattern Mining . What is FPM?. Why being frequent is so important? Application of FPM Decision make/Business Software Debugging Bioinformatics Other data mining tasks Indexing Clustering/Classification/Association Rule . What have been done.
E N D
What is FPM? • Why being frequent is so important? • Application of FPM • Decision make/Business • Software Debugging • Bioinformatics • Other data mining tasks • Indexing • Clustering/Classification/Association Rule
What have been done • Frequent Itemset Mining • Frequent Sequential Pattern Mining • Frequent Subgraph Mining • Frequent Tree Mining • Mining A Single Large Graph • Frequent motifs
FPM is a way to think E A A E B B A A A B B A A B B F E A A B C E A B F C C D D F C C D C D C D D A D F D C A B D C
Algorithm Foundations • Apriori Property • Enumeration Algorithm • Level-wise search • Depth-first search • Data structure • For Patterns • For Data
Apriori R. Agrawal and R. Srikant. Fast algorithms for mining association rules. VLDB, 487-499, 1994
Resource and Tools • Important FPM websites • FIMI workshop website • http://fimi.cs.helsinki.fi/ • Mining Structure Data website • http://hms.liacs.nl/graphs.html • Commercial Databases • Oracle, IBM DB2, SQL Server • General Data Mining Information • KDDNuggets (general/job/software, etc) • Weka (www.cs.waikato.ac.nz/ml/weka/)
Why FPM does not work? • Too many patterns? • What can we do? • Pattern Pruning • Additional constraints? • Pattern summarization • Representative Patterns? • Pattern Ranking
What is missing • The common foundation for FPM, clustering, classification, etc… • FPM formalization • language/compiler/automatic discovery • FPM understanding • How and why they are being generated? • The relationship between dataset and pattern