1 / 16

Moment: Maintaining Closed Frequent Itemsets over a Stream Sliding window

Moment: Maintaining Closed Frequent Itemsets over a Stream Sliding window. Yun Chi, Haixun Wang, Philip S. Yu, Richard R. Muntz, ICDM 2004. Adviser: Jia-Ling Koh Speaker: Shu-Ning Shin Date: 2005.5.6. Introduction.

quasim
Download Presentation

Moment: Maintaining Closed Frequent Itemsets over a Stream Sliding window

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Moment: Maintaining Closed Frequent Itemsets over a Stream Sliding window Yun Chi, Haixun Wang, Philip S. Yu, Richard R. Muntz, ICDM 2004. Adviser: Jia-Ling Koh Speaker: Shu-Ning Shin Date: 2005.5.6

  2. Introduction • Algorithm Moment: Mime closed frequent itemsets in the most N transactions in data stream. • Data structure, closed enumeration tree (CET), maintain: • Closed frequent itemsets, • Boundary between closed frequent itemsets and the rest.

  3. Problem • Lexicographic order: • Closed frequent itemset: none of its supersets has the same support. • Items Σ={A, B, C, D}, window size N=4, minimum support s = ½.

  4. CET (1) • Four types of itemsets node: • Infrequent: • Infrequent gateway node, dashed circle — D. • Frequent but not closed: • Unpromising gateway node, dashed rectangle — AC. • Intermediate node — A. • Closed: • Closed node, solid rectangle — ABC.

  5. CET (2) • Property 1: if nI is an infrequent gateway node, then any node nJ where represents an infrequent itemset. • Property 2: if nI is an unpromising gateway node, then nI is not closed, and none of nI’s descendents is closed. • Property 3: if nI is an intermediate node, then nI is not closed and nI has closed descendents.

  6. Moment: Build CET (1) • Node nI has information : • itemset I, node type, support, tid_sum • Hash table: • store all closed frequent itemsets • check if nI is an unpromising gateway node, if exit a nJ where • hash on the (support, tid_sum) of nI

  7. Moment: Build CET (2)

  8. A B C D 0 0 0 0 Moment: Build CET (3) • Items Σ={A, B, C, D}, Explore(n{i}), for each i in Σ. ψ

  9. Moment: Add CET (1)

  10. AD 0 0 CD Moment: Add CET (2) • Adding a transaction tid 5: • Call Addition(nψ, t5, D, minsup) ψ 4 C 2 D 4 A F={D} AD 1 2 CD 3 AC 5 A, C, D

  11. Moment: Delete CET (1)

  12. Moment: Delete CET (2) • Deleting a transaction tid 1: F={D} 1 3 C D

  13. Moment: Update CET (3) • Deleting a transaction tid 2: 3 2 B A 2 AB

  14. Experiment (1) • Dataset: T20I4D100K • Window Size N = 100000

  15. Experiment (2)

  16. Experiment (3) • Real Datase: BMS-WebView-1 • Items: 497, transactions: 59602 • Window Size N = 50000

More Related