1 / 13

Pertemuan XIV

Pertemuan XIV. FUNGSI MAYOR Assosiation. What Is Association Mining?. Association rule mining: Finding frequent patterns, associations, correlations, or causal structures among sets of items or objects in transaction databases, relational databases, and other information repositories.

moana
Download Presentation

Pertemuan XIV

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Pertemuan XIV FUNGSI MAYOR Assosiation

  2. What Is Association Mining? • Association rule mining: • Finding frequent patterns, associations, correlations, or causal structures among sets of items or objects in transaction databases, relational databases, and other information repositories. • Applications: • Basket data analysis, cross-marketing, catalog design, loss-leader analysis, clustering, classification, etc. • Examples. • Rule form: “Body ® Head [support, confidence]”. • buys(x, “diapers”) ® buys(x, “beers”) [0.5%, 60%] • major(x, “CS”) ^ takes(x, “DB”) ® grade(x, “A”) [1%, 75%]

  3. Association Rules • Wal-Mart customers who purchase Barbie dolls have a 60% likelihood of also purchasing one of three types of candy bars [Forbes, Sept 8, 1997] • Customers who purchase maintenance agreements are very likely to purchase large appliances (author experience) • When a new hardware store opens, one of the most commonly sold items is toilet bowl cleaners (author experience) • So what…

  4. Tugasasosiasi data mining adalahmenemukanatribut yang munculdalamsatuwaktu.

  5. Rule Measures: Support and Confidence • Find all the rules X & Y  Z with minimum confidence and support • support, s, probability that a transaction contains {X  Y  Z} • confidence, c, conditional probability that a transaction having {X  Y} also contains Z Customer buys both Customer buys diaper Customer buys beer • Let minimum support 50%, and minimum confidence 50%, we have • A  C (50%, 66.6%) • C  A (50%, 100%)

  6. Association Rule Mining • Given a set of transactions, find rules that will predict the occurrence of an item based on the occurrences of other items in the transaction Market-Basket transactions Example of Association Rules {Diaper}  {Beer},{Milk, Bread}  {Eggs,Coke},{Beer, Bread}  {Milk}, Implication means co-occurrence, not causality!

  7. Definition: Frequent Itemset • Itemset • A collection of one or more items • Example: {Milk, Bread, Diaper} • k-itemset • An itemset that contains k items • Support count () • Frequency of occurrence of an itemset • E.g. ({Milk, Bread,Diaper}) = 2 • Support • Fraction of transactions that contain an itemset • E.g. s({Milk, Bread, Diaper}) = 2/5 • Frequent Itemset • An itemset whose support is greater than or equal to a minsup threshold

  8. Example: Definition: Association Rule • Association Rule • An implication expression of the form X  Y, where X and Y are itemsets • Example: {Milk, Diaper}  {Beer} • Rule Evaluation Metrics • Support (s) • Fraction of transactions that contain both X and Y • Confidence (c) • Measures how often items in Y appear in transactions thatcontain X

  9. Mining Association Rules Example of Rules: {Milk,Diaper}  {Beer} (s=0.4, c=0.67){Milk,Beer}  {Diaper} (s=0.4, c=1.0) {Diaper,Beer}  {Milk} (s=0.4, c=0.67) {Beer}  {Milk,Diaper} (s=0.4, c=0.67) {Diaper}  {Milk,Beer} (s=0.4, c=0.5) {Milk}  {Diaper,Beer} (s=0.4, c=0.5) • Observations: • All the above rules are binary partitions of the same itemset: {Milk, Diaper, Beer} • Rules originating from the same itemset have identical support but can have different confidence • Thus, we may decouple the support and confidence requirements

  10. The Apriori Algorithm — Example Database D L1 C1 Scan D C2 C2 L2 Scan D L3 C3 Scan D

  11. AlgoritmaAsosiasi MBA (Market Basket Analysis) Langkah-langkahalgoritma MBA: • Tetapkanbesaran darikonsepitemsetsering, nilai minimum besaran support danbesaran confidence yang diinginkan. • Menetapkansemuaitemsetsering, yaituitemset yang memilikifrekuensiitemset minimal sebesarbilangan  sebelumnya. • Dari semuaitemsetsering, hasilkanaturanasosiasi yang memenuhinilai minimum support dan confidence

  12. Support (AB) = P(AB) Confidence(AB) = P(B|A)

More Related