Data mining
Download
1 / 14

DATA MINING - PowerPoint PPT Presentation


  • 200 Views
  • Uploaded on

DATA MINING. Using Association Rules by Andrew Williamson. What is Data Mining?. A.K.A. – Knowledge Discovery in DataBases (KDD)

loader
I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
capcha
Download Presentation

PowerPoint Slideshow about 'DATA MINING' - richard_edik


An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript
Data mining l.jpg

DATA MINING

Using Association Rules

by

Andrew Williamson


What is data mining l.jpg
What is Data Mining?

  • A.K.A. – Knowledge Discovery in DataBases (KDD)

  • “Data mining is the automated extraction of hidden predictive information from databases. Data mining software allows users to analyze large databases. This can enable you to solve business decision problems, for data mining as an extension of statistics; while it doesn't solve your problems, it is the technology that can find your problems. You can build a predictive model of an ideal customer from your own databases and, using this information, build your marketing strategy or even determine the creation of new products and services.”

  • - www.biz-services-savvy.com


Slide3 l.jpg


An example of association l.jpg
An example of Association non-trivial extraction of implicit, previously unknown, and potentially useful information from data” ”.

“A simple example of data mining is its use in a retail sales department. If a store tracks the purchases of a customer and notices that a customer buys a lot of silk shirts, the data mining system will make a correlation between that customer and silk shirts…


Association example cont l.jpg
Association example cont. non-trivial extraction of implicit, previously unknown, and potentially useful information from data” ”.

…The sales department will look at that information and may begin direct mail marketing of silk shirts to that customer, or it may alternatively attempt to get the customer to buy a wider range of products. In this case, the data mining system used by the retail store discovered new information about the customer that was previously unknown to the company. “

  • - wikipedia.org


Association rules l.jpg
Association Rules non-trivial extraction of implicit, previously unknown, and potentially useful information from data” ”.

  • Using Association Rules, X  Y, helps to identify shopping trends over a given amount of time

  • 1) Support

  • 2) Confidence


Association rules cont l.jpg
Association Rules Cont. non-trivial extraction of implicit, previously unknown, and potentially useful information from data” ”.

  • 1) Support

    • The ratio of transactions that contain both an item X and an item Y, over all transactions.

      • P(X U Y)

    • Measures the significance of the given rule.


Association rules8 l.jpg
Association Rules.. non-trivial extraction of implicit, previously unknown, and potentially useful information from data” ”.

  • 2) Confidence

    • The ratio of transactions containing X, and also contains Y.

      • P(X U Y) / P( X )

    • Measures the strength of the correlation of the given rule.


Association rules9 l.jpg
Association Rules… non-trivial extraction of implicit, previously unknown, and potentially useful information from data” ”.

  • The Association rules X  Y with a support and confidence ratio of 50% or more are considered to be meaningful and therefore are kept, otherwise the rule will be discarded or ignored.


2 phases of association rules l.jpg
2 Phases of Association Rules non-trivial extraction of implicit, previously unknown, and potentially useful information from data” ”.

  • When considering an association rule to examine follow these 2 steps:

  • Phase 1

    • List together all rules with a high support ratio.

    • A.K.A. – The Frequent Itemset.

  • Phase 2

    • List together all rules with a high Confidence ratio from the Frequent Itemset group.


Extended association rules l.jpg
Extended Association Rules non-trivial extraction of implicit, previously unknown, and potentially useful information from data” ”.

  • Standard Association Rules express a correlation between values of a single dimensional schema.

  • While, more meaningful associations may be discovered when incorporating multi - dimensional schemas


Extended association rules cont l.jpg
Extended Association Rules cont. non-trivial extraction of implicit, previously unknown, and potentially useful information from data” ”.

  • A Simple Example:

    • Sleeping bags  Tents

      • This alone, seems obvious and not useful.

    • Sleeping bags  Tents ( region=north, season=summer)

      • more meaningful incorporating region and season of transactions.


Extended association rules cont13 l.jpg
Extended Association Rules cont.. non-trivial extraction of implicit, previously unknown, and potentially useful information from data” ”.

  • More meaningful information may be associated by augmenting the original Association Rules

    • X  Y (Z)

      • Transactions which satisfy Z and contain X, are more likely to contain Y as well.


Work cited l.jpg
Work Cited non-trivial extraction of implicit, previously unknown, and potentially useful information from data” ”.

  • www.wikipedia.org

  • http://www.biz-services-savvy.com

  • http://sba.luc.edu

  • http://www-128.ibm.com


ad