Privacy Preserving Market Basket Data Analysis. Ling Guo, Songtao Guo, Xintao Wu University of North Carolina at Charlotte. Market Basket Data. …. 1: presence 0: absence. Association rule (R.Agrawal SIGMOD 1993) with support and confidence .
Ling Guo, Songtao Guo, Xintao Wu
University of North Carolina at Charlotte
1: presence 0: absence
2 x 2 contingency table
Objective measures for A=>B
: Cheated in the exam : Didn’t cheat in the exam
Cheated in exam
Purpose: Get the proportion( ) of population
members that cheated in the exam.
Do you belong to A? (p)
Do you belong to ?(1-p)
Unbiased estimate of is:
e.g., for 2 variables
stands for Kronecker product
diagonal matrix with elements
A: Milk B: Cereals
We can get the estimate, how accurate we can achieve?
Both are frequent set
Not frequent set
Rule 6 is falsely recognized from estimated value!
Lower& Upper bound
Frequent set with high confidence
Frequent set without confidence
Let be a random variable with expected value and finite
variance .Then for any real
Where: , , ,
Bounds of the support vs. varying p
Still be able to derive the strong dependent itemsets from the randomized data
No false positive