1 / 15

Agenda

Agenda. What is (Web) data mining? And what does it have to do with privacy? – a simple view – Examples of data mining and "privacy-preserving data mining": Association-rule mining (& privacy-preserving AR mining) Collaborative filtering (& privacy-preserving collaborative filtering)

minor
Download Presentation

Agenda

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Agenda • What is (Web) data mining? And what does it have to do with privacy? – a simple view – • Examples of data mining and "privacy-preserving data mining": • Association-rule mining (& privacy-preserving AR mining) • Collaborative filtering (& privacy-preserving collaborative filtering) • A second look at ...privacy • A second look at ...Web / data mining • The goal: More than modelling and hiding – Towards a comprehensive view of Web mining and privacy. Threats, opportunities and solution approaches. • An outlook: Data mining for privacy

  2. Privacy Problems: Example 1 • Technical background of the problem: • The dataset allows for Web mining (e.g., which search queries lead to which site choices), • it violates k-anonymity (e.g. "Lilburn"  a likely k = #inhabitants of Lilburn)

  3. Privacy Problems: Example 2 Where do people live who will buy the Koran soon? • Technical background of the problem: • A mashup of different data sources • Amazon wishlists • Yahoo! People (addresses) • Google Maps • each with insufficient k-anonymity, allows for attribute matching and thereby inferences

  4. Predicting political affiliation from Facebook profile and link data (1): Most Conservative Traits Privacy Problems: Example 3 Lindamood et al. 09 & Heatherly et al. 09

  5. Predicting political affiliation from Facebook profile and link data (2): Most Liberal Traits per Trait Name Lindamood et al. 09 & Heatherly et al. 09

  6. "Privacy-preserving Web mining" example: find patterns, unlink personal data • Volvo S40 website targets people in 20s • Are visitors in their 20s or 40s? • Which demographic groups like/dislike the website? • An example of the "Randomization Approach" to PPDM: R. Agrawal and R. Srikant, "Privacy Preserving Data Mining", SIGMOD 2000.

  7. Randomization Approach Overview 30 | 70K | ... 50 | 40K | ... ... Randomizer Randomizer 65 | 20K | ... 25 | 60K | ... ... Reconstruct distribution of Age Reconstruct distribution of Salary ... Data Mining Algorithms Model

  8. Seems to work well!

  9. What is collaborative filtering? • "People like what • people like them • like" • – regardless of support and confidence

  10. User-based Collaborative Filtering Idea: People who agreed in the past are likely to agree again To predict a user’s opinion for an item, use the opinion of similar users Similarity between users is decided by looking at their overlap in opinions for other items Next step: build a model of user types  "global model" rather than "local patterns" as mining result

  11. 1. Privacy as confidentiality:"the right to be let alone" – and to hide data Data Is this all there is to privacy?

  12. 2. Privacy as control:informational self-determination • e.g. data privacy: "the right of the individual to decide what information about himself should be communicated to others and under what circumstances" (Westin, 1970) • behind much of data-protection legislation (see Eleni Kosta‘s talk) Data § § Don‘t do THIS !

  13. Discussion item: What is this an example of?Tracing anonymous edits in Wikipedia http://wikiscanner.virgil.gr/

  14. [Method: Attribute matching]

  15. Results (an example)

More Related