1 / 32

Data Mining

Data Mining. - A Powerful Computing Technology. Department of Computer Science Wayne State University. Road Map. Overview Recommender Systems Clustering Classification Association Analysis PageRank Social Networks. Different Forms of Data. Text Data. Different Forms of Data.

altessa
Download Presentation

Data Mining

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Data Mining - A Powerful Computing Technology Department of Computer Science Wayne State University

  2. Road Map • Overview • Recommender Systems • Clustering • Classification • Association Analysis • PageRank • Social Networks

  3. Different Forms of Data • Text Data

  4. Different Forms of Data • Image Data

  5. Different Forms of Data • Video Data

  6. Different Forms of Data • Network Data

  7. Why Data Mining is Important? • Difficulty of identifying patterns in big data. • Extracting only WANTED data within a short time. We are drowning in data, but starving for knowledge!

  8. How Data Mining can help? • We do not care if GOOGLE has more than billion web pages. • We only care about the information that is useful for us.

  9. What is Data Mining • The analysis of data to extract useful patterns or information from a large data collection. • Also known as: Knowledge Discovery in Databases • Learn More: http://en.wikipedia.org/wiki/Data_mining Automated Analysis of Massive Data

  10. Applications

  11. Data Miner • An educational tool that teaches you Data Mining techniques. • Consists of two basic parts such that, • Demonstration • Explains how to work with the interactive part. • Interactive part • Teaching data mining through user interaction.

  12. Recommender Systems • Goal: present information items that are likely to be of interest to the user. • Lots of online products, books, movies, etc. • Reduce my choices…please!!!! • Learn More: http://pespmc1.vub.ac.be/collfilt.html

  13. Recommender Systems • Netflix Recommender System

  14. Do you watch movies using Then you might like So on you might like these too Or may be you like If you have watched this movie This might catch your interest too

  15. Amazon Recommender System • Amazon Recommender System

  16. Data Miner - Recommender System • Recommendation based on content

  17. Recommendation

  18. Finding a Friend With Similar Taste YOU See what they like Measure the similarity Select your Neighbors

  19. Measuring the Similarity

  20. Cluster Analysis • Cluster: • A collection of data objects • Cluster Analysis: • Grouping some given objects with similar attributes. • Similar (or related) to one another within the same group • Dissimilar (or unrelated) to the objects in other groups • Learn More: http://home.dei.polimi.it/matteucc/Clustering/tutorial_html

  21. Cluster Analysis • Data Set: • Clusters: Flowers Fruit

  22. Clustering • Now you have seen Flowers and Fruits visually. • So to which cluster, would you add this object? Flowers Fruit Yes, to FRUIT!!

  23. Classification • Assigning given items to a known class which have items with similar attributes. • Explains through Decision Trees.

  24. Classification • PURE Classification. • Each branch contains animals belong to a single CLASS.

  25. Classification • You have learned what is Mammal and what is Bird. • Can you tell what is this? Yes, this is indeed a BIRD!!

  26. Association Analysis • Discover interesting relationships in a set of transactions. • Understand relationships between items. E.g. • If a customers buys shoes, then 10% of the times they also buys socks. • 60% of all shoppers will buy bread when they also purchase a pint of milk.

  27. Association Analysis • Items: • Transactions:

  28. PageRank • Links from popular and related web sites increases the popularity of the given web site. Yahoo Amazon Pillsbury YouTube Billboard Pandora Dominos Pizza Crayola Pizza Hut Danskin Shelfari

  29. Search Results • When searching on Google, it will list web sites related to the input text according to their importance.

  30. Social Networks • Social networking websites allow users to be part of a virtual community. E.g. Facebook, Twitter, MySpace • They provide users with simple tools to create a custom profile with text and pictures. • Users can share their lives with other people through these networks.

  31. Social Networks • Learn More: • http://en.wikipedia.org/wiki/Social_network • http://pc.net/glossary/definition/socialnetworking

  32. Thank You !! Enjoy the Day…

More Related