Introduction to Web Mining

Spring 2013

What is data mining?

  • Data mining is

    • extraction of useful patterns from data sources, e.g., databases, texts, web, images, etc.

  • Patterns must be:

    • valid, novel, potentially useful, understandable

Classic data mining tasks

  • Classification:

    mining patterns that can classify future (new) data into known classes.

  • Association rule mining

    mining any rule of the form X → Y, where X and Y are sets of data items.

  • Clustering

    identifying a set of similarity groups in the data
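
A rule X → Y is usually judged by its support and confidence. The sketch below is a minimal illustration of those two measures, not a full mining algorithm; the toy transactions and the function names `support` and `confidence` are assumptions made for this example and do not come from the slides.

```python
# Hypothetical toy transactions, invented for illustration only.
TRANSACTIONS = [
    frozenset({"bread", "milk"}),
    frozenset({"bread", "butter", "milk"}),
    frozenset({"bread", "butter"}),
    frozenset({"milk"}),
]


def support(itemset, transactions):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = frozenset(itemset)
    hits = sum(1 for t in transactions if itemset <= t)
    return hits / len(transactions)


def confidence(x, y, transactions):
    """Estimate of P(Y | X) for the rule X -> Y: support(X and Y) / support(X)."""
    x, y = frozenset(x), frozenset(y)
    support_x = support(x, transactions)
    if support_x == 0:
        return 0.0  # the rule never applies, so report zero confidence
    return support(x | y, transactions) / support_x


rule_support = support({"bread", "milk"}, TRANSACTIONS)
rule_confidence = confidence({"bread"}, {"milk"}, TRANSACTIONS)
print(f"bread -> milk: support={rule_support:.2f}, confidence={rule_confidence:.2f}")
```

A real miner (Apriori, for example) would enumerate candidate itemsets and keep only the rules whose support and confidence exceed user-chosen thresholds.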

Classic data mining tasks (contd)

  • Sequential pattern mining:

    A sequential rule A → B says that event A will be immediately followed by event B with a certain confidence.

  • Deviation detection:

    discovering the most significant changes in data

  • Data visualization

CS583, Bing Liu, UIC
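
As a rough illustration of the confidence of a sequential rule A → B, the sketch below counts how often A is immediately followed by B across a set of event sequences. The sequences, the function name `sequential_confidence`, and the choice to count every occurrence of A (including one at the end of a sequence) are assumptions for this example; the slides do not fix a precise definition.

```python
# Hypothetical click sequences, invented for illustration only.
SEQUENCES = [
    ["home", "search", "product", "cart"],
    ["home", "product", "cart"],
    ["search", "product", "home"],
]


def sequential_confidence(a, b, sequences):
    """Share of occurrences of `a` that are immediately followed by `b`."""
    a_count = 0
    ab_count = 0
    for seq in sequences:
        for i, event in enumerate(seq):
            if event == a:
                a_count += 1
                # Only the very next event counts, matching "immediately followed".
                if i + 1 < len(seq) and seq[i + 1] == b:
                    ab_count += 1
    return ab_count / a_count if a_count else 0.0


rule_conf = sequential_confidence("product", "cart", SEQUENCES)
print(f"product -> cart: confidence={rule_conf:.2f}")
```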

Why is data mining important?

  • Huge amount of data

    • How can we make the best use of the data?

    • Knowledge discovered from data can be used for competitive advantage.

  • Many interesting things that one wants to find cannot be found using database queries, e.g.,

    “find people likely to buy my products”


  • The Web is an Internet-based information system that allows users of one computer to access information stored on another through the Internet.

  • Client-server model, hypertext documents

  • Invented in 1989 by Tim Berners-Lee at CERN with HTTP/HTML

  • Mosaic (1993), Netscape (1994), Internet Explorer (1995)

  • Related to the Internet (ARPANET, TCP/IP)

Web mining

  • Traditional data mining

    • data is structured and relational

    • well-defined tables, columns, rows, keys, and constraints.

  • Web data

    • readily available data rich in features and patterns

    • Content/link/usage data
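
To make the distinction between content data and link data concrete, here is a minimal sketch that pulls both out of one page using Python's standard `html.parser` module. The class name `PageParser` and the sample HTML are hypothetical; usage data would come from server logs rather than from the page itself, so it is not shown.

```python
from html.parser import HTMLParser


class PageParser(HTMLParser):
    """Collects visible text (content data) and href targets (link data)."""

    def __init__(self):
        super().__init__()
        self.links = []
        self.text_parts = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs for the opening tag.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)

    def handle_data(self, data):
        self.text_parts.append(data)


# Hypothetical sample page, invented for illustration only.
PAGE = (
    "<html><body><p>See <a href=\"/a.html\">A</a> and "
    "<a href=\"http://example.com/b\">B</a>.</p></body></html>"
)

parser = PageParser()
parser.feed(PAGE)
links = parser.links
text = "".join(parser.text_parts)
print(links)
print(text)
```

The extracted links could feed link analysis, while the extracted text could feed classification or clustering.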

Topic Description

  • Introduction to basic data mining: association and sequential mining, classification, clustering

  • Crawling, Web search and information retrieval

  • Social network analysis

  • Structured data extraction, information integration

  • Opinion mining and sentiment analysis

  • Web usage mining

Related fields

  • Web mining is a multidisciplinary field:

    • Machine learning

    • Information retrieval

    • Natural language processing