Web mining and Social Networking


Presentation Transcript


  1. Web mining and Social Networking Introduction Theoretical Backgrounds

  2. Introduction • Background • With the explosive growth of information on the Internet, the WWW has become a powerful platform for mining useful knowledge • Problems in Web-related research • Finding relevant information • Search engines – low precision and low recall • Finding needed information • Query-based search – does not handle homographs • Learning useful knowledge • Utilizing the Web as a knowledge base • Recommendation/personalization of information • Learning user navigational patterns • Web communities and social networking • Relationships among Web objects

  3. Introduction • Data Mining and Web Mining • Data Mining • Discovering hidden or previously unseen knowledge, in the form of patterns, in huge data sets • Web Mining • Applying data mining methods to induce and extract useful information from Web data • Web content mining • Web structure mining • Web usage mining • Semantic Web mining

  4. Introduction • Characteristics of Web Data • The data on the web is huge • The data is distributed • The data is unstructured • The data is dynamic • Web community and Social Networking • An aggregation of web pages, users, and data

  5. Theoretical Backgrounds • Web Data Model • Web data can be expressed as a matrix, a directed graph, a click sequence, and so on.
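
As a rough illustration of the three representations named on this slide, the sketch below turns a few click sequences into a session-by-page matrix and a weighted directed graph. The page names and sessions are made up for the example and do not come from the presentation.

```python
from collections import defaultdict

# Hypothetical click sequences: each list is one user's ordered page visits.
sessions = [
    ["home", "news", "sports"],
    ["home", "sports"],
    ["news", "home", "news"],
]

# Fixed column order for the matrix view.
pages = sorted({page for session in sessions for page in session})

# Matrix view: rows are sessions, columns are pages, entries are visit counts.
usage_matrix = [[session.count(page) for page in pages] for session in sessions]

# Directed-graph view: edge (a, b) is weighted by how often b was
# clicked immediately after a.
transitions = defaultdict(int)
for session in sessions:
    for a, b in zip(session, session[1:]):
        transitions[(a, b)] += 1

print(pages)
print(usage_matrix)
print(dict(transitions))
```

The same raw log therefore feeds matrix-based methods (similarity, SVD) and graph-based methods (centrality, prestige) without any change to the underlying data.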

  6. Theoretical Backgrounds • Similarity Functions • Correlation-based Similarity • Cosine-based Similarity
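
The formulas for this slide did not survive in the transcript. The sketch below assumes the usual definitions: cosine similarity as the normalized dot product, and correlation-based similarity as the Pearson correlation, which is the cosine of the mean-centered vectors. Zero-length (or constant) vectors are not handled.

```python
import math

def cosine_similarity(x, y):
    """Cosine of the angle between two equal-length numeric vectors."""
    dot = sum(a * b for a, b in zip(x, y))
    norm_x = math.sqrt(sum(a * a for a in x))
    norm_y = math.sqrt(sum(b * b for b in y))
    return dot / (norm_x * norm_y)

def pearson_similarity(x, y):
    """Pearson correlation: cosine similarity after subtracting each mean."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    centered_x = [a - mean_x for a in x]
    centered_y = [b - mean_y for b in y]
    return cosine_similarity(centered_x, centered_y)

# Two hypothetical rating vectors that differ only by a constant offset.
u = [1, 2, 3]
v = [11, 12, 13]
print(cosine_similarity(u, v))   # below 1: the offset changes the angle
print(pearson_similarity(u, v))  # close to 1: the offset is removed by centering
```

Centering is the practical difference: correlation ignores a user who rates everything uniformly higher, while plain cosine does not.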

  7. Theoretical Backgrounds • Eigenvector, Principal Eigenvector
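
The slide only names the concept, so the following is a minimal sketch of one common way to approximate a principal eigenvector: power iteration. It assumes a square matrix with a single dominant eigenvalue and a start vector that is not orthogonal to the dominant direction. The matrix `M` is an arbitrary example.

```python
def principal_eigenvector(matrix, iterations=100):
    """Approximate the dominant eigenpair of a square matrix by power iteration."""
    n = len(matrix)
    v = [1.0 / n] * n  # uniform start vector
    for _ in range(iterations):
        # Multiply by the matrix, then rescale so the entries sum to 1 in absolute value.
        w = [sum(matrix[i][j] * v[j] for j in range(n)) for i in range(n)]
        norm = sum(abs(x) for x in w)
        v = [x / norm for x in w]
    # Rayleigh quotient gives the eigenvalue estimate for the converged vector.
    Av = [sum(matrix[i][j] * v[j] for j in range(n)) for i in range(n)]
    eigenvalue = sum(a * b for a, b in zip(Av, v)) / sum(b * b for b in v)
    return eigenvalue, v

# Eigenvalues of M are 5 and 2; the eigenvector for 5 is proportional to [2, 1].
M = [[4.0, 2.0], [1.0, 3.0]]
eigenvalue, vector = principal_eigenvector(M)
print(eigenvalue, vector)
```

Link-analysis scores on a Web graph are typically computed with this kind of repeated multiply-and-normalize loop, because it only needs matrix-vector products.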

  8. Theoretical Backgrounds • Singular Value Decomposition (SVD)
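
A short sketch of SVD on a toy matrix, assuming NumPy is available (the presentation does not name a library). It factors a matrix `A` as `U @ diag(s) @ Vt` and then keeps only the largest singular value to form a rank-1 approximation.

```python
import numpy as np

# Arbitrary 3x2 example matrix.
A = np.array([[3.0, 0.0],
              [0.0, 2.0],
              [0.0, 0.0]])

# Thin SVD: U is 3x2, s holds the singular values in descending order, Vt is 2x2.
U, s, Vt = np.linalg.svd(A, full_matrices=False)

# Multiplying the factors back together recovers A (up to rounding).
A_rebuilt = U @ np.diag(s) @ Vt

# Keeping only the k largest singular values gives a rank-k approximation.
k = 1
A_rank1 = U[:, :k] @ np.diag(s[:k]) @ Vt[:k, :]

print(s)
print(A_rank1)
```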

  9. Theoretical Backgrounds • Latent Semantic Analysis (LSA)
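
The slide gives no detail, so this is a sketch of LSA in its usual form: a truncated SVD of a term-document matrix, with documents compared in the reduced space. It assumes NumPy, and the four-term, four-document matrix is invented for the example.

```python
import numpy as np

terms = ["web", "mining", "social", "network"]

# Hypothetical term-document counts: rows are terms, columns are documents d0..d3.
X = np.array([
    [2, 1, 0, 0],  # web
    [1, 2, 0, 0],  # mining
    [0, 0, 1, 2],  # social
    [0, 0, 2, 1],  # network
], dtype=float)

U, s, Vt = np.linalg.svd(X, full_matrices=False)

# Keep k latent dimensions; each row of doc_vectors is one document.
k = 2
doc_vectors = (np.diag(s[:k]) @ Vt[:k, :]).T

def cosine(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

print(cosine(X[:, 0], X[:, 1]))                # raw term space: 0.8
print(cosine(doc_vectors[0], doc_vectors[1]))  # latent space: about 1.0
print(cosine(doc_vectors[0], doc_vectors[2]))  # latent space: about 0.0
```

In the latent space the two "web mining" documents collapse onto the same direction even though their raw term counts differ, which is the effect LSA is used for.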

  10. Theoretical Backgrounds • Tensor Expression and Decomposition

  11. Theoretical Backgrounds • Performance measure • Precision • Recall • F-measure
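
The three measures can be sketched as follows, assuming the standard set-based definitions and the balanced F-measure (F1). The item identifiers are placeholders.

```python
def precision_recall_f1(retrieved, relevant):
    """Set-based precision, recall, and balanced F-measure (F1)."""
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1

# 4 results returned, 3 documents are actually relevant, 2 of them were found.
p, r, f1 = precision_recall_f1(["a", "b", "c", "d"], ["a", "c", "e"])
print(p, r, f1)
```

Precision is the fraction of returned items that are relevant, recall is the fraction of relevant items that were returned, and F1 is their harmonic mean.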

  12. Theoretical Backgrounds • Mean Average Precision (MAP) • Discounted cumulative gain (DCG) • Used when relevance is judged on a graded scale
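
A minimal sketch of both ranking measures. MAP is computed as the mean of per-query average precision. For DCG, several variants exist; the one assumed here divides the graded gain at rank i by log2(i + 1). The rankings and gain values are invented.

```python
import math

def average_precision(ranked, relevant):
    """Average of precision@rank taken at each rank where a relevant item appears."""
    relevant = set(relevant)
    hits, total = 0, 0.0
    for rank, item in enumerate(ranked, start=1):
        if item in relevant:
            hits += 1
            total += hits / rank
    return total / len(relevant) if relevant else 0.0

def mean_average_precision(runs):
    """runs is a list of (ranked_list, relevant_items) pairs, one per query."""
    return sum(average_precision(ranked, rel) for ranked, rel in runs) / len(runs)

def dcg(gains):
    """Discounted cumulative gain for graded relevance values in rank order."""
    return sum(g / math.log2(rank + 1) for rank, g in enumerate(gains, start=1))

runs = [
    (["a", "b", "c", "d"], ["a", "c"]),  # AP = (1/1 + 2/3) / 2
    (["x", "y"], ["y"]),                 # AP = 1/2
]
print(mean_average_precision(runs))
print(dcg([3, 2, 0, 1]))
```

MAP treats relevance as binary, whereas DCG rewards placing highly graded items near the top of the list.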

  13. Theoretical Backgrounds • Web Recommendation Evaluation Metrics • Mean Absolute Error (MAE) • Hit Ratio • Weighted Average Visit Percentage
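
Two of these metrics can be sketched under common definitions, which are assumptions here because the slide gives none: MAE as the mean absolute gap between predicted and actual ratings, and hit ratio as the fraction of test cases whose held-out item appears in the recommendation list. Weighted Average Visit Percentage is left out because the transcript does not define it.

```python
def mean_absolute_error(predicted, actual):
    """Average absolute difference between predicted and actual ratings."""
    return sum(abs(p - a) for p, a in zip(predicted, actual)) / len(actual)

def hit_ratio(recommendation_lists, held_out_items):
    """Fraction of test cases whose held-out item is in the recommended list."""
    hits = sum(
        1
        for recommended, target in zip(recommendation_lists, held_out_items)
        if target in recommended
    )
    return hits / len(held_out_items)

# Hypothetical ratings and top-2 recommendation lists for three users.
mae = mean_absolute_error([4, 3, 5], [5, 3, 3])
ratio = hit_ratio([["a", "b"], ["c", "d"], ["e", "f"]], ["b", "x", "e"])
print(mae, ratio)
```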

  14. Theoretical Backgrounds • Basic Metrics of Social Network • Size – number of vertices in the network • Centrality – betweenness, closeness, degree • Density – existing edges / total possible edges in the network • Degree (of network) – number of edges in the network • Betweenness and closeness • Clique – a subset of the network in which every pair of vertices is directly connected
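
The sketch below computes size, density, degree centrality, and closeness centrality for a small undirected graph. The graph is invented, the normalizations (dividing by n - 1) are the conventional ones rather than anything stated on the slide, and the closeness function assumes a connected graph.

```python
from collections import deque

# Hypothetical undirected network given as a set of edges.
edges = {("a", "b"), ("a", "c"), ("b", "c"), ("c", "d")}
nodes = sorted({n for edge in edges for n in edge})

size = len(nodes)                                # number of vertices
possible_edges = size * (size - 1) // 2          # undirected, no self-loops
density = len(edges) / possible_edges            # existing / possible edges

# Degree centrality: share of the other vertices each vertex touches.
degree = {n: sum(n in edge for edge in edges) for n in nodes}
degree_centrality = {n: d / (size - 1) for n, d in degree.items()}

# Adjacency lists for shortest-path computations.
adj = {n: set() for n in nodes}
for u, v in edges:
    adj[u].add(v)
    adj[v].add(u)

def closeness(adj, source):
    """(reachable vertices) / (sum of shortest-path distances), via BFS."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    total = sum(dist.values())
    return (len(dist) - 1) / total if total else 0.0

print(size, density, degree_centrality)
print({n: closeness(adj, n) for n in nodes})
```

In this example the vertices a, b, and c form a clique, since every pair among them is joined by an edge.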

  15. Theoretical Backgrounds • Social Network over the Web • Each web page = social entity, hyperlink = relationship • Centrality – closeness, degree, betweenness • Prestige – a prestigious actor is one that receives many inlinks
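
Treating pages as actors and hyperlinks as directed ties, prestige in its simplest form can be sketched as normalized in-degree. The link list is hypothetical, and dividing by n - 1 is one conventional normalization, not something the slide specifies.

```python
# Hypothetical hyperlinks as (source_page, target_page) pairs.
links = [
    ("p1", "p3"),
    ("p2", "p3"),
    ("p3", "p1"),
    ("p4", "p3"),
    ("p4", "p1"),
]
pages = sorted({page for link in links for page in link})

# Count inlinks per page.
inlinks = {page: 0 for page in pages}
for source, target in links:
    inlinks[target] += 1

# Degree prestige: share of the other pages that link to this page.
prestige = {page: count / (len(pages) - 1) for page, count in inlinks.items()}
most_prestigious = max(prestige, key=prestige.get)

print(prestige)
print(most_prestigious)
```

Unlike degree centrality on an undirected graph, prestige uses link direction: p4 links out twice but receives nothing, so its prestige is zero.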
