A Brief Introduction of PageRank

A Brief Introduction of PageRank. CSE 5243 Author: Y-K Shih Ohio State University Autumn 2012. Background. Besides keywords, what other evidence can be used to rate the importance of a webpage within a set of webpages? Solution: Use the hyperlink structure.

Presentation Transcript


  1. A Brief Introduction of PageRank CSE 5243 Author: Y-K Shih Ohio State University Autumn 2012

  2. Background • Besides keywords, what other evidence can be used to rate the importance of a webpage within a set of webpages? • Solution: Use the hyperlink structure • E.g. a webpage linked to by many webpages is probably important. • But counting links alone is a local measure, not a global (comprehensive) one. • PageRank was developed by Larry Page in 1998.

  3. Idea • A graph represents the WWW • Node: webpage • Directed edge: hyperlink • A user surfs the WWW by randomly clicking hyperlinks. • The probability that the user stops at a particular webpage is that page's PageRank value. • A node that is linked to by many nodes with high PageRank values receives a high rank itself; if no nodes link to it, the page has no support.
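The random-surfer idea can be illustrated with a small simulation. This is a minimal sketch, not the slides' own method: the four-page toy graph, the function name `simulate_surfer`, the starting page, and the fixed random seed are all assumptions made for illustration, and every page is assumed to have at least one outgoing link.

```python
import random

# Hypothetical toy web: each page maps to the pages it links to.
# Every page has at least one outgoing link; page "D" has no incoming links.
links = {
    "A": ["B", "C"],
    "B": ["C"],
    "C": ["A"],
    "D": ["C"],
}

def simulate_surfer(links, start, steps, seed=0):
    """Follow random hyperlinks and return the fraction of steps spent on each page."""
    rng = random.Random(seed)  # fixed seed so repeated runs give the same result
    visits = {page: 0 for page in links}
    page = start
    for _ in range(steps):
        page = rng.choice(links[page])  # click one outgoing link uniformly at random
        visits[page] += 1
    return {p: count / steps for p, count in visits.items()}

freq = simulate_surfer(links, start="A", steps=100_000)
for page in sorted(freq):
    print(page, round(freq[page], 3))
```

In this toy graph the surfer never reaches D, since nothing links to it, which matches the remark that a page with no incoming links has no support.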

  4. A simple version • u: a webpage • B_u: the set of u's backlinks (pages that link to u) • N_v: the number of forward links of page v • Initially, R(u) is 1/N for every webpage, where N is the total number of webpages • Iteratively update each webpage's PR value until convergence.
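The update rule itself did not survive in the transcript. A minimal sketch, assuming the usual simplified rule in which R(u) is the sum of R(v) / N_v over all v in B_u, could look like the following. The toy graph, the function name `simple_pagerank`, and the stopping tolerance are hypothetical, and every page is assumed to have at least one forward link.

```python
# Hypothetical toy web: each page maps to its forward links.
links = {
    "A": ["B", "C"],
    "B": ["C"],
    "C": ["A"],
    "D": ["C"],
}

def simple_pagerank(links, tol=1e-10, max_iter=1000):
    """Iterate R(u) = sum of R(v) / N_v over backlinks v of u (assumed rule)."""
    pages = list(links)
    n = len(pages)
    rank = {u: 1.0 / n for u in pages}  # initially R(u) = 1/N

    # B_u: for each page u, the pages v that link to u
    backlinks = {u: [] for u in pages}
    for v, targets in links.items():
        for u in targets:
            backlinks[u].append(v)

    for _ in range(max_iter):
        new_rank = {
            u: sum(rank[v] / len(links[v]) for v in backlinks[u])
            for u in pages
        }
        # stop once no value changes by more than tol
        if max(abs(new_rank[u] - rank[u]) for u in pages) < tol:
            return new_rank
        rank = new_rank
    return rank

rank = simple_pagerank(links)
for page in sorted(rank):
    print(page, round(rank[page], 4))
```

Page D has no backlinks, so its value drops to zero after the first update, while the remaining rank is shared among A, B, and C.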

  5. Example 1 PageRank Calculation: first iteration

  6. Example 1 PageRank Calculation: second iteration

  7. Example 1 Convergence after some iterations

  8. A slightly more advanced version • Add a damping factor d • Imagine that a surfer stops clicking hyperlinks with probability 1-d • R(u) is at least (1-d)/(N-1) • N is the total number of nodes.
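The damped formula is likewise missing from the transcript. The sketch below assumes the widely used form R(u) = (1-d)/N + d * (sum of R(v) / N_v over v in B_u), which may not be the exact variant on the slide. Under this assumed form the floor on each value is (1-d)/N. The value d = 0.85, the toy graph, and the function name `damped_pagerank` are also assumptions, and every page is assumed to have at least one forward link.

```python
# Hypothetical toy web: each page maps to its forward links.
links = {
    "A": ["B", "C"],
    "B": ["C"],
    "C": ["A"],
    "D": ["C"],
}

def damped_pagerank(links, d=0.85, tol=1e-10, max_iter=1000):
    """Iterate R(u) = (1 - d)/N + d * sum of R(v)/N_v over backlinks v (assumed form)."""
    pages = list(links)
    n = len(pages)
    rank = {u: 1.0 / n for u in pages}  # initially R(u) = 1/N

    # B_u: for each page u, the pages v that link to u
    backlinks = {u: [] for u in pages}
    for v, targets in links.items():
        for u in targets:
            backlinks[u].append(v)

    for _ in range(max_iter):
        new_rank = {
            u: (1.0 - d) / n
            + d * sum(rank[v] / len(links[v]) for v in backlinks[u])
            for u in pages
        }
        # stop once no value changes by more than tol
        if max(abs(new_rank[u] - rank[u]) for u in pages) < tol:
            return new_rank
        rank = new_rank
    return rank

rank = damped_pagerank(links)
for page in sorted(rank):
    print(page, round(rank[page], 4))
```

With damping, page D keeps a small nonzero value even though nothing links to it, because the (1-d)/N term is added to every page regardless of its backlinks.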

  9. Other applications • Social networks (Facebook, Twitter, etc.) • Node: person; Edge: follower / followee / friend • Higher PR values: celebrities • Citation networks • Node: paper; Edge: citation • Higher PR values: important papers • Protein-protein interaction networks • Node: protein; Edge: two proteins bind together • Higher PR values: essential proteins.
