1 / 5

It444 Project

It444 Project. Project. Project. Indexer:

ion
Download Presentation

It444 Project

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. It444 Project

  2. Project

  3. Project Indexer: Web crawler gives the indexer the full text of the pages it finds. Break up text to words stored in index database. With each index entry storing a list of documents in which the term appears and the location within the text where it occurs. This data structure allows rapid access to documents that contain user query terms. To improve search performance ignore (doesn’t index)  stop words (such as the, is, on, or, of, how, why, as well as certain single digits and single letters).

  4. Project Query processor: The query processor has several parts, including the user interface (search box), the “engine” that evaluates queries and matches them to relevant documents, and the results formatter. Ranker: a page with a higher Page Rank is deemed more important and is more likely to be listed above a page with a lower Page Rank. Some factors in computing a Page Rank : the popularity of the page, the position and size of the search terms within the page, frequent of word in document.

  5. Reference http://www.googleguide.com/google_works.html

More Related