
It444 Project




Project

Indexer:

The web crawler gives the indexer the full text of the pages it finds. The indexer breaks the text into words and stores them in an index database, with each index entry storing a list of the documents in which the term appears and the location within the text where it occurs. This data structure allows rapid access to documents that contain the user's query terms.
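As a rough illustration of the data structure described above, the following sketch builds a small positional index in memory. The function name build_index, the tokenizing rule, and the sample documents are assumptions made for this example; a real index database would persist its entries rather than hold them in a dictionary.

```python
import re
from collections import defaultdict


def build_index(docs):
    """Map each term to {doc_id: [word positions]} (hypothetical helper)."""
    index = defaultdict(dict)
    for doc_id, text in docs.items():
        # Assumed tokenizer: lowercase runs of letters and digits.
        words = re.findall(r"[a-z0-9]+", text.lower())
        for pos, word in enumerate(words):
            index[word].setdefault(doc_id, []).append(pos)
    return dict(index)


# Made-up sample pages standing in for crawler output.
docs = {
    "d1": "the crawler finds pages",
    "d2": "the indexer stores pages and words",
}
index = build_index(docs)
print(index["pages"])
```

Looking up a term is then a single dictionary access that returns every document containing it together with the word positions, which is what makes query-time access fast.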

To improve search performance, the indexer ignores (does not index) stop words such as the, is, on, or, of, how, and why, as well as certain single digits and single letters.
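Stop-word filtering can be sketched as a step applied before terms are indexed. The stop-word set below contains only the words listed above; the name indexable_terms is hypothetical, and dropping every single-character token is a simplifying assumption, since the slide says only "certain" single digits and letters are ignored.

```python
import re

# Only the stop words named in the text; a real list would be longer.
STOP_WORDS = {"the", "is", "on", "or", "of", "how", "why"}


def indexable_terms(text, stop_words=STOP_WORDS):
    """Return the words of `text` that would be indexed (hypothetical helper)."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    # Assumption: every one-character token is skipped, not just "certain" ones.
    return [w for w in words if w not in stop_words and len(w) > 1]


terms = indexable_terms("How is the index of a page built")
print(terms)
```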


Project

Query processor:

The query processor has several parts, including the user interface (search box), the “engine” that evaluates queries and matches them to relevant documents, and the results formatter.
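The three parts can be pictured as three small functions: one that reads the search box input, one that evaluates the query against an index, and one that formats the results. This is only a sketch; the function names, the sample index, and the choice to require every query term in a document (AND matching) are assumptions, not something the source specifies.

```python
def parse_query(raw):
    """User interface step: turn search box text into lowercase terms."""
    return raw.lower().split()


def evaluate(terms, index):
    """Engine step: ids of documents containing every term (assumed AND matching)."""
    if not terms:
        return set()
    postings = [set(index.get(t, {})) for t in terms]
    return set.intersection(*postings)


def format_results(doc_ids, titles):
    """Results formatter step: numbered list of titles."""
    return [f"{i}. {titles[d]}" for i, d in enumerate(sorted(doc_ids), start=1)]


# Made-up index in the form term -> {doc_id: [positions]}.
index = {
    "web": {"d1": [0], "d2": [3]},
    "crawler": {"d1": [1]},
    "index": {"d2": [0]},
}
titles = {"d1": "Web crawler basics", "d2": "Index of the web"}

hits = evaluate(parse_query("Web crawler"), index)
lines = format_results(hits, titles)
print(lines)
```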

Ranker: a page with a higher PageRank is deemed more important and is more likely to be listed above a page with a lower PageRank.

Some factors in computing a PageRank: the popularity of the page, the position and size of the search terms within the page, and the frequency of the word in the document.
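To make the idea of combining factors concrete, here is a toy scoring sketch. It is not Google's formula: the weights, the field names, and the way position and frequency are normalized are all assumptions chosen for illustration, and the size of the search terms is not modeled at all.

```python
def score(doc, term, w_pop=1.0, w_freq=0.5, w_pos=0.5):
    """Toy score from popularity, term frequency and term position.

    All weights are hypothetical; documents without the term score 0.
    """
    positions = doc["positions"].get(term, [])
    if not positions:
        return 0.0
    frequency = len(positions) / doc["length"]        # share of words that match
    earliness = 1.0 - positions[0] / doc["length"]    # earlier first hit scores higher
    return w_pop * doc["popularity"] + w_freq * frequency + w_pos * earliness


def rank(docs, term):
    """Return ids of matching documents, highest score first."""
    scored = [(score(d, term), doc_id) for doc_id, d in docs.items()]
    return [doc_id for s, doc_id in sorted(scored, reverse=True) if s > 0]


# Made-up documents; "popularity" stands in for a link-based measure.
docs = {
    "d1": {"popularity": 0.2, "length": 10, "positions": {"index": [0, 4]}},
    "d2": {"popularity": 0.9, "length": 10, "positions": {"index": [7]}},
    "d3": {"popularity": 0.5, "length": 10, "positions": {}},
}
ranking = rank(docs, "index")
print(ranking)
```

With these made-up numbers the more popular page d2 is listed above d1 even though d1 mentions the term earlier and more often, which mirrors the statement that a higher-ranked page is more likely to be listed first.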


Reference

http://www.googleguide.com/google_works.html

