It444 Project. Indexer:
The web crawler gives the indexer the full text of the pages it finds. The indexer breaks the text into words and stores them in the index database. Each index entry stores a list of the documents in which the term appears and the locations within the text where it occurs. This data structure allows rapid access to the documents that contain the user's query terms.
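The index structure described above is commonly called an inverted index. The following is a minimal sketch of one, assuming the crawler output is available as a dictionary of page id to text. The names `build_index` and `docs`, and the simple tokenizer, are illustrative assumptions and not part of the project.

```python
import re
from collections import defaultdict


def build_index(documents):
    """Map each term to {doc_id: [word positions where the term occurs]}."""
    index = defaultdict(dict)
    for doc_id, text in documents.items():
        # Assumed tokenizer: lowercase runs of letters and digits.
        words = re.findall(r"[a-z0-9]+", text.lower())
        for position, word in enumerate(words):
            index[word].setdefault(doc_id, []).append(position)
    return index


# Hypothetical crawler output: page id -> full text.
docs = {
    "page1": "The crawler finds pages",
    "page2": "The indexer stores pages and words",
}
index = build_index(docs)
print(index["pages"])
```

Looking up a term is a single dictionary access, which is why this layout gives fast access to the documents that contain a query term.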
To improve search performance, the indexer ignores (does not index) stop words such as the, is, on, or, of, how, and why, as well as certain single digits and single letters.
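Stop-word filtering can be sketched as a check applied before a word enters the index. The stop list below contains only the words named on the slide. Treating every single-character token as a stop word is an assumption, since the slide says only "certain" digits and letters. `index_terms` and `is_stop_word` are hypothetical helper names.

```python
import re

# Stop words named in the text; a real list would be longer.
STOP_WORDS = {"the", "is", "on", "or", "of", "how", "why"}


def is_stop_word(word):
    # Assumption: all single letters and single digits are skipped.
    return word in STOP_WORDS or len(word) == 1


def index_terms(text):
    """Return (position, word) pairs that should be indexed."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    # Positions are counted before filtering, so they still refer
    # to the word's location in the original text.
    return [(pos, w) for pos, w in enumerate(words) if not is_stop_word(w)]


print(index_terms("Why is the crawler on a page"))
```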
The query processor has several parts, including the user interface (search box), the “engine” that evaluates queries and matches them to relevant documents, and the results formatter.
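The three parts can be sketched as three small steps: read the query, evaluate it against the index, and format the matches. This sketch assumes an index shaped as term to {document: positions}, and it assumes a page matches only if it contains every query term. The slide does not specify either. `evaluate`, `format_results`, and `sample_index` are illustrative names.

```python
def evaluate(query, index):
    """Return the set of documents that contain every query term."""
    terms = query.lower().split()
    if not terms:
        return set()
    # One set of document ids per term; unknown terms match nothing.
    postings = [set(index.get(term, {})) for term in terms]
    return set.intersection(*postings)


def format_results(doc_ids):
    """Turn matching document ids into numbered result lines."""
    return [f"{n}. {doc_id}" for n, doc_id in enumerate(sorted(doc_ids), start=1)]


# Hypothetical index: term -> {doc_id: [positions]}.
sample_index = {
    "search": {"page1": [0], "page2": [4]},
    "engine": {"page2": [5]},
}

matches = evaluate("search engine", sample_index)  # the search box input
print(format_results(matches))
```

Here the results are listed alphabetically. In the full system the ranker would decide the order.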
Ranker: a page with a higher Page Rank is deemed more important and is more likely to be listed above a page with a lower Page Rank.
Some factors in computing a Page Rank: the popularity of the page, the position and size of the search terms within the page, and the frequency of the terms in the document.
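To illustrate how such factors could be combined, here is a toy scoring function. It is not the actual Page Rank computation. It simply adds a popularity value, the term frequency, and a bonus for an early first occurrence. The equal weighting, the `popularity` field, and the sample pages are invented for the example, and the size of the search terms is left out.

```python
def score(page, term):
    """Toy relevance score; higher means listed earlier."""
    words = page["text"].lower().split()
    if term not in words:
        return 0.0
    frequency = words.count(term) / len(words)          # how often the term occurs
    position_bonus = 1.0 / (1 + words.index(term))      # earlier occurrence scores higher
    return page["popularity"] + frequency + position_bonus


def rank(pages, term):
    """Return ids of matching pages, best score first."""
    scored = [(score(p, term), p["id"]) for p in pages]
    return [pid for s, pid in sorted(scored, reverse=True) if s > 0]


# Hypothetical pages with an assumed popularity value between 0 and 1.
pages = [
    {"id": "a", "popularity": 0.2, "text": "search tips and search tricks"},
    {"id": "b", "popularity": 0.9, "text": "a guide to search"},
    {"id": "c", "popularity": 0.5, "text": "cooking recipes"},
]
print(rank(pages, "search"))
```

In this example page "a" is listed above the more popular page "b" because the term appears twice and at the very start of the text.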