180 likes | 308 Views
This paper presents a novel approach to improving web search results by introducing two key metrics: diversity and information richness, alongside a new ranking scheme known as Affinity Ranking. The study highlights the challenges of ambiguous queries and the lack of user information needs, exemplified by questions like “足球” (football). Experiments demonstrate that incorporating Affinity Ranking enhances the variety of topics and the informativeness of search results, particularly within the top search listings. The findings suggest significant improvements in user satisfaction and search performance.
E N D
Improving Web Search Results Using Affinity Graph Advisor : Dr. Hsu Presenter : Jia-Hao Yang Author :Benyu Zhang , Hua Li , Yi Liu , Wensi Xi , Weiguo Fan SIGIR
Outline • Motivation • Objective • Definition • Methods (Affinity Ranking) • Experiments • Conclusion • Opinion SIGIR
Motivation • situation • Many of the queries are ambiguous. • the user’s information needs are unknown. • Ex : “足球” , 是只想要足球還是要找足球賽 • In traditional, precision and recall are two metrics, but these didn’t consider the content of documents. • Hyperlink SIGIR
Objective • Two metrics, diversity and information richness, have been proposed to improve this problem. • Re-ranking the top search results to satisfy the user’s information needs. SIGIR
Definition • Diversity measures the variety of topics in a group of documents. • Information richness measures how many different topics a single document contains. SIGIR
Methods • AG : According to vector space model, each document can be represented , • If we consider documents as nodes, the document collection can be modeled as a graph by generating the link between documents. d2 d3 d1 d4 SIGIR d5 d6
Methods(cont.) • Information richness : • 1st • 2nd SIGIR
Methods(cont.) • Diversity penalty : • 1st : • 2nd • 3rd , • 4th • 5th 2nd • Re-ranking : • The score-combination scheme uses a linear combination of two parts: • The rank-combination scheme of re-ranking uses a linear combination of the ranks based on full-text search and Affinity Ranking : SIGIR
Experiments (In Yahoo & ODP) • Affinity Ranking vs. K-Means Clustering SIGIR
Experiments (cont.) SIGIR
Experiments (cont.) SIGIR
Experiments (In Newsgroup) • Improve in Top 10 Search Results : • As the top 10 search results always receive the most attention of end-users, we show how Affinity Ranking affects the top 10 search results from the newsgroup data set. SIGIR
Experiments (cont.) • Improve within Top 50 Search Results SIGIR
Experiments (cont.) SIGIR
Experiments (α & β) SIGIR
A Case Study • Outlook print error : SIGIR
Conclusion • This paper proposed two new metrics, diversity and information richness, and a novel ranking scheme, Affinity Ranking, to measure the search performance. • By presenting wider topic coverage and more highly informative results in each topic in the top results, this method can effectively improve the search performance. SIGIR
Opinion • Future work : scaling the AR computation, to the Web scale. SIGIR