Web People Search via Connection Analysis. Dmitri V. Kalashnikov, Zhaoqi (Stella) Chen, Sharad Mehrotra, Member, IEEE, and Rabia Nuray-Turan. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 20, NO. 11, NOVEMBER 2008. Advisors: Prof. 陳彥良, Prof. 許秉瑜. Presenters: 楊詠喬, 龍晶珠. Introduction (1/2).
retrieves a fixed number (top K) of relevant pages
-- compute TF/IDF
-- extraction of Named entities (NEs)
and Web-related information
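The TF/IDF step above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function names `tf_idf_vectors` and `cosine`, the use of relative term frequency with a natural-log IDF, and the toy pages are all assumptions made for the example.

```python
import math
from collections import Counter


def tf_idf_vectors(docs):
    """Return one {term: weight} dict per tokenized document.

    Assumed weighting: relative term frequency times log(N / df).
    """
    n = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        vectors.append(
            {t: (c / len(doc)) * math.log(n / df[t]) for t, c in tf.items()}
        )
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse {term: weight} vectors."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


# Hypothetical pages returned for the query "smith".
pages = [
    ["database", "query", "smith"],
    ["database", "index", "smith"],
    ["guitar", "concert", "smith"],
]
vectors = tf_idf_vectors(pages)
```

Because the queried name appears on every page, its IDF is zero and it does not contribute to the similarity; only the remaining terms separate the pages.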
the entity-relationship (ER) graph
(3) Web page ranking
focus on developing and learning a new
c(u,v) can help design a better similarity function s(u,v)
s(u,v) labels data with the threshold τ and the δ-band approach
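One possible reading of threshold-plus-band labeling is sketched below. The slides do not define the δ-band, so the interpretation here is an assumption: pairs whose score lies within δ of τ are left undecided, and only pairs clearly above or below the band receive a label. The function name `label_pair` and the label strings are hypothetical.

```python
def label_pair(s, tau, delta):
    """Label a page pair from its similarity score s(u,v).

    Assumed rule (not taken from the slides): scores at least delta above
    tau mean "same person", scores at least delta below tau mean
    "different people", and anything inside the band stays undecided.
    """
    if s >= tau + delta:
        return "merge"
    if s <= tau - delta:
        return "separate"
    return "uncertain"
```

With τ = 0.5 and δ = 0.1, a score of 0.9 is labeled "merge", 0.2 is labeled "separate", and 0.55 falls inside the band and stays "uncertain".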
- Used to compute the feature-based similarity f(u,v)
The value of w-( ) is chosen to be 0 when its argument is below a certain threshold and 1 when its argument is above it. The threshold itself is learned from the data.
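A step-shaped weight with a learned cut point could look like the sketch below. The slides do not say how the threshold is learned, so `learn_threshold` is a hypothetical procedure chosen for illustration: it tries the midpoints between consecutive observed values and keeps the one that classifies the most labeled examples correctly.

```python
def step_weight(x, threshold):
    """Return 1 when x is above the threshold, otherwise 0."""
    return 1 if x > threshold else 0


def learn_threshold(values, labels):
    """Pick a cut point from labeled data (hypothetical learning rule).

    values: observed scores; labels: 0 or 1 for each score.
    """
    distinct = sorted(set(values))
    if len(distinct) < 2:
        raise ValueError("need at least two distinct values to place a cut")
    # Candidate cuts are the midpoints between consecutive distinct values.
    cuts = [(a + b) / 2 for a, b in zip(distinct, distinct[1:])]

    def correct(cut):
        # Number of examples the step function labels correctly at this cut.
        return sum(step_weight(v, cut) == y for v, y in zip(values, labels))

    return max(cuts, key=correct)
```

For example, with scores `[0.1, 0.2, 0.7, 0.9]` labeled `[0, 0, 1, 1]`, the selected cut lies between 0.2 and 0.7.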
The remaining pages are displayed in order of their affinity to the selected cluster.
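The ordering step can be sketched as below. The slides do not define affinity, so this example assumes it is the average similarity between a page and the members of the selected cluster; `order_by_affinity` and the `sim` argument are hypothetical names.

```python
def order_by_affinity(selected_cluster, other_pages, sim):
    """Sort other_pages by decreasing affinity to selected_cluster.

    Affinity is assumed here to be the mean of sim(page, member) over the
    cluster members; sim is any pairwise similarity function.
    """
    if not selected_cluster:
        raise ValueError("selected cluster must contain at least one page")

    def affinity(page):
        total = sum(sim(page, member) for member in selected_cluster)
        return total / len(selected_cluster)

    return sorted(other_pages, key=affinity, reverse=True)
```

Any pairwise similarity can be plugged in as `sim`, for instance a Jaccard overlap of page terms or a learned s(u,v).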
Web people search
2. Middleware approach (✓)
Testing disambiguation quality: Experiment 1 (disambiguation quality: overall)
Testing disambiguation quality: Experiment 2 (disambiguation quality: group identification)
Testing disambiguation quality: Experiment 3 (disambiguation quality: queries with context)
Testing disambiguation quality: Experiment 4 (quality of generating cluster sketches)
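1. Because NEs are extracted with third-party tools (NE extractor, GATE), the initial download and preprocessing take 3.82 seconds per web page.
2. With a server-side approach, the preprocessing can be done offline in advance.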