80 likes | 156 Views
Develop an application that identifies people across various websites. Discover detailed information about individuals through integrated search results for improved user experience. Utilize existing solutions programmed in Java for web applications, actively utilizing data generated from search engines. Implementation involves Google result analysis, keyword identification, and easy access through Java and Apache Tomcat.
E N D
Bc. Anton Balucha http://www.tonyb.sk/ Identificationofpeople • AssignmentfromsubjectInformationRetrieval
searchengineresults • manyinformationaboutmanypeople • strewed, notintegrated Anton Balucha - Identification of people Motivation
createanapplication, whichidentifyoccurenceof person on various web sites Anton Balucha - Identification of people Task
http://www.pipl.com– (easy to use , transparent list ofresults) • http://www.zabasearch.com (searchpeopleonly in USA) • http://www.wink.com (searchpeople on socialnetworks) • http://www.people.yahoo.com(searchpeoplewithsomeenteredparameters – mane, surname, town, state, e-mail) • https://addons.mozilla.org/sk/firefox/addon/3167 (pluginintoFirefoxbrowser) • http://www.peoplesearch.com (searchpeopleonly in USA in entered state) • http://www.peekyou.com(searchpeople on variousportals - Google+, Wikipedia, LinkedIn, Flickr, Twitter) • http://www.123people.com(searchpeople on variousportals - Google+, Wikipedia, LinkedIn, Flickr, Twitter) • http://www.bestpeoplesearch.com (searchpeopleonly in USA in entered state, possibility to hire person forsearching) Anton Balucha - Identification of people Existingsolutions
programmed in Java • web application • availablefrom z http://www.tonyb.sk/ • no staticdata • activeusingofresultsfromsearchengines Anton Balucha - Identification of people Desctioptionofsolution - architectural
Anton Balucha - Identification of people descriptionofsolution - implementation Google results web pages removediacritics remove stop words remove HTML stemming TF-IDF identifykeywords show results identifykeywords identifykeywords
intallationofJava intallationofApacheTomcat deploy externalapplications access to the Internet access to theapplication Anton Balucha - Identification of people InstallataionofApplication
[1] Michal Laclavík, Martin Šeleng: Vyhľadávanie informácií. Vyhľadávanie informácií. Dostupné na <http://vi.ikt.ui.sav.sk/> (11.12.2011) [2] PorterStemmer. Dostupné na <http://tartarus.org/martin/PorterStemmer/> (11.12.2011) Anton Balucha - Identification of people UsedLiterature