1 / 22

Tour the World: building a web-scale and landmark recognition engine

ICCV 2009. Tour the World: building a web-scale and landmark recognition engine. Yan-Tao Zheng1 , Ming Zhao2, Yang Song2, Hartwig Adam2 Ulrich Buddemeier2, Alessandro Bissacco2, Fernando Brucher2 Tat- Seng Chua1 , and Hartmut Neven2

dylan
Download Presentation

Tour the World: building a web-scale and landmark recognition engine

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. ICCV 2009 Tour the World: building a web-scale and landmark recognition engine Yan-Tao Zheng1, Ming Zhao2, Yang Song2, Hartwig Adam2 Ulrich Buddemeier2, Alessandro Bissacco2, Fernando Brucher2 Tat-Seng Chua1, and Hartmut Neven2 1 NUS Graduate Sch. for Integrative Sciences and Engineering, National University of Singapore, Singapore 2 Google Inc., U.S.A

  2. outline • Introduction • Approach ( Framework) • Experiments • Conclusion (Future Work)

  3. Introduction What is the motivation ? With the vast amount of landmark multimedia data on the web

  4. Introduction • Application • Provide clean landmark images for building virtualtourism of a large number of landmarks • Facilitate both content understanding and geo-location detection of images and video • Provide tour guide recommendation and visualization

  5. Introduction • Issuemust be tackled • No readily available list of landmarks in the world • Explore two source : • (1) geographically calibrated images in photo sharing websites • (2)travel guide articles from websites • Even if , it’s still challenging to collect true landmark image • Download landmark images from two sources: • (1)photo sharing websites (2)Google Image Search • Efficiency is a challenge for a large-scale system • Accomplish by three means: • (1)parallel computing (2)efficient clustering algo. • (3)efficient image matching by k-d tree indexing http://www.panoramio.com/

  6. Approachframework

  7. Approachframework Learning landmarks from GPS-tagged photos >> Perform the agglomerative hierarchical clustering on the photo’s GPS coordinates >> Validation criterion is unique number of authors of photos is larger than a threshold

  8. Approachframework Learning landmarks from travel guide articles >> with the hierarchy, we can extract city names from country in six continents >> satisfy following criteria, text is deemed to be a landmark candidate http://wikitravel.org/en/Taipei Set of images

  9. Approachframework Learning landmarks from travel guide articles >> Validating landmarks (1) if it is too long or most of its words are not capitalized (2) the number of unique authors of images in the cluster >> which reflects the popular appeal of landmarks Set of images

  10. Approachdiscover landmarks in the world Most of users are located in Europe and North America !!

  11. Approachlearning of landmark images Object matching based on local features • Detect interest point >>LoG filters [11] • Local descriptor >> SIFT [9] • Reduce the feature dimensionality to 40 >> Principle Component Analysis (PCA) [2] • The match interest points of two images are verified >> affine transformation [9]

  12. Approachmatch score • Match score which is the probability of a false positive By using a cumulative binomial distribution Can be estimated by Bayes Theorem [2]

  13. Approachmatch region Classified into two types: match edge and region overlap edge ---- Match edge ---- Region overlap edge

  14. ApproachGraph clustering Do not have a priori knowledge of the # of clusters >> k-means are unsuitable >> exploit the hierarchical agglomerative clustering [2] The distance of region

  15. ApproachCleaning visual model • Photographic v.s non-photographic image classifier • Based on Adaboost algorithm over low level visual features of color histogram and hough transform. • Adopt a multi-view face detector[15]

  16. Approachefficiency issues • Make efficiency essential in two aspects: (1) the landmark image mining (2) landmark recognition of query images • Achieve efficiency in three measures: • Parallel computing to mine true landmark images • Efficiency in hierarchical clustering • Indexing local feature for matching • Use k-d tree[1] ~0.2 sec in a P4 computer

  17. Experiments 174 landmarks are found to be common in both lists >> land mark is a perceptional and cognitive concept

  18. Experiments • Evaluation of landmark image mining • 1000 visual clusters are randomly selected 68 of them are found to be negative outliers (0.68%) • The classifier is trained based on ~5000 photographic and non-photographic images , while the face detector is base on [15] • After cleaning , cluster rate drops to 0.37%

  19. Experiments Evaluation of landmark recognitioin Positive testing 728 images from 124 randomly selected landmarks Negative testing Caltech-256 [5] Pascal VOC 07 [3]

  20. Experiments Recognition : local feature matching of query image against model images, NN principle A match is found when the match score is larger than the threshold Recognition accuracy: 80.8% fairly satisfactory Image content analysis and geo-location detection: 46.3% moderately satisfactory

  21. Conclusionfuture work • Conclusion • Build a world-scale landmark recognition engine • Utilize ~21.4M images to build up landmark visual model • Incorporates 5312 landmarks from 1259 cities in 144 countries • Future work • Multi-lingual aspect of landmark engine >> help to discover more landmarks and collect more clean landmark images in their native languages in the Internet

  22. Related work Thank You !! Related Work 3D visualization of landmarks http://www.cs.cornell.edu/~snavely/

More Related