1 / 33

Building Rome in a Day

Building Rome in a Day. Sameer Agarwal1 Noah Snavely2 Ian Simon1 Steven M. Seitz1 Richard Szeliski3 1University of Washington 2Cornell University 3Microsoft Research. Outline. 1. Introduction 2. System Design 3. Result 4. Conclusion. Introduction.

mikko
Download Presentation

Building Rome in a Day

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Building Rome in a Day Sameer Agarwal1 Noah Snavely2 Ian Simon1 Steven M. Seitz1 Richard Szeliski3 1University of Washington 2Cornell University 3Microsoft Research

  2. Outline • 1. Introduction • 2. System Design • 3. Result • 4. Conclusion

  3. Introduction • Entering the search term “Rome” on flickrreturns more than two million photographs. • 3D reconstruction • in Google Earth and Microsoft’s Virtual Earth

  4. Exploring Photo Collection in 3D

  5. Outline • 1. Introduction • 2. System Design • 1.pre-processing & feature extraction • 2.matching • 3.geometric estimation • 3. Result • 4. Conclusion

  6. Scene reconstruction • Automatically estimate • position, orientation, and focal length of cameras • 3D positions of feature points

  7. Feature detection Detect features using SIFT [Lowe, IJCV 2004]

  8. Feature detection Detect features using SIFT [Lowe, IJCV 2004]

  9. Feature detection Detect features using SIFT [Lowe, IJCV 2004]

  10. Feature matching Match features between each pair of images approximate nearest neighbor matching

  11. Feature matching Refine matching using RANSAC [Fischler & Bolles 1987] to estimate fundamental matrices between pairs

  12. Structure from motion structure for motion: automatic recovery of camera motion and scene structure from two or more images. It is a self calibration technique and called automatic camera tracking or match moving. Unknown camera viewpoints

  13. p4 minimize p1 p3 f(R,T,P) p2 p5 p7 p6 Structure from motion rotations R, positions t, and 3D point locations P that minimize sum of squared reprojection errors f Camera 1 Camera 3 R1,t1 R3,t3 Camera 2 R2,t2

  14. Incremental structure from motion

  15. Incremental structure from motion

  16. Incremental structure from motion

  17. Vocabulary trees (Nister & Stewenius, 2006) • Computational efficiency • k-means tree is used to quantize the feature descriptors

  18. TF-IDF(term frequency–inverse document frequency) • Consider a document containing 100 words wherein the word cow appears 3 times. • (TF) = (3 / 100) = 0.03. • Assume we have 10 million documents and cow appears in one thousand of these. • (IDF) = log(10 000 000 / 1 000) = 4.

  19. TF-IDF score is the product of these quantities: 0.03 × 4 = 0.12 • The word is important if the TF-IDF score is large 某一特定文件內的高詞語頻率,以及該詞語在整個文件集合中的低文件頻率,可以產生出高權重的TF-IDF。因此,TF-IDF傾向於過濾掉常見的詞語,保留重要的詞語。

  20. Query expansion • Large-scale image matching • Better approach: use bag-of-words technique to find likely matches • For each image, find the top M scoring other images, do detailed SIFT matching with those

  21. Outline • 1. Introduction • 2. System Design • 3. Result • 4. Conclusion

  22. Matching and reconstruction statics for the three sets

  23. Building Rome in a Day St. Peter’s Basilica Colosseum Trevi Fountain Rome, Italy. Reconstructed 150,000 in 21 hours on 496 machines

  24. Dubrovnik, Croatia. 4,619 images (out of an initial 57,845). Total reconstruction time: 23 hours Number of cores: 352

  25. Dubrovnik Dubrovnik, Croatia. 4,619 images (out of an initial 57,845). Total reconstruction time: 23 hours Number of cores: 352

  26. San Marco Square San Marco Square and environs, Venice. 14,079 photos, out of an initial 250,000. Total reconstruction time: 3 days. Number of cores: 496.

  27. Outline • 1. Introduction • 2. System Design • 3. Result • 4. Conclusion

  28. Conclusion • Our experimental results demonstrate that it is now possible to reconstruct cities consisting of 150K images in less than a day on a cluster with 500 compute cores. Large-scale image matching 3D models http://grail.cs.washington.edu/rome/ http://phototour.cs.washington.edu/applet/index.html

More Related