1 / 44

875: Recent A dvances in Geometric C omputer V ision & Recognition

875: Recent A dvances in Geometric C omputer V ision & Recognition. Jan-Michael Frahm Spring 2014. Introductions. Grade Requirements. Presentation of 2 papers in class 30 min talk, 10 min questions Papers for selection must come from: top journals: IJCV, PAMI, CVIU, IVCJ

zarita
Download Presentation

875: Recent A dvances in Geometric C omputer V ision & Recognition

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. 875: Recent Advances in Geometric Computer Vision & Recognition Jan-Michael Frahm Spring 2014

  2. Introductions

  3. Grade Requirements • Presentation of 2 papers in class • 30 min talk, • 10 min questions • Papers for selection must come from: • top journals: IJCV, PAMI, CVIU, IVCJ • top conferences: CVPR (2010,2011), ICCV (2011), ECCV (2010), • approval for all other venues is needed • Final project • evaluation, extension of a recent method from the above

  4. Grading • 20% first presentation • 20% second presentation • 30% final project • 30% attendance & class participation

  5. Schedule • Jan. 7th, Introduction • Jan 7th, Uncertainty in Stereo (guest PhilipposMordohai) (substitute for Jan 13th class) • Jan 15th , Large-scale image localization basic concepts, First paper selection (Large –scale localization) • Jan 20th, MLK holiday no class • Jan 22nd-29th, Large-scale localization basic concepts • Feb. 3rd, 1. round of presentations starts • Mar. 10th, 12th Spring break (no class) • Mar. 17th, Modeling dynamic objects/scenes basic concepts, Second paper selection, final project definition • Mar. 19st,Modeling dynamic objects • Mar. 24th, 2. round of presentations starts • Apr. 21st, 23rd , final project presentation

  6. How to give a great presentation • Structure of the talk: • Motivation (motivate and explain the problem) • Overview • Related work (short concise discussion) • Approach • Experiments • Conclusion and future work

  7. How to give a great presentation • Use large enough fonts • 5-6 one line bullet items on a slide max • Keep it simple • No complex formulas in your talk • Bad Powerpoint slides • How to for presentations

  8. How to give a great presentation • Abstract the material of the talk • provide understanding beyond details • Use pictures to illustrate • find pictures on the internet • create a graphic (in ppt, graph tool) • animate complex pictures

  9. How to give a good presentation • Avoid bad color schemes • no red on blue looks awful • Avoid using laser pointer (especially if you are nervous) • Add pointing elements in your presentation • Practice to stay within your time! • Don’t rush through the talk!

  10. Brush up on Stereo Reconstruction

  11. Stereo • Extraction of 3D information from 2D images Images 3D Point Cloud Stereo

  12. Binocular stereo • Given a calibrated binocular stereo pair, fuse it to produce a depth image • Humans can do it Stereograms: Invented by Sir Charles Wheatstone, 1838

  13. Depth Recovery by Stereo d9 d8 Search Space d7 d6 d5 d4 d3 d2 d1 reference image matching image Depth Epipolar line

  14. Depth Recovery from Stereo Depth Map d9 d8 Search Space d7 d6 d5 d4 Pixel Matching Ground Truth d3 d2 d1 reference image matching image Depth Epipolar line Matching Cost depth Pixel similarity: measured by color differences

  15. Matching criteria • Raw pixel values (correlation) • Band-pass filtered images [Jones & Malik 92] • “Corner” like features [Zhang, …] • Edges [many people…] • Gradients [Seitz 89; Scharstein 94] • Rank statistics [Zabih & Woodfill 94] • Intervals [Birchfield and Tomasi 96] • Overview of matching metrics and their performance: • H. Hirschmüller and D. Scharstein, “Evaluation of Stereo Matching Costs on Images with Radiometric Differences”, PAMI 2008 slide: R. Szeliski

  16. Adaptive Weighting • Boundary Preserving • More Costly

  17. Simplest Case: Parallel images • Image planes of cameras are parallel to each other and to the baseline • Camera centers are at same height • Focal lengths are the same slide: S. Lazebnik

  18. Simplest Case: Parallel images • Image planes of cameras are parallel to each other and to the baseline • Camera centers are at same height • Focal lengths are the same • Then, epipolar lines fall along the horizontal scan lines of the images slide: S. Lazebnik

  19. Essential matrix for parallel images Epipolar constraint: R = I t = (T, 0, 0) x x’ t

  20. Essential matrix for parallel images Epipolar constraint: R = I t = (T, 0, 0) x x’ t

  21. Aggregation Structure Matching Cost depth Search Space Pixelwise Costs

  22. Aggregation Structure Search Space Search Space Cost aggregation: cutting the cost volume. Cost Volume

  23. Aggregation Structure Cost of the center pixel Treat neighbors equally Costs of neighboring pixels Fronto-Parallel Plane Sum of Absolute Differences (SAD) Depth Map Cost Volume

  24. Aggregation Structure Weighted cost of the center pixel Weighted costs of neighboring pixels • Color differences • Spatial distances Adaptive Weight Yoon and Kweon, PAMI 2006 Depth Map Cost Volume

  25. Aggregation Structure Adaptive Weight Oriented Plane Lu et al., CVPR 2013 Depth Map Cost Volume

  26. Your basic stereo algorithm For each epipolar line For each pixel in the left image • Improvement: match windows • This should look familar... • compare with every pixel on same epipolar line in right image • pick pixel with minimum match cost slide: R. Szeliski

  27. Depth Map Computation Image Resolution : the total number of pixels • Local methods • Depth with the minimum cost • Complexity: • Global methods • Pairwise interactions • Complexity: bN pixels N pixels aN pixels Scharstein and Szeliski, “A taxonomy and evaluation of dense two-frame stereo correspondence algorithms", IJCV 2002

  28. Depth from disparity X z x x’ f f BaselineB O O’ Disparity is inversely proportional to depth!

  29. Depth Sampling Depth sampling for integer pixel disparity Quadratic precision loss with depth!

  30. Depth Sampling Depth sampling for wider baseline

  31. Depth Sampling Depth sampling is in O(resolution6)

  32. Failures of correspondence search Occlusions, repetition Textureless surfaces Non-Lambertian surfaces, specularities slide: S. Lazebnik

  33. How can we improve window-based matching? • The similarity constraint is local (each reference window is matched independently) • Need to enforce non-local correspondence constraints slide: S. Lazebnik

  34. Non-local constraints • Uniqueness • For any point in one image, there should be at most one matching point in the other image slide: S. Lazebnik

  35. Non-local constraints • Uniqueness • For any point in one image, there should be at most one matching point in the other image • Ordering • Corresponding points should be in the same order in both views slide: S. Lazebnik

  36. Non-local constraints • Uniqueness • For any point in one image, there should be at most one matching point in the other image • Ordering • Corresponding points should be in the same order in both views Ordering constraint doesn’t hold slide: S. Lazebnik

  37. Non-local constraints • Uniqueness • For any point in one image, there should be at most one matching point in the other image • Ordering • Corresponding points should be in the same order in both views • Smoothness • We expect disparity values to change slowly (for the most part) slide: S. Lazebnik

  38. Multiple-baseline stereo results I1 I2 I10 M. Okutomi and T. Kanade, “A Multiple-Baseline Stereo System,” IEEE Trans. on Pattern Analysis and Machine Intelligence, 15(4):353-363 (1993).

  39. Plane Sweep Stereo • Choose a reference view • Sweep family of planes at different depths with respect to the reference camera input image input image reference camera • Each plane defines a homography warping each input image into the reference view R. Collins. A space-sweep approach to true multi-image matching. CVPR 1996.

  40. Real-time 3D reconstruction from video “Real-Time Plane-sweeping Stereo with Multiple Sweeping Directions", CVPR 2007 warped images SAD as similarity (darker is higher similarity) 3D scene

  41. Real-time 3D reconstruction from video “Real-Time Plane-sweeping Stereo with Multiple Sweeping Directions", CVPR 2007 warped images SAD as similarity (darker is higher similarity) 3D scene

  42. Real-time 3D reconstruction from video “Real-Time Plane-sweeping Stereo with Multiple Sweeping Directions", CVPR 2007 warped images Multi-way sweep SAD as similarity (darker is higher similarity) 3D scene

  43. 3D reconstruction from video view 1 view N

  44. 3D reconstruction from video

More Related