1 / 26

Multi-view Traffic Sign Detection, Recognition and 3D Localisation

Multi-view Traffic Sign Detection, Recognition and 3D Localisation. Radu Timofte, Karel Zimmermann, and Luc Van Gool. Problem definition. Input: Large set of views and corresponding camera locations Output: List of traffic signs. Outline. Single view

jbasil
Download Presentation

Multi-view Traffic Sign Detection, Recognition and 3D Localisation

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Multi-view Traffic Sign Detection, Recognition and 3D Localisation Radu Timofte, Karel Zimmermann, and Luc Van Gool

  2. Problem definition • Input: Large set of views and corresponding camera locations • Output: List of traffic signs

  3. Outline Single view • Segmentation – very fast bounding box selection process with FN -> 0. • Traffic signs are designed to be well distinguishable from background => have distinctive colors and shapes. • Detection – Adaboost classifiers of bounding boxes. • Recognition – based on SVM classifiers. Multi-view • Global optimization – over single-view detections constrained by 3D geometry

  4. Color-based segmentation (thresholding) • Estimation of connected components of a thresholded image (T = [0.5,0.2,-0.4,1.0]T)

  5. Shape-based segmentation • Searching for specific shapes (rectangles, circles, triangles). • Not all traffic signs are locally threshold separable. • More time consuming, many responses for small shapes (every small region is approximately some basic shape).

  6. Learning segmentation • There are thousands of possible settings of such methods, e.g. different projections from color space. • Learning is searching for a reasonable subset of these methods/settings. • Optimal trade-off among FN, FP and the number of methods: T* = argmin (FP(T) + K1FN(T)+K2card(T)) • Boolean Linear Programming selects ≈ 50 methods out of 10000 in 2 hours. • Segmentation results for example: FNBB = 1.5%, FP = 3443/ 2Mpxl image, (FNTS = 0.5%)

  7. How does the output of segmentation looks like?

  8. Detection • Detection: suppression of bounding boxes which does not like a traffic sign. • Haar features computed on each channel of HSI space. • Separated shape-specific cascades of Adaboost classifiers. • Detection (+segmentation) results:

  9. How does the output of detection looks like?

  10. 3D optimization - introduction • Single view detection and recognition is just preprocessing, the final decision is subject of the global optimization over multiple views. • The idea is based on Minimum Description Length, i.e. explaining detected bounding boxes by the lowest number of real world traffic signs. • If detections satisfy some visual and geometrical constrains, then all of these detections are explainable by one real world traffic sign.

  11. Minimum Description Length in 3D

  12. Minimum Description Length in 3D

  13. Minimum Description Length in 3D

  14. Minimum Description Length in 3D

  15. Minimum Description Length in 3D

  16. Minimum Description Length in 3D

  17. Minimum Description Length in 3D

  18. Minimum Description Length in 3D

  19. Minimum Description Length in 3D

  20. Problem formulation max XT X x∊{0,1}

  21. Example with 16 views

  22. Example with 16 views

  23. Results • The summary of 3D results: • The average accuracy of 3D localisation is of 24.54 cm.

  24. Visualisation of 3D results in one camera

  25. 3D visualisation

  26. Conclusions • Traffic Sign Detection, Recognition and 3D Localisation is a challenging problem. • We propose a multi-view scheme, which combines 2D and 3D analysis. • The main contributions are: • Boolean Linear Programming formulation for fast candidate extraction in 2D • Minimum Description Length formulation for best 3D hypothesis selection • Work in progress…

More Related