1 / 38

Vision in Man and Machine . STATS 19 SEM 2. 263057202. Talk 2.

Vision in Man and Machine . STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. www.stat.ucla/~yuille. The Purpose of Vision. “To Know What is Where by Looking”. Aristotle. (384-322 BC).

linh
Download Presentation

Vision in Man and Machine . STATS 19 SEM 2. 263057202. Talk 2.

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Vision in Man and Machine.STATS 19 SEM 2. 263057202. Talk 2. Alan L. Yuille. UCLA. Dept. Statistics and Psychology. www.stat.ucla/~yuille

  2. The Purpose of Vision. • “To Know What is Where by Looking”. Aristotle. (384-322 BC). • Information Processing: receive a signal by light rays and decode its information. • Vision appears deceptively simple, but there is more to Vision than meets the Eye.

  3. Ames Room

  4. Perspective.

  5. Curved Lines?

  6. Brightness of Patterns: Adelson (MIT)

  7. Visual Illusions • The perception of brightness of a surface, • or the length of a line, • depends on context. • Not on basic measurements like: • the no. of photons that reach the eye • or the length of line in the image..

  8. Perception as Inference • Helmholtz. 1821-1894. • “Perception as Unconscious Inference”.

  9. Ball in a Box. (D. Kersten)

  10. How Hard is Vision? • The Human Brain devotes an enormous amount of resources to vision. • (I) Optic nerve is the biggest nerve in the body. • (II) Roughly half of the neurons in the cortex are involved in vision (van Essen). • If intelligence is proportional to neural activity, then vision requires more intelligence than mathematics or chess.

  11. Vision and the Brain

  12. Half the Cortex does Vision

  13. Vision and Artificial Intelligence • The hardness of vision became clearer when the Artificial Intelligence community tried to design computer programs to do vision. ’60s. • AI workers thought that vision was “low- level” and easy. • Prof. Marvin Minsky (pioneer of AI) asked a student to solve vision as a summer project.

  14. Chess and Face Detection • Artificial Intelligence Community preferred Chess to Vision. • By the mid-90’s Chess programs could beat the world champion Kasparov. • But computers could not find faces in images.

  15. Man and Machine. • David Marr (1945-1980) • Three Levels of explanation: 1. Computation Level/Information Processing 2. Algorithmic Level 3. Hardware: Neurons versus silicon chips. Claim: Man and Machine are similar at Level 1.

  16. Vision: Decoding Images

  17. Vision as Probabilistic Inference • Represent the World by S. • Represent the Image by I. • Goal: decode I and infer S. • Model image formation by likelihood function, generative model, P(I|S) • Model our knowledge of the world by a prior P(S).

  18. Bayes Theorem • Then Bayes’ Theorem states we show infer the world S from I by • P(S|I) = P(I|S)P(S)/P(I). • Rev. T. Bayes. 1702-1761

  19. Bayes to Infer S from I • P(I|S) likelihood function . P(S) prior. .

  20. Technically very interdisciplinary • But applying Bayes is not straightforward. • A beautiful theory is being developed adapting techniques from Computer Science, Engineering, Mathematics, Physics, and Statistics. • E.G. Probabilistic Reasoning (Pearl CS), Level Sets (Osher Maths).

  21. Examples • Generative Models • Visual Inference: (1) Estimating Shape. (2) Segmenting Images. (3) Detecting Faces. (4) Detecting and Reading Text.

  22. Generative Models Learn Generative Models from a few images and then generate new images.

  23. Uses of Generative Models Univ. Oxford

  24. Shape Inference: (Zhu Lab)

  25. Shape and Photometry ( Soatto Lab) • Estimate geometry (shape) and photometry from multiple images. Jin-Soatto-Yezzi

  26. Compare ground truth (Soatto Lab) Estimated shape Alternative algorithm Ground truth Jin-Soatto-Yezzi 11/1/02

  27. Compare w. ground truth (Soatto Lab) Generated Image: synthesized from novel viewpoint and illumination. Ground Truth: same lighting and viewpoint Jin-Soatto-Yezzi 11/1/02

  28. Segmentation (Level Sets)

  29. Segmentation (Level Sets)

  30. Segmenting Images (Zhu Lab) • Characterize the set of image patterns that occur in natural images. Provide mathematical models. P(I|S) and P(S).

  31. Face and Text Detection.

  32. Back to the Brain • Top-Level; compare human performance to Ideal Observers. • Explain human perceptual biases (visual illusions) as strategies that are “statistical effective”.

  33. Brain Architecture • The Bayesian models have interesting analogies to the brain. • Generative Models require top-down processing

  34. High-Level Tells Low-Level to Shut Up (Kersten Lab)

  35. High-Level Tells Low-Level to Shut up (Kersten Lab)

  36. Conclusion • Vision is unconscious inference. • Theory of Vision for Man and Machine. • See more about Vision at UCLA in the Vision and Image Science Collective • http://visciences.ucla.edu

More Related