180 likes | 294 Views
This resource delves into the realm of vision-based interaction, tracing its origins and development since the 1970s with pioneers like Myron Krueger. It outlines the fundamental principles, such as the separation of video from background using chroma key techniques, and introduces 'verbs' and 'nouns' in vision-based UIs, which involve detecting and recognizing objects and their interactions. Additionally, it highlights key tools, including Intel's OpenCV and the Papier-Mâché framework, and recommends foundational texts for further learning in computer vision and interactive systems.
E N D
Vision-based Interaction Scott Klemmer 11 November 2004
cs547: Michael Shiloh Teleo: Rapid Prototyping Toolkit Making Things Vision-based Interaction
The first vision-based interface • Myron Krueger used computer vision to create Responsive Environments (1970s) • “Reaction is the Medium” • http://www.artmuseum.net/w2vr/timeline/videoplace_video.html Vision-based Interaction
How it works • Video and background are separated in analog using chroma key techniques(think broadcast news) • The first and last points of each raster are stored in the computer, and represent the person’s outline Vision-based Interaction
Vision-based UIs: “Verbs” • Detecting and Tracking elements of a certain type in a scene • Capturing contents of detected objects • Recognizing individual members in an object class Vision-based Interaction
Vision-based UIs: “Verbs” • Detecting and Tracking elements of a certain type in a scene Vision-based Interaction
Vision-based UIs: “Verbs” • Capturing contents of detected objects Vision-based Interaction
Vision-based UIs: “Verbs” • Recognizing individual members in a class Vision-based Interaction
Vision-based UIs: “Nouns” • People (one or multiple) • Bodies • Faces • Hands • Documents • Objects Vision-based Interaction
Vision-based UIs: “Nouns” • People (one or multiple) • Bodies • Faces • Hands • Documents • Objects Vision-based Interaction
Vision-based UIs: “Nouns” • People (one or multiple) • Bodies • Faces • Hands • Documents • Objects Vision-based Interaction
I N F R A S T R U C T U R E Background Subtraction Vision-based Interaction
Image Moments (of Inertia) • 0th moment is mass(total number of pixels) Vision-based Interaction
Image Moments (of Inertia) • 1st moment is center Vision-based Interaction
Image Moments (of Inertia) • 2nd moment is orientation Vision-based Interaction
Tools for Vision apps • Intel’s OpenCV • C API to highly optimized image processing functions (threshold, dilate, optical flow, …) • http://www.intel.com/research/mrl/research/opencv • Fast to run! Slow to develop • Great for vision folks; too low-level for app folks • Papier-Mâché • Java API (and to some extent visual UI) for vision (and other physical input) • http://guir.berkeley.edu/papier-mache • Fast to develop! Slow to run • Great for app folks; ~5 fps can sometimes be too slow Vision-based Interaction
Good Vision Books • Computer Vision: A Modern Approach • David Forsyth and Jean Ponce (2003) • Fantastic book; but goal is more theoretical understanding than practical application • Robot Vision • Berthold Horn (1987) • More focused on apps and interactive algorithms • Somewhat out of date Vision-based Interaction
iRoom demo after class… Vision-based Interaction