Petacat : Applying ideas from Copycat to image understanding

Petacat: Applying ideas from Copycat to image understanding

How Streetscenes Works(Bileschi, 2006) 1. Densely tile the image with windows of different sizes. 2. HMAX C2 features are computed in each window. 3. The features in each window are given as input to each of five trained support vector machines (“pedestrian”, “car”, “bicycle”, “building”, “tree”) 4. If any return a classification with score above a learned threshold, that object is said to be “detected” . …

Object detection (here, “car”) with HMAX model (Bileschi, 2006)

Limitations of Streetscenes approach for “image understanding”

Limitations of Streetscenes approach for “image understanding” • Exhaustive search – not scalable • Does not recognize spatial and abstract relationships among objects for whole scene understanding • Has no prior knowledge about object categories and their place in “conceptual space” • HMAX model is completely feed-forward; no feedback to allow context to aid in scene understanding. • Where should feedback come in?

Representation of High-Level Knowledge: A Simple Semantic Network (or “Ontology”) “Dog walking” Person Dog leash holds attached to action action walking

But...

Modified Ontology “Dog walking” Person Dog leash holds attached to Dog Group action action running walking

Modified Ontology “Dog walking” Person Dog leash holds attached to Dog Group action action Allowing “conceptual slippage” running walking

But...

Modified Ontology “Dog walking” holds attached to leash Dog Group Person Dog action Cat action walking running Iguana

But...

Modified Ontology “Dog walking” Person Dog leash Helicopter Bicycle Car holds attached to Dog Group action action Cat running Iguana walking

But...

Lawn mower Attached to Helicopter Gasoline Fanny pack Sidewalk Beach Stick Inside Runway Sky Leash Army Grass Airplane Ground Outside Dog Person Dog walking Dog grooming Holding Tree Backpack Standing Close to Above Walking Car Running Track Left of Far from

Need dynamical process of constructing representation.

Need dynamical process of constructing representation. Information gained during the unfolding of perception feeds back to guide the directions the perceptual process takes.

Need dynamical process of constructing representation. Information gained during the unfolding of perception feeds back to guide the directions the perceptual process takes. • Ongoing perception of “context” brings in appropriate concepts and conceptual slippages, and avoids exhaustive search

Need dynamical process of constructing representation. Information gained during the unfolding of perception feeds back to guide the directions the perceptual process takes. • Ongoing perception of “context” brings in appropriate concepts and conceptual slippages, and avoids exhaustive search • Prior, higher-level knowledge interacts with lower-level vision in both directions (bottom-up and top-down).

Need dynamical process of constructing representation. Information gained during the unfolding of perception feeds back to guide the directions the perceptual process takes. • Ongoing perception of “context” brings in appropriate concepts and conceptual slippages, and avoids exhaustive search • Prior, higher-level knowledge interacts with lower-level vision in both directions (bottom-up and top-down). • Concepts are “fluid”, allowed to “slip” in certain contexts.

Need dynamical process of constructing representation. Information gained during the unfolding of perception feeds back to guide the directions the perceptual process takes. • Ongoing perception of “context” brings in appropriate concepts and conceptual slippages, and avoids exhaustive search • Prior, higher-level knowledge interacts with lower-level vision in both directions (bottom-up and top-down). • Concepts are “fluid”, allowed to “slip” in certain contexts. • This allows perception of essential similarity in the face of superficial differences—i.e., analogy-making.

Active Symbol Architecture(Hofstadter et al., 1995)

Active Symbol Architecture(Hofstadter et al., 1995) • Basis for • Copycat (analogy-making), Hofstadter & Mitchell • Tabletop (anlaogy-making), Hofstadter & French • Metacat(analogy-making and self-awareness), Hofstadter & Marshall and many others…

Semantic network Active Symbol Architecture(Hofstadter et al., 1995) Workspace Temperature Perceptual agents (codelets)

Petacat:(Descendant of Copycat)Integration of Active Symbol Architecture and HMAX Initial task: Decide if image is an instance of “taking a dog for a walk”, and if so, how good an instance it is.

Semantic Network indoors taking a dog for a walk has location Object outdoors has component has component has component Action grass is in front of a sidewalk beach a person Spatial Relation dog is on is on is touching leash is touching is on road a has action is behind is next to has action horse cat has component trail belt walks walks rope is in front of runs has location is touching flies string drives stands swims sits

Semantic Network indoors Property links taking a dog for a walk has location Object outdoors Slip links has component has component has component Action grass is in front of a sidewalk beach a person Spatial Relation dog is on is on is touching leash is touching is on road a has action is next to is behind has action horse cat has component trail belt walks walks rope is in front of runs has location is touching flies string drives stands swims sits

Semantic Network indoors Property links taking a dog for a walk has location Object outdoors Slip links has component has component has component Action grass is in front of a sidewalk beach a person Spatial Relation dog is on is on is touching leash is touching is on road a has action is next to is behind has action horse cat has component trail belt walks walks rope is in front of runs has location is touching flies string drives stands swims sits Properties of nodes

Workspace

Semantic network Workspace

Semantic network Perceptual Agents (Codelets) Codelets as active symbols

indoors taking a dog for a walk has location Object outdoors has component has component has component Action is in front of grass a beach a person is on dog is on is touching Spatial Relation leash is touching is on road a sidewalk has action is behind is next to has action horse cat has component trail belt walks walks rope is in front of runs has location is touching flies string drives stands swims sits

indoors taking a dog for a walk has location Object outdoors has component has component has component Action is in front of grass a beach a is on person dog is on is touching Spatial Relation leash is touching is on road a sidewalk has action is behind is next to has action horse has component cat trail belt walks walks rope is in front of runs has location is touching flies string drives stands swims sits

indoors taking a dog for a walk has location outdoors Object has component has component has component Action is in front of grass a beach a is on person dog is on Spatial Relation is touching is on sidewalk leash is touching has action road a is behind is next to has component has action horse cat trail belt walks walks rope is in front of has location is touching runs flies string drives stands swims sits

indoors taking a dog for a walk has location Object outdoors has component has component has component Action is in front of grass a beach a is on person dog is on is touching Spatial Relation leash is touching is on road a sidewalk has action is behind is next to has action horse cat has component trail belt walks walks rope is in front of runs has location is touching flies string drives stands swims sits

Illustration of what we plan to have happen – not a real run of Petacat Dog?

Illustration of what we plan to have happen – not a real run of Petacat Person? Dog? Dog?

Illustration of what we plan to have happen – not a real run of Petacat Person? Dog? Dog? Sidewalk?

Illustration of what we plan to have happen – not a real run of Petacat Outdoors? Person? Dog? Dog? Dog? Sidewalk?

Illustration of what we plan to have happen – not a real run of Petacat Outdoors? Person? Dog? Dog? Dog? Sidewalk? Scout codelets: Send C1 features in window to corresponding SVM. If positive result, post builder codeletwith urgency equal to SVM’s confidence.

Illustration of what we plan to have happen – not a real run of Petacat Outdoors? positive: 0.7 Person? negative Dog? negative Dog? negative Dog? positive: 0.8 Sidewalk? positive: 0.4 Scout codelets: Send C1 features in window to corresponding SVM. If positive result, post builder codeletwith urgency equal to SVM’s confidence.

Illustration of what we plan to have happen – not a real run of Petacat Outdoors? positive: 0.7 Person? negative Dog? negative Dog? negative Dog? positive: 0.8 Sidewalk? positive: 0.4 Builder codelets: Ask HMAX to compute C2 features using prototypes specific to the object (or scene), and send them to corresponding SVM. If positive, decide to build structure with probability equal to SVM confidence. Break competing structures if necessary.

Illustration of what we plan to have happen – not a real run of Petacat Outdoors Dog Builder codelets: Ask HMAX to compute object-/scene-specific C2 features, and send them to corresponding SVM. If positive, decide to build structure with probability equal to SVM confidence. Break competing structures if necessary.

indoors taking a dog for a walk has location Object outdoors has component has component has component Action is in front of grass a beach a is on person dog is on is touching Spatial Relation leash is touching is on road a sidewalk has action is behind is next to has action horse cat has component trail belt walks walks rope is in front of runs has location is touching flies string drives stands swims sits

Petacat : Applying ideas from Copycat to image understanding

Petacat : Applying ideas from Copycat to image understanding

Presentation Transcript

PowerPoint Lesson 1 PowerPoint Basics

PowerPoint tutorial MENU (click to choose menu item)

Powerpoint Tips

Applying for a job

PowerPoint

Understanding Networks can help build the CBR Africa Network

2012 PowerPoint Template Version 2.0

PowerPoint

Today’s topic is.. Image Understanding

Level 2 PowerPoint Training

eLearning Presentation

PowerPoint Tutorial 2 Applying and Modifying Text and Graphic Objects

畵像認識 / 理解 image recognition / understanding

PowerPoint: Presentation Tips

Creating A PowerPoint Presentation

PowerPoint Presentation

Fine Tune Image

From Ideas to Implementation

Intermediate PowerPoint

How to Create a PowerPoint Presentation

Chapter 1 Lecture Outline See PowerPoint Image Slides for all figures and tables pre-inserted into