Richer Human-Machine Communication in Attributes-based Visual Recognition

Richer Human-Machine Communication in Attributes-based Visual Recognition Devi Parikh TTIC

Traditional Recognition Dog Chimpanzee Tiger ???

Attributes-based Recognition Furry White Black Big Stripped Yellow Stripped Black White Big Dog Chimpanzee Tiger

Applications Zero-shot learning Attributes provide a mode of communication between humans and machines! A Zebra is… White Black Stripped Zebra Image description Stripped Black White Big

Agenda Enriching the mode of communication • Nameable and Discriminative Attributes (to appear CVPR 2011) • Relative Attributes (under review) Kristen Grauman

Attributes Attributes are most useful if they are • Discriminative • Nameable

Interactive system 1. Name: Fluffy 2. Name: x 3. Name: Metal … How do we show the user a candidate-attribute? How do we ensure proposals are discriminative? How do we ensure proposals are nameable?

Attribute visualization

Attribute Visualization

Ensure Discriminability Normalized cuts Max Margin Clustering

Ensure Nameability 1. Name: Fluffy 2. Name: x 3. Name: Metal …

Ensure Nameability 1. Name: Fluffy 2. Name: x 3. Name: Metal … Mixture of Probabilistic PCA

Interactive System

Evaluation • Outdoor Scenes • Animals with Attributes • Public Figures Face • Gist and Color features (LDA)

Interactive System

Evaluation • Annotate all candidates off-line “Black” … ~25000 responses

Evaluation • Annotate all candidates off-line … ~25000 responses “Spotted”

Evaluation • Annotate all candidates off-line … ~25000 responses Unnameable

Evaluation • Annotate all candidates off-line … ~25000 responses “Green”

Evaluation • Annotate all candidates off-line … ~25000 responses “Congested”

Evaluation • Annotate all candidates off-line … ~25000 responses “Smiling”

Results Structure exists in nameability space allowing for prediction Our active approach discovers more discriminative splits than baselines

Results Comparing to discriminative-only baseline

Results Comparing to descriptive-only baseline

Results Automatically generated descriptions

Summary • Machines need to understand us • Attributes need to be detectable & discriminative • We need to understand machines • Attributes need to be nameable • Interactive system for discovering attributes • Relative Attributes • More precise communication • Helps machines (zero-shot learning) • Helps humans (image descriptions)

Relative Attributes

Summary • Machines need to understand us • Attributes need to be detectable & discriminative • We need to understand machines • Attributes need to be nameable • Interactive system for discovering attributes • Relative Attributes • More precise communication • Helps machines (zero-shot learning) • Helps humans (image descriptions)

Human-Debugging Larry Zitnick (CVPR 2008, 2010, 2011, under review, in progress)

Thank you.

Richer Human-Machine Communication in Attributes-based Visual Recognition

Richer Human-Machine Communication in Attributes-based Visual Recognition

Presentation Transcript

Visual Word Recognition

6.S093 Visual Recognition through Machine Learning Competition

Visual Word Recognition

Visual Pattern Recognition

Enhancing Human-Machine Communication via Visual Attributes

Visual Communication

Visual Object Recognition

Human-Computer Vision-based Communication

Visual Object Recognition

6.S093 Visual Recognition through Machine Learning Competition

Visual Object Recognition

6.S093: Visual Recognition through Machine Learning Competition

Visual communication

Machine recognition of human activities : a survey

On Visual Recognition

Graduate Attributes: Communication

Visual Object Recognition

Visual Recognition

The Visual Recognition Machine

Visual Communication

Human Emotion Recognition using Machine Learning