Machine Learning: Foundations Course TAU – 2012A Prof. Yishay Mansour

Machine Learning: Foundations CourseTAU – 2012AProf. YishayMansour • TextonBoost:Joint Appearance, Shape and Context Modeling for Multi-Class Object Recognition and Segmentation J. Shotton*, J. Winn†, C. Rother†, and A. Criminisi† • * University of Cambridge • † Microsoft Research Ltd, Cambridge, UK Yaniv Bar March 2013

Goal • Simultaneous recognition and segmentation: • Efficiently detect a large number of object classes and give a pixel-perfect segmentation of an image into these classes.

Data and Classes • Original Paper: 3 DBs. Main DB: MSRC 21. • MSRC 21-Class Object Recognition Database • 591 hand-labelled images • Original main DB was updated to MSRC 23. • MSRC 23-Class Object Recognition Database • 592 hand-labelled images

High Level Approach • High-level description of approach: Learn classifier based on relative texture locations for each class. Classification is then refined. Given an image, for each pixel: - Texture-Layout features are calculated - A boosting classifier gives the probability of the pixel belonging to each class - The discriminative model combines the boosting output with low-level color, location, and edge information; image receives final label.

Texture layout Features • Most important part of the model is the Shape/Context Potential – it is significant for object recognition and very rough segmentation results. • Other potential such as Edge and Color refine the segmentation results. (a) Original image, (b) Shape, (c) (b)+edge, (d) (c)+color

For modeling object shape, appearance and context we use a New texton-based features. This feature (texton) compact and efficient characterisation of local texture.

What Are Textons • The task is to recognize surfaces made from different materials on the basis of their texture appearance. • Different materials show different texture appearance. • Moreover, texture appearance of the same material changes dramatically due to different viewpoint/lighting settings (specularities, shadows, and occlusions).

Input image Calculating Texture-Layout features • Computing texton maps: Clustering  • Responses are clustered with K-means Texton map Colours  Texton Indices Filter Bank • Each pixel is assigned a texton number • Convolve 17-D filter bank (composed of gaussians, dogs, logs) with all training images

Capturing appearance:

How Texture-Layout features jointly model texture and layout:

Learning • Learning is done with Joint Boost algorithm – A version of Multi class gentle boost algorithm. • I’ve used both AdaBoost.M1 and AdaBoost.Mh (multiclass reduction to binary which is due to the fact that AdaBoosting is only for binary classification).

The Good and Bad • The Good: Provides reasonable recognition + segmentation for many classes. Also, combines several good ideas. Most of previous works didn’t tackle the problem as a whole – rather, problems were treated separately. • The Bad: Does not beat past work (in terms of quantitative recognition results) and a bit hacky.

Code-Sequence of execution 1. imagesTextonization.m (extract efficient images characterization) 2. calcModelFeatures.m (calculate the appearance (shape) potential context) 3. trainModel.m (build a classification model) 4. testModel.m (test the classification model with test data)

Results

Machine Learning: Foundations Course TAU – 2012A Prof. Yishay Mansour

Machine Learning: Foundations Course TAU – 2012A Prof. Yishay Mansour

Presentation Transcript

Machine Learning Part 2: Intermediate and Active Sampling Methods

Foundations of Computer Science from Data Manipulation to Theory of Computation

First Foundations in Pathology, Part 2: Acute and Chronic Inflammation

Machine Learning Methods for Human-Computer Interaction

Chapter 1: Foundations: Logic and Proofs

EMERGING SYSTEMS FOR LARGE-SCALE MACHINE LEARNING

A New Parallel Framework for Machine Learning

Muscle Contraction

Performance Evaluation of Machine Learning Algorithms

Supervised Learning

Tamara Berg Machine Learning

Machine Learning for High-Throughput Biological Data

Beyond Convexity – Submodularity in Machine Learning

Lecture 5: Turning Machine

Chapter 1. The Foundations: Logic and Proofs

Machine Learning

Graphical Models in Machine Learning

First Foundations: Essentials of General Human Pathology

Machine Learning

Foundations VI: Foundations VI: Discovery, Access and Semantic Integration