580 likes | 830 Views
Shape-based Object Recognition. Thomas B. Sebastian. Brown University. Summary. Shape recognition is important but challenging task Three main contributions to computer vision: Robust shape recognition algorithm for matching shock graphs Relies on edit distance between shock graphs
E N D
Shape-based Object Recognition Thomas B. Sebastian Brown University
Summary • Shape recognition is important but challenging task • Three main contributions to computer vision: • Robust shape recognition algorithm for matching shock graphs • Relies on edit distance between shock graphs • Gives excellent recognition rates • Useful in indexing large shape databases • Generic curve matching algorithm with several applications • Effective segmentation method, which is useful in several medical applications
Contents • Introduction • Shape recognition using shock graphs • Edit-distance algorithm • Edit-costs using curve matching • Results • Indexing into large shape databases • Segmentation using skeletally coupled deformable model • Additional applications of curve matching
Introduction • Recognition is a challenging problem because it needs to integrate multiple cues like color, texture and shape • Shape is an important cue • However, shape is least studied, and is difficult to extract, characterize and represent
Example 1: View-point variations Robustness to transformations Recognition algorithms have to overcome several visual transformation challenges
Robustness to transformations (cont.) Example 2: Partial occlusion
Robustness to transformations (cont.) Example 3: Articulation
Contents • Introduction • Shape recognition using shock graphs • Edit-distance algorithm • Edit-costs using curve matching • Results • Indexing into large shape databases • Segmentation using skeletally coupled deformable model • Additional applications of curve matching
Real Example Shock graph representation of shapes • Shocks (or medial axis or skeleton) are locus of centers of maximal circles that are bitangent to shape boundary Shape boundary Shocks
Distance between shapes • Distance between shapes is important for recognition • Shape space is collection of all shapes • Shape is a point • Shape deformation sequence is a path • Cost of optimal deformation sequence is distance from A to B
Deformation paths • Number of deformation paths between two shapes is infinite • Finding the optimal deformation path is intractable • Three measures are employed in our edit-distance algorithm to make the problem tractable • Avoidance of complexity-increasing deformation paths • Partitioning of shape space using shock graph topology • Discretization of deformation paths
Avoidance of complexity-increasing paths • Complexity-increasing shape deformation paths are not optimal • Represent a deformation path by a pair of simplifying deformation paths from A, B to a simpler shape C
Changes in shock graph topology • Shock graph topology is unaltered for most shape deformations • At transition shapes small changes lead to abrupt changes in the shock graph topology
Partitioning of shape space • Shape cell: Collection of shapes with same shock graph topology • Transition shapes form the boundary between shape cells Shape cell 2 Shape cell 1
Shape deformation bundle:Collection of deformation paths passing through same set of shape cells • Represented by set of transition shapes it passes through Discretization of deformation paths
Deformation sequence as graph edits • Shock graph edit operation transforms a shape to adjacent transition shape
Edit-distance algorithm • Optimal deformation sequence is found using adynamic-programmingbasedpolynomial-time edit-distance algorithm [SODA 99, 01] • Proposed for comparing character strings [Wagner, Fischer] • Extended to matching rooted trees [Tai, Zhang, Shasha] • Employed in traditional edit-distance operations (i) delete, (ii) re-label
Splice: deletes a shock branch and merges the remaining two • Contract:deletes a shock branch between degree-three nodes • Merge:combines two branches at a degree-two node Edit-distance for shock graphs • Four edit operations are needed for shock graphs • Deform:changes the attributes of a shock branch
Contents • Introduction • Shape recognition using shock graphs • Edit-distance algorithm • Edit-costs using curve matching • Results • Indexing into large shape databases • Segmentation using skeletally coupled deformable model • Additional applications of curve matching
Assigning costs to edit operations • Edit costs must be • Consistent with each other • Consistent with our perceptual metrics of similarity • Our approach is twofold • Define the cost of deform edit • Define cost of other edits as limits of deform cost
Deform cost as joint-curve matching • Deform cost relies on extending a metric developed to compare curves [MMBIA00, IWVF01] • Each shock edge corresponds to two boundary segments • Matching shock edges can be viewed as joint-curve matching
Curve matching • Goal is to find • Optimal alignment (pairing of points in C, C’) • Distance (cost to deform C to C’)
Cost of deforming curves • Assumption: Cost of matching curves is equal to the sum of costs of matching infinitesimal sub-segments • Cost of infinitesimal sub-segments is measured in terms of length and curvaturedifferences
Representation of alignment • Alignment is a pairing of points of C, C’ • Treat curves C and C’ as the x and y-axes • Alignment is then a curve in 2D space, called alignment curve
Template consisting of nine incoming curves (at different slopes) is used to update the cost • Optimal deformation path is found by back-tracking Finding optimal alignment curve • Use dynamic programming to find optimal alignment curve • Finding optimal alignment curve corresponds to a shortest-path problem
Deform cost between shock edges • Deform cost between shock edges consists of • Length differences of the boundary segments • Curvature differences of boundary segments • Difference in width of shape • Difference in relative orientation of boundary segments
Edit-distance between shapes • Edit-distance is defined as the sum of the cost of edits in optimal edit sequence
Contents • Introduction • Shape recognition using shock graphs • Edit-distance algorithm • Edit-costs using curve matching • Results • Indexing into large shape databases • Segmentation using skeletally coupled deformable model • Additional applications of curve matching
Matching results* Edit-distance algorithm gives intuitive results * Same colors indicate matching edges; gray-colored edges are pruned
Boundary noise • Shock graphs are sensitive to boundary noise In optimal edit sequence “noisy” branches are pruned
Articulation • Shock graphs represents object parts and part hierarchy Edit-distance is robust in presence of part-based changes
Viewpoint variation Smooth change • Deform edit handles smooth changes • Splice and contract edits handle abrupt changes Abrupt change
Partial occlusion Edit-distance is robust to partial occlusion
Indexing into shape databases* Edit-distance algorithm allows 100% shape recognition between different shape categories * Results duplicated in two databases: 99 shapes and 216 shapes
Indexing Results It also allows nearly 100% recognition between shapes in the same shape category
Contents • Introduction • Shape recognition using shock graphs • Edit-distance algorithm • Edit-costs using curve matching • Results • Indexing into large shape databases • Segmentation using skeletally coupled deformable model • Additional applications of curve matching
Solutions • Develop a coarse-scale version of match • Use exemplars to represent shape categories Challenges • Shock graph matching is computationally expensive • Each match takes 2-3 minutes • Querying 1000 shape database takes >2000 minutes (35 hours)
Compute edit costs using nodes in shock graph • Represent each shock link by its end points • Significant speed-up in computation time • Coarse-scale matching 2-3 seconds • Shock graph edit distance 2-3 minutes Coarse-scale shock graph matching
Coarse-scale matching results Top matches contain most of shapes from correct category
Coarse vs. fine matching results • Two-stage indexing strategy • Stage 1: Coarse-scale matching to eliminate distant shapes • Stage 2: Fine-scale matching to refine the top matches Coarse Fine Coarse - Top 158 matches have 54 fish shapes Fine - Top 61 matches have 54 fish shapes
Exemplars for shape categories • Comparing query with all shapes is unnecessary • If query is “distant” from a fish, it will be distant from all fish • In context of all shapes fish will be tightly clustered Each category is represented by a few exemplars
Exemplar-based indexing results • Indexing results are excellent using a large database (1032 shapes, 200 exemplars) • Correct category is selected in top 1, 2 and 5 matches with 78%, 91%, and 99% success rate respectively
Number of exemplars • Using fewer exemplars suggest a hierarchical representation • 2-3 primary exemplars rule out 75% of categories • Additional 2-3 auxiliary exemplars rule out another 50% of categories
Contents • Introduction • Shape recognition using shock graphs • Edit-distance algorithm • Edit-costs using curve matching • Results • Indexing into large shape databases • Segmentation using skeletally coupled deformable model • Additional applications of curve matching
Motivation • Traditional deformable models have convergence problems
Skeletally coupled deformable model (SCDM) • The underlying principle of skeletally coupled deformable model is use of skeleton between growing seeds to modulate their growth • Growth of seeds is governed by forces at skeletal point and competing boundary
Convergence of SCDM • SCDM solves the convergence problem of traditional deformable models
Carpal Bone Segmentation Results Results are clinically meaningful
3D Model of Carpal Bones 3D model is useful in studying carpal kinematics, creating computational atlases etc.
Contents • Introduction • Shape recognition using shock graphs • Edit-distance algorithm • Edit-costs using curve matching • Results • Indexing into large shape databases • Segmentation using skeletally coupled deformable model • Additional applications of curve matching