450 likes | 524 Views
Explore cutting-edge technology in 3D analysis to enhance images, extract pen strokes, enable character recognition, and create 3D models from 2D images. The system uses innovative techniques for preprocessing, line extraction, and curve matching. Unlock the potential of spatial modeling for various applications.
E N D
3D Handwriting AnalysisA. Razdan, J. Femiani, J. RowePartnership for Research in Spatial Modeling (PRISM) Dr. Anshuman Razdan Director (razdan@asu.edu)
Parsing the OCR Problem • Preprocessing and Image enhancement • Pen Stroke Creation • Character recognition • Word recognition
Image Enhancement • Preprocessing includes enhancing and refining the raw image. • Identifying and extracting blurred, stained, faded, bled through, or transferred characters, etc. • New PRISM method specifically identifies and analyzes linear structures (line strokes). • This technique works in both 3D (CT, MRI) and 2D (images) domains.
Image Refinement • 1D and 2D function models based on the 3 observed shape characteristics have been developed, and enhanced images are derived from their second derivatives. • A two-stage algorithm is developed to extract line and net patterns. Line and net patterns are first enhanced and then extracted by applying threshold value. • Line and net patterns in a noisy environment exist in many imaging technologies • Examples: Roads and rivers in satellite photos, curves in finger prints, blood vessels in CT angiography
Enhancement & Thresholding Original image Enhanced image Line extraction by thresholding
Flat Land: A Romance of Many Dimensions • You have to view the problem in at least one dimension higher than the data to get a sense of it(Flatland: A Romance of Many Dimensions: by Edwin A. Abbott, A Square, circa. 1884) Observer in 2D Land KING of 1D Land woman You are in 3D looking down at 2D space High Priest
Flat Land Conclusion • 1D (line) embed in 2D space (paper surface) • 2D (images) embed in 3D space (like this room) • 3D (objects) embedded in 4D or 5D space …. • Given this argument, using 3D space for understanding 2D images makes sense….
3D Pen Trace Recreation • Concept of raising or embedding 2D image in 3D space a.k.a Flat Land. • Understanding ink flow and information embedded in the pen strokes • Theory of Volume Modeling and Iso-surface Extraction
Chain Codes or Pen Traces • For any character matching/recognition algorithm to work efficiently it needs to unravel the stroking of the pen. • This means figuring out the chain code. Since it is not available in 2D bitmap we do it using 3D.
Pen Stroking • Pressure is applied to via the pen and is different in upstrokes and down strokes and also angle of writing. • There is flow of ink from the pen to the paper. Crossovers result in darker images
How 2D is raised to 3D • A transfer function is applied which converts intensity at each pixel into a height function and also a density function • Results in Volumetric data same as CT or MRI H(i,j) = F(x,y, I(x,y)) D(i,j,k) = I(x,y) Vol Func(x,y,H(i,j)) = D(I(x,y)) 2D Image Transformed into 3D
Marching Cubes • Marching cubes is used for making 3D surfaces from volumetric data such as MRI, CAT scan, etc.
MC: Thresholding • Explanation of how Marching Cubes uses predefined triangulations for each cube to form a whole mesh.
Volume Blurring • Start with Volume Function (V) on raw image (left image) • Apply Marching Cubes on V (middle image) • Create V’ = GnV (Blurring filter applied n times and then MC to create right image). Gn is the secret sauce.
The Problem • Given two curves X1 and X2, one can ask two distinct questions: • Curve matching i.e. • Is X1 = X2 ? • Or one a subset of the other curve • Or how similar are the two curves? • Curve alignment i.e. • What is the rotation and translation required to align one curve with the other?
Conclusions • Novel method to unravel strokes, characters and letterforms in complex handwritten documents. • Segments by Region/Row irrespective of scale, orientation, or position. • Geometry based curve matching technique for character recognition (dictionary generation, text recognition, and translation) • Language independence • Doesn’t need expensive scanning equipment (we paid $24.99). • Can be combined with existing technologies. • Provisional Patent filed in April 2003. Full patent filing spring 2004.
Weaknesses • Requires continuous tone original source (can not address single bit image i.e. FAX). • Can be computationally expensive for certain applications such as forgery but the technology is built to take advantage of parallelization.
Opportunities • Extend concept of volumes to other applications • Forensics (Offline comparisons) • Biometrics (Online authentication – wacom demo) • Forgery detection • Number extraction from noisy background (Currencies) • Opportunities for derivative patents
Gaps • Need to combine power of Stroke extraction and curve matching with traditional HMM and other statistical methods or commercial engines. • Man power/expertise required • AI/Statistics/traditional char recognition expert to create powerful hybrid engine • Language specific expert/paleographer • Requires productization and field testing.
Threats • Competition by 2D solutions and existing technologies. • Lack of awareness of the capabilities of 3D analytical tools in OCR world. • Geometry solution in a world seeped in statistical methods. • Establishing validity of the 2D - 3D conversion algorithm
Two labs on campus 0ne moving to bigger space in BY – downtown Tempe. Additional 8000 sq ft slated for a new project (Decision Theatre) in downtown Tempe. 24 proc SGI, 20+ workstations (Unix, PC and Linux) Four 3D Laser scanners for inanimate objects 3D face scanner (recent acquisition) 2 Rapid Prototyping machines PRISM Infrastructure
Image Refinement • Biomedical Examples: White matter in brain MRI scans, cell spindle fibers, membranes in laser confocal microscopic data. Fungus membrane Brain MRI Scan Mouse egg
Image Refinement • Blood Vessel • 3 characteristics (Chaudhuri et al) • Piecewise linear segments • Cross section as a Gaussian function • Relatively constant width
2D Line Model Blood Vessel (x,y)
2D Case: 2nd Derivatives C: constant, N: noise
Enhancement • Maximal eigenvalue as an enhanced image Enhanced Image
Results Crest lines extraction A synthetic image Matched filters Our method
Distance Between Two Functions Case 1: f and g continuous over [0,1] Case 2: f over [0,1] and g over [0,d], d <= 1 Penalty function
Curve Shape Measures • Shape Measures or Properties • Curvature (planar) • Torsion (space curves) • Total or absolute Curvature (space) • Classical Differential geometry says if the curvatures are identical then so are the curves subject to position and rotation
Curve Matching • Remember • Writing in terms of curvatures • What about partial match? • Or the general case