210 likes | 340 Views
This research explores a novel approach to multi-class text categorization, addressing the challenges of classifying documents with unobserved labels. Utilizing a Constrained Conditional Model (CCM), we integrate hidden aspect variables to introduce constraints, enhancing the predictive power for previously unseen classes. By defining a structured labeling space, we demonstrate how aspects influence classification and present experimental results showing improved accuracy. This work aims to provide insights into better handling complex categorization tasks in various domains, including sports, health, and business.
E N D
Aspect Guided Text Categorization with Unobserved Labels Dan Roth, YuanchengTu University of Illinois at Urbana-Champaign
Text Categorization Sports Health Business … Science C1 C2 C3 … CN • An archetypical Multi-Class Classification (MCC) problem • F : X → Y • a document, d ∈ X , a collection of classes Y = {c1, c2, . . . , cN}
Motivation: what are we missing? C1 C2 C3 … CN Sports Health Business … Science Class labels (Y) contain information which can help classification How can we explore the label space?
Aspect Variables Show me where I can eat nearby X Find nearest restaurant findnearestrestaurant Null Null Y Manner z5 z3 z4 z1 z2 Action Detail Modifier Topic
Significance of the Aspect Variables If Topic = “restaurant”, then Action ≠ “turn” Observed Label 1. turn on the radio 2. GPS navigation Unobserved Label turn on GPS • Predicting better aspects implies predicting better class labels • Adding constraints to the aspect space • Predicting previously unobserved labels
Outline • Car Command Text Categorization Task • Data and aspects • Unobserved labels • Constrained Conditional Model (CCM) • Aspects variables to introduce constraints • Objective function • Training and Inference • Experimental Results • Standard multiclass classification setting • Predicting Unobserved Labels • Conclusion
Adding Constraints by Hidden Aspects x3 x4 x1 x5 x2 x7 x6 y1 Y Intuition: introduce structure on hidden variables X
Adding Constraints Through Hidden Aspects Z2 Z1 Z3 Z4 x3 x4 x1 x5 x2 x7 x6 y1 Y Z5 Use constraints to capture the dependencies X
Penalty for violating the constraint. Weight Vector for “local” learners How far away is y from a “legal” assignment Aspect functions Objective Function of CCM
Training and Inference • Learning + Inference (L+I) • Ignore constraints during training • Inference Based Training (IBT) • Consider constraints during training • References to CCM (aka ILP formulation) • Roth&Yih04, Has been shown useful in the context of many NLP problems: SRL, Summarization; Co-reference; Information Extraction; Transliteration • 07; Punyakanok et.al 05,08; Chang et.al 07,08; Clarke&Lapata06,07; Denise&Baldrige07;Goldwasser&Roth'08; Martin,Smith&Xing'09
Z2 Z3 Z1 Z4 Z5 x3 x4 x1 x5 x2 x7 x6 Learning + Inference Learning + Inference (L+I) Learn models independently f1(x) f2(x) f3(x) Y f4(x) f5(x) X
True Global Labeling Y -1 1 -1 -1 1 Local Predictions Apply Constraints: Y’ -1 1 -1 1 1 x3 x4 x1 x5 x2 x7 x6 Y’ -1 1 1 1 1 Inference Based Training Example: Perceptron-based Global Learning f1(x) X f2(x) f3(x) Y f4(x) f5(x)
Outline • Car Command Text Categorization Task • Data and aspects • Unobserved labels • Constrained Conditional Model (CCM) • Aspects variables to introduce constraints • Objective function • Training and Inference • Experimental Results • Standard multiclass classification setting • Predicting Unobserved Labels • Conclusion
Evaluation Metrics • Standard Accuracy • The percentage of correctly labeled examples • Weighted Aspect-based Metric (WAM) • A weighted Hamming distance computed at the aspect level
Experiments and Evaluation • Standard MCC Setting
Experiments and Evaluation • Standard MCC Setting
Experiments and Evaluation • Predicting Unobserved Labels Observed Label 1. turn on the radio 2. GPS navigation Unobserved Label turn on GPS
Conclusion • Summary • Text Categorization with a meaningful, structured label space • A model that exploits the structure by adding hidden aspect variables • Adding constraints and reformulating the task as a structure prediction problem • Predicting unobserved new labels
Thank You! AND Questions?