1 / 9

Question Classification using Latent Focus Words

Question Classification using Latent Focus Words. Presented by: Anand Iyer Ritvik Mudur. Motivation. Motivation. Question Answering systems are slowly becoming more prevalent Web search Yahoo answers Forums Question classification is an integral part of any QA system

scorpio
Download Presentation

Question Classification using Latent Focus Words

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Question Classification using Latent Focus Words Presented by: Anand Iyer Ritvik Mudur

  2. Motivation

  3. Motivation • Question Answering systems are slowly becoming more prevalent • Web search • Yahoo answers • Forums • Question classification is an integral part of any QA system • Classify each question into a pre-defined set of classes • Fine and coarse • Limits the range of data/corpora to be searched • Makes the problem tractable (in many cases)

  4. Question Focus Words (QFWs) • Word (or phrase) indicating the main focus of the question • Excluding the interrogative words • Example • HUM:ind - Who was the first person to reach the North Pole ? • Focus word: person • Commonly found using heuristics • Templates: • ‘What|Whichtype|kind|relativeof’ • Head word of first NP/VP in the question • Can also “learn” the focus word with a set of labeled “gold” QFWs

  5. Latent QFWs • We could also consider the focus words as “latent” or hidden before training • Use a gradient-descent approach to find the focus words that result in the most confident (and accurate) classifications on our training set

  6. Results • Compared to a baseline system that used only heuristics for QFW selection • Used a perceptron classifier • Begin with a single random initialization • Begin with an ensemble of 20 random initializations • Using a voting system in the first iteration

  7. Conclusion • Comparable results • Many more avenues to explore • Different training approaches • Multiple QFWs • Space of latent QFWs • Hybrid approach (heuristics + latent focus words) • Exploits ambiguity • Scales easier • To more specific classes (new heuristics can be tedious to introduce) • To more data (do not require “gold” focus words)

  8. Questions

  9. Acknowledgements THANK YOU Mihai Surdeanu for your guidance and support

More Related