
Sentence Unit Detection in Conversational Dialog Speech

Elizabeth Lingg, Tejaswi Tennetti, Anand Madhavan


Presentation Transcript


  1. Sentence Unit Detection in Conversational Dialog Speech. Elizabeth Lingg, Tejaswi Tennetti, Anand Madhavan. Sentence units: questions, back channels, statements. (To add: graphic of sound wave file; text transcription of interspersed speaker words without . , ?)

  2. Data Used. LDC2009T01: annotated metadata; Fisher data; Switchboard corpus; POS tags; disfluencies marked.

  3. Prediction results. Final results of predictions with the best features chosen.

  4. Effect of POS tags. Many graphs showing 1-pre-gram, 2-pre-gram, 1-post-gram, 2-post-gram, and all of them together. Vary with cross-validation bins used? Vary with many classifiers. The above on a per-sentence-unit-type basis.
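The slide does not define its "pre-gram" and "post-gram" features. As a minimal sketch, assume that a 1-pre-gram is the POS tag of the word just before a candidate boundary, a 2-pre-gram is the pair of tags before it, and the post-grams are the same on the following side. The function name, the feature keys, and the `<pad>` tag are hypothetical.

```python
from typing import Dict, List

PAD = "<pad>"  # hypothetical tag for positions outside the utterance


def pos_context_features(tags: List[str], i: int) -> Dict[str, str]:
    """POS context around the candidate boundary that follows word i.

    Assumed reading of the slide: "pre" covers tags up to and including
    word i, "post" covers the tags of the words that follow it.
    """
    def tag_at(j: int) -> str:
        return tags[j] if 0 <= j < len(tags) else PAD

    return {
        "pre1": tag_at(i),
        "pre2": tag_at(i - 1) + "_" + tag_at(i),
        "post1": tag_at(i + 1),
        "post2": tag_at(i + 1) + "_" + tag_at(i + 2),
    }


# Illustrative Penn Treebank-style tags for a five-word stretch of speech.
tags = ["PRP", "VBP", "UH", "PRP", "VBD"]
features = pos_context_features(tags, 2)
```

Each key can be switched on or off independently, which is one way to compare the four feature types separately and "all of them together".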

  5. Effect of special words for backchannel identification. Club words like ‘mhm’, ‘oh yeah’, etc. into a separate class and see whether it helps predict backchannels better. Effects on other sentence units.
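One way to club such words into a separate class is to replace each cue phrase with a single class token before feature extraction. The sketch below assumes exactly that. Only ‘mhm’ and ‘oh yeah’ come from the slide; the other cue, the `<BC>` token, and the function name are hypothetical.

```python
from typing import List

# Cue phrases as word tuples. 'mhm' and 'oh yeah' are from the slide;
# 'uh huh' is an illustrative addition.
BACKCHANNEL_CUES = {("mhm",), ("oh", "yeah"), ("uh", "huh")}
MAX_CUE_LEN = max(len(cue) for cue in BACKCHANNEL_CUES)


def club_backchannel_words(words: List[str], token: str = "<BC>") -> List[str]:
    """Replace each cue phrase with one class token, longest match first."""
    out: List[str] = []
    i = 0
    while i < len(words):
        for n in range(MAX_CUE_LEN, 0, -1):
            chunk = tuple(w.lower() for w in words[i:i + n])
            if len(chunk) == n and chunk in BACKCHANNEL_CUES:
                out.append(token)
                i += n
                break
        else:
            # No cue starts at this position; keep the word unchanged.
            out.append(words[i])
            i += 1
    return out


clubbed = club_backchannel_words(["Oh", "yeah", "i", "went", "mhm"])
```

Words outside the cue set pass through untouched, so the effect on the other sentence units can be measured by running the same classifier with and without this step.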

  6. Miscellaneous. Previous sentence class prediction (faked as well as true). Length of sentence so far, or number of words so far (that have not been classified yet).
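Both features on this slide can be computed in a single pass over the words. The sketch below assumes one label per word: `None` when no sentence unit ends after that word, otherwise the class of the unit that ends there. The labels may be gold annotations or earlier predictions. The input encoding, the `<start>` placeholder, and the function name are hypothetical.

```python
from typing import Dict, List, Optional, Union


def running_features(
    boundary_labels: List[Optional[str]],
) -> List[Dict[str, Union[int, str]]]:
    """Per-word count of unclassified words and class of the previous unit."""
    feats: List[Dict[str, Union[int, str]]] = []
    words_so_far = 0
    prev_class = "<start>"  # hypothetical value before any unit is closed
    for label in boundary_labels:
        words_so_far += 1
        feats.append({"words_so_far": words_so_far, "prev_class": prev_class})
        if label is not None:
            # A unit ends here: remember its class and restart the count.
            prev_class = label
            words_so_far = 0
    return feats


labels = [None, None, "statement", "backchannel", None, "question"]
feats = running_features(labels)
```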

  7. Prosodic features. F0; F0 normalized; pause duration for speaker; energy; energy normalized; length of word; pause length before word; word pitch range.
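The slide lists normalized F0 and pause length before a word without saying how they are computed. As a rough sketch, assume F0 is z-scored per speaker and that pauses are taken from word start and end times in seconds. The normalization scheme, the input shapes, and both function names are assumptions, not the presenters' stated method.

```python
from statistics import mean, pstdev
from typing import Dict, List, Tuple


def normalize_f0_by_speaker(
    f0_by_speaker: Dict[str, List[float]],
) -> Dict[str, List[float]]:
    """Z-score each speaker's F0 values against that speaker's own statistics."""
    normalized: Dict[str, List[float]] = {}
    for speaker, values in f0_by_speaker.items():
        mu = mean(values)
        sigma = pstdev(values) or 1.0  # avoid dividing by zero for flat pitch
        normalized[speaker] = [(v - mu) / sigma for v in values]
    return normalized


def pauses_before_words(word_times: List[Tuple[float, float]]) -> List[float]:
    """Silence before each word, given (start, end) times of one speaker."""
    if not word_times:
        return []
    pauses = [0.0]  # the first word has no preceding word
    for (_, prev_end), (start, _) in zip(word_times, word_times[1:]):
        pauses.append(max(0.0, start - prev_end))
    return pauses


norm = normalize_f0_by_speaker({"A": [100.0, 200.0], "B": [180.0, 180.0]})
pauses = pauses_before_words([(0.0, 0.5), (0.75, 1.0), (1.0, 1.5)])
```

The same z-scoring could be applied to energy; it is shown for F0 only to keep the example short.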

  8. Prosodic features (continued). N-gram prosodic features.

  9. References. Yang Liu, Elizabeth Shriberg, Andreas Stolcke, Dustin Hillard, Mari Ostendorf, and Mary Harper, "Enriching Speech Recognition With Automatic Detection of Sentence Boundaries and Disfluencies" ...
