Classification of Discourse Functions of Affirmative Words in Spoken Dialogue

INTERSPEECH, Antwerp, August 2007 Classification of Discourse Functions of Affirmative Words in Spoken Dialogue Agustín Gravano, Stefan Benus, Julia Hirschberg Shira Mitchell, Ilia Vovsha Spoken Language Processing Group Columbia University

Cue Words • Ambiguous linguistic expressions used for • Making a semantic contribution, or • Conveying a pragmatic function. Examples: now, well, so, alright, and, okay, first, by the way, on the other hand. • Single affirmative cue words • Examples: alright, okay, mm-hm, right, uh-huh, yes. • May be used to convey acknowledgment or agreement, to change topic, to backchannel, etc. Agustín Gravano INTERSPEECH 2007

Research Goals • Learn which features best characterize the different functions of single affirmative cue words. • Determine how these can be identified automatically. • Important in Spoken Dialogue Systems: • Understand user input. • Produce output appropriately. Agustín Gravano INTERSPEECH 2007

Previous Work • Classification of cue words into discourse vs. sentential use. • Hirschberg & Litman ’87, ’93; Litman ’94; Heeman, Byron & Allen ’98; Zufferey & Popescu-Belis ’04. • In our corpus: • right: 15% discourse, 85% sentential. • All other affirmative cue words: 99% disc., 1% sent. • Discourse vs. sentential distinction insufficient. Need to define new classification tasks. Agustín Gravano INTERSPEECH 2007

Talk Overview • Columbia Games Corpus • Classification tasks • Experimental features • Results Agustín Gravano INTERSPEECH 2007

The Columbia Games Corpus • 12 spontaneous task-oriented dyadic conversations in Standard American English. • 2 subjects playing computer games; no eye contact. Agustín Gravano INTERSPEECH 2007

The Columbia Games CorpusFunction of Affirmative Cue Words Cue Words • alright • gotcha • huh • mm-hm • okay • right • uh-huh • yeah • yep • yes • yup Functions • Acknowledgment / Agreement • Backchannel • Cue beginning discourse segment • Cue ending discourse segment • Check with the interlocutor • Stall / Filler • Back from a task • Literal modifier • Pivot beginning: Ack/Agree + Cue begin • Pivot ending: Ack/Agree + Cue end 7.9% of the words in our corpus Agustín Gravano INTERSPEECH 2007

The Columbia Games CorpusFunction of Affirmative Cue Words • Literal Modifier that’s pretty much okay • Backchannel Speaker 1: between the yellow mermaid and the whaleSpeaker 2:okaySpeaker 1: and it is • Cue beginning discourse segment okay we gonna be placing the blue moon Agustín Gravano INTERSPEECH 2007

The Columbia Games CorpusFunction of Affirmative Cue Words • 3 trained labelers • Inter-labeler agreement: • Fleiss’ Kappa = 0.69 (Fleiss ’71) • In this study we use the majority label for each affirmative cue word. • Majority label: label chosen by at least two of the three labelers. Agustín Gravano INTERSPEECH 2007

MethodTwo new classification tasks • Identification of a discourse segment boundary function • Segment beginning vs. Segment end vs. No discourse segment boundary function • Identification of an acknowledgment function • Acknowledgment vs. No acknowledgment Agustín Gravano INTERSPEECH 2007

MethodMachine Learning Experiments • ML Algorithm • JRip: Weka’s implementation of the propositional rule learner Ripper (Cohen ’95). • We also tried J4.8, Weka’s implementation of the decision tree learner C4.5 (Quinlan ’93, ’96), with similar results. • 10-fold cross validation in all experiments. Agustín Gravano INTERSPEECH 2007

MethodExperimental features • IPU (Inter-pausal unit) • Maximal sequence of words delimited by pause > 50ms. • Conversational Turn • Maximal sequence of IPUs by the same speaker, with no contribution from the other speaker. Agustín Gravano INTERSPEECH 2007

MethodExperimental features • Text-based features • Extracted from the text transcriptions. • Lexical id; POS tags; position of word in IPU / turn; etc. • Timing features • Extracted from the time alignment of the transcriptions. • Word / IPU / turn duration; amount of overlap; etc. • Acoustic features • {min, mean, max, stdev} x {pitch, intensity} • Slope of pitch, stylized pitch, and intensity, over the whole word, and over its last 100, 200, 300ms. • Acoustic features from the end of the other speaker’s previous turn. Agustín Gravano INTERSPEECH 2007

ResultsDiscourse segment boundary function (1) Majority class baseline: NO BOUNDARY. (2) Calculated wrt each labeler’s agreement with the majority labels. Agustín Gravano INTERSPEECH 2007

ResultsAcknowledgment function (1) Baseline based on lexical identity: {huh, right } no ACK all other words ACK (2) Calculated wrt each labeler’s agreement with the majority labels. Agustín Gravano INTERSPEECH 2007

Best-performing features Agustín Gravano INTERSPEECH 2007

ResultsClassification of individual words • Classification of each individual word into its most common functions. • alright Ack/Agree, Cue Begin, Other • mm-hm  Ack/Agree, Backchannel • okay  Ack/Agree, Backchannel, Cue Begin, Ack+CueBegin, Ack+CueEnd, Other • right  Ack/Agree, Check, Literal Modifier • yeah  Ack/Agree, Backchannel Agustín Gravano INTERSPEECH 2007

ResultsClassification of the word ‘okay’ (1) Majority class baseline: ACK/AGREE. (2) Calculated wrt each labeler’s agreement with the majority labels. Agustín Gravano INTERSPEECH 2007

Summary • Discourse/sentential distinction is insufficient for affirmative cue words in spoken dialogue. • Two new classification tasks: • Detection of an acknowledgment function. • Detection of a discourse boundary function. • Best performing ML models: • Based on textual and timing features. • Slight improvement when using acoustic features. Agustín Gravano INTERSPEECH 2007

Further Work • Gravano et al, 2007 On the role of context and prosody in the interpretation of ‘okay’.ACL 2007, Prague, Czech Republic, June 2007. • Benus et al, 2007 The prosody of backchannels in American English. ICPhS 2007, Saarbrücken, Germany, August 2007. Agustín Gravano INTERSPEECH 2007

INTERSPEECH, Antwerp, August 2007 Classification of Discourse Functions of Affirmative Words in Spoken Dialogue Agustín Gravano, Stefan Benus, Julia Hirschberg Shira Mitchell, Ilia Vovsha Spoken Language Processing Group Columbia University

Agustín Gravano INTERSPEECH 2007

Classification of Discourse Functions of Affirmative Words in Spoken Dialogue

Classification of Discourse Functions of Affirmative Words in Spoken Dialogue

Presentation Transcript

Spoken Dialogue Systems

Applications of Discourse Structure for Spoken Dialogue Systems

Overview of Issues in Discourse and Dialogue

Theories of Discourse and Dialogue

Classification of Functions

Spoken Discourse

Spoken Dialogue Systems

The language of Spoken Discourse:

Discourse Functions

Discourse and Dialogue Processing in Spoken Intelligent Tutoring Systems

Spoken Dialogue Technology

Discourse Annotation for Improving Spoken Dialogue Systems

Spoken Dialogue Systems

Discourse Annotation for Improving Spoken Dialogue Systems

Spoken Dialogue Systems

Components of Spoken Dialogue Systems

Spoken Dialogue Systems

Spoken Dialogue Systems

Spoken Dialogue Systems

Spoken Dialogue Systems