1 / 12

Voice quality and F0 cues for affect expression

Voice quality and F0 cues for affect expression. By I. Yanushevskaya , C. Gobl and N. Chasaide. Outline. Introduction Synthetic stimuli Experiment setup Result Conclusion. Introduction. F0 cues are crucial for emotional speech What about Voice Quality ? Base on previous works:

sonora
Download Presentation

Voice quality and F0 cues for affect expression

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Voice quality and F0 cues for affect expression By I. Yanushevskaya, C. Gobl and N. Chasaide

  2. Outline • Introduction • Synthetic stimuli • Experiment setup • Result • Conclusion

  3. Introduction • F0 cues are crucial for emotional speech • What about Voice Quality? • Base on previous works: • Adding voice quality cues enhance speech synthesis • Several voice quality stimuli have similar result: • Tense ~= Harsh • Breathy ~= whisper • Varying voice quality can influence listener’s judgment • Want to know the effect of varying voice quality only.

  4. Synthetic stimuli • 15 synthetic stimuli: Jaadjö (Hello Goodbye) • KLSYN88 as formant synthesizer • 3 groups stimuli: “VQ”, “F0”, “VQ+F0”

  5. KLSYN88

  6. VQ only stimuli • Modal, breathy, whispery, lax-creaky, tense stimuli • Omit harsh, creaky included in previous work • Modal: Copy the natural utterance to KLSYN88 • Breathy: lower AV, higher OQ, lower SQ, higher TL, wider B1 • Whispery: Aspiration noise • Lax-creaky: Creaky+Breathy-Whispery • Tense: lower OQ, higher SQ, lower TL, narrower B1 higher F0 • NOT normalized with F0

  7. F0 only stimuli

  8. VQ+F0 stimuli Are these good pairs? We’ll see….

  9. Experiment setup • 20 native speakers • 10 of 15 stimuli presented • Response a pair of opposite affective attribute • sad-happy • Intimate-formal • Relaxed-stressed • Bored-interested • Apologetic-indignant • Fearless-scared • ANOVA

  10. Result

  11. Conclusion • Showed that some voice quality is more related than other in some emotions. • X Intimacy, sadness -> breathy • O -> lax-creaky • Voice quality is averagely better than F0 cues on speech synthesis • Maybe because the voice quality already includes the information of F0

  12. Thanks for your attention

More Related