1 / 18

XV Pacific Voice Conference PVSF-PIXAR 3-11-05

urbano
Download Presentation

XV Pacific Voice Conference PVSF-PIXAR 3-11-05

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Physical modeling of speech and voice quality XV Pacific Voice ConferencePVSF-PIXAR3-11-05 Brad Story Dept. of Speech, Language and Hearing Sciences University of Arizona Tucson, AZ

  2. Physical model of human sound production: A mathematical representation of the physical processes that produce the sounds of speech and song.

  3. Filter: Air spaces created by the trachea, pharynx, oral cavity, nasal passages Trachea Primary Source: Vibration of the vocal folds creates a time-varying airflow (glottal flow). Sound Production = Combination of Sound Sources and Filters Nasal tract

  4. Source Model: vocal fold vibration, glottal area, glottal flow, etc. F0, intensity, breathy/pressed, registers, biphonation, etc. Filter Model: acoustic wave propagation through the air spaces of the trachea, vocal and nasal tracts Formant frequencies (F1, F2, F3, …) Time variation of vocal tract shape Time variation of glottal parameters speech/song/other

  5. Sagittal view Tubular approximation vocal folds output pressure Source: glottal area/flow Low F0 Med F0 High F0

  6. Output sound pressure F3 Spectrogram F2 F1 F0 contour F0

  7. Change the vocal tract shape to produce speech source filter output

  8. Output sound pressure F3 Spectrogram F2 F1 F0 contour F0

  9. F2 F1

  10. Modeling Voice Quality What is voice quality? “Voice quality is conceived here in a broad sense, as the characteristic auditory coloring of an individual speaker’s voice, and not in the more narrow sense of the quality deriving solely from laryngeal activity” From “The Phonetic Description of Voice Quality” by J. Laver (1980)

  11. Auditory coloring of an individual speaker’s voice results from: Laryngeal structure & settings Vocal Tract Structure & settings Respiratory system structure & settings Temporal control

  12. Voice Quality Changes based on vocal tract modifications • Longitudinal: • Modification of vocal tract length • 2. Latitudinal: • Tendencies to maintain a particular constrictive (or expansive) effect on the vocal tract shape.

  13. Modification of vocal tract length • (+ temporal modification) L= 17.5 cm

  14. L= 23 cm L= 11 cm 60% 80% 40% 20% • Modification of vocal tract length • (+ temporal and F0 modification) L= 17.5 cm

  15. 2. Tendencies to maintain a particular constrictive (or expansive) effect on the vocal tract shape. (+ F0 contour modification) “palatized” “pharyngealized”

  16. Beyond human voice quality… F0 = 30-50 Hz Vocal tract length = 31 cm F0 = 600-1000 Hz Vocal tract length = 4.4 cm

  17. Singing…?? Singer’s formant

  18. The End This work was supported by NIH R01-DC04789

More Related