1 / 9

Formant-based Synthesis of Singing

This legacy source-filter technique generates singing sounds from scratch without prior voice recordings. The synthesizer utilizes rule-based systems and gradually incorporates rules for music performance and singing over time.

Download Presentation

Formant-based Synthesis of Singing

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Formant-based Synthesis of SingingSten Ternström and Johan SundbergKTH Music Acoustics, Speech Music and Hearing, Stockholm • This is the legacy source-filter technique, with some minor updates • The sound is generated from scratch- no prior recordings of voices • The synthesizer is driven by the same rule systemplatform as the text-to-speech systems pioneered by Carlson & Granström • Rules for music performance and singinghave been added gradually over decades

  2. Rules in RULSYS syntax Flat text transcription of the Score Description of the Singer Rule system parameters Layout  audio output Synthesis Engine •  interactive control

  3. Rules Singer definitions Input data Score

  4. Parameter file L E TMI SI NG 28 parameters100 frames/sec

  5. Synthesizer highlights • source waveform: a train of sinc pulses, filtered to measure • here: spectrum slope covaries with source amplitude • adjustable L0, cutoff, vibrato and flutter • 8 formants in cascade, F6-F8 fixed • no source-filter interaction • fricative branch with two formant filters • no nasal branch • sample rate 16 kHz • runs on DSP hardware, 32-bit floating point

  6. Synthesizer relative level of the fundamental fundamental frequency vibrato extent vibrato frequency flutter extent flutter center frequency flutter bandwidth vocal intensity spectrum slope glottal amplitude delta-L0 high cutoff T0 gain Sinc pulsegenerator HP @ 25 Hz DC blocker Variable slope filter Notchfilter LP filter-24 dB/oct aspiration Fn, Bn Noise Formant chain F1...F8 Zero 1.8 kHz HP2 + + output frication Fricative filters K1, K2

  7. LTAS Bass, entire verse Soprano, entire verse

  8. Performance

  9. The End

More Related