From last time …. Grammar. Recognized Words “zero” “three” “two”. Cepstrum. Probabilities “z” -0.81 “th” = 0.15 “t” = 0.03. Decoder. Signal Processing. Probability Estimator. ASR System Architecture. Speech Signal. Pronunciation Lexicon. A Few Points about Human Speech Recognition.
Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.
Probabilities“z” -0.81“th” = 0.15“t” = 0.03
ASR System Architecture
(See Chapter 18 for much more on this)
Word error rates for 5000 word Wall Street Journal
read speech task using additive automotive noise
(old numbers – ASR would be a bit better now)