
5526 Speech Recognition



Presentation Transcript


  1. 5526 Speech Recognition • Application of Sphinx-4 • Yuan Hao

  2. SPHINX-4 • Providing a more flexible framework for research in speech recognition • Written entirely in the Java programming language

  3. ZipCity • A simple application for Sphinx-4. • ZipCity listens for a zip code and shows the location associated with it.

  4. ZipCity • What should we do if we don’t know the zip code, but we know the name of the city? Modify ZipCity!

  5. Things we should modify • ZipCity.config.xml • This file specifies the acoustic model and dictionary in use. As shipped, it can only recognize digits. • ZipCity.gram • ZipRecognizer.java • ZipDatabase.java

  6. ZipCity.config.xml • Change the dictionary path. • <property name="dictionaryPath" value="resource:/WSJ_8gau_13dCep_16k_40mel_130Hz_6800Hz/dict/cmudict.0.6d"/> • Change the filler path. • <property name="fillerPath" value="resource:/WSJ_8gau_13dCep_16k_40mel_130Hz_6800Hz/dict/fillerdict"/>

  7. ZipCity.config.xml • Change the acoustic model. • <property name="location" value="resource:/WSJ_8gau_13dCep_16k_40mel_130Hz_6800Hz"/> • <property name="modelDefinition" value="etc/WSJ_clean_13dCep_16k_40mel_130Hz_6800Hz.4000.mdef"/> • <property name="dataLocation" value="cd_continuous_8gau/"/>

  8. ZipCity.gram • Adjust the grammar • public <cityname> = <name>; • <name> = New-york | Cocoa | San-Francisco | Chicago | Houston | San-diego | Tallahassee | Titusville | Orlando | Miami | computer ;
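
The grammar rule above is a flat list of alternatives, so it can be produced from a list of city names rather than typed by hand. The following is a minimal sketch, not part of the original ZipCity demo: the class name `GrammarSketch`, the method `buildGrammar`, and the grammar name `zipcity` are all assumptions made for illustration.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not part of Sphinx-4 or the ZipCity demo.
class GrammarSketch {

    // Builds a JSGF grammar whose <name> rule lists every given city name.
    static String buildGrammar(List<String> names) {
        StringBuilder sb = new StringBuilder();
        sb.append("#JSGF V1.0;\n");
        sb.append("grammar zipcity;\n"); // grammar name is an assumption
        sb.append("public <cityname> = <name>;\n");
        sb.append("<name> = ").append(String.join(" | ", names)).append(" ;\n");
        return sb.toString();
    }

    public static void main(String[] args) {
        // A few of the names from the slide, in the same spelling.
        List<String> names = Arrays.asList("New-york", "Cocoa", "San-Francisco", "Chicago");
        System.out.print(buildGrammar(names));
    }
}
```

Generating the rule from the same list that feeds the recognizer map and the database would be one way to keep the three files consistent as cities are added.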

  9. ZipRecognizer.java • Add the city names listed in ZipCity.gram. • The recognizer returns the value stored in digitMap for each recognized word, so each city name must also be added to digitMap. • digitMap.put("chicago", "chicago"); digitMap.put("san-francisco", "san-francisco"); digitMap.put("new-york", "new-york");
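
The mapping step on this slide can be sketched in isolation. This is a simplified stand-in, not the real ZipRecognizer.java: the class `CityWordMap` and its `lookup` method are hypothetical, and only the three `digitMap.put` entries come from the slide. The slides do not say whether the recognizer reports `New-york` (as in the grammar) or `new-york` (as in the map), so the sketch lowercases the word before the lookup as a precaution.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical stand-in for the word map kept in ZipRecognizer.java.
class CityWordMap {
    private final Map<String, String> digitMap = new HashMap<>();

    CityWordMap() {
        // Entries from the slide: each city name maps to itself.
        digitMap.put("chicago", "chicago");
        digitMap.put("san-francisco", "san-francisco");
        digitMap.put("new-york", "new-york");
    }

    // Returns the mapped value, or null when the word has no entry.
    // Lowercasing is an assumption; the demo may not need it.
    String lookup(String recognizedWord) {
        return digitMap.get(recognizedWord.toLowerCase(Locale.ROOT));
    }

    public static void main(String[] args) {
        CityWordMap map = new CityWordMap();
        System.out.println(map.lookup("Chicago")); // prints "chicago"
        System.out.println(map.lookup("Boston"));  // prints "null" (no entry)
    }
}
```

A word that is in the grammar but missing from the map comes back as null here, which is why every name in ZipCity.gram needs a matching entry.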

  10. ZipDatabase.java • Look up city information by city name instead of by zip code. • zipDB.put(city, new ZipInfo(zip, city, state, latitude, longitude));
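
A city-keyed lookup along these lines might look as follows. This is a sketch under stated assumptions, not the demo's ZipDatabase.java: `CityDatabaseSketch`, its nested `ZipInfo` class, and the `add`, `lookup`, and `key` methods are hypothetical, and the zip codes and coordinates are approximate illustrative values. Only the `zipDB.put(city, new ZipInfo(...))` call mirrors the slide.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical, simplified stand-in for ZipDatabase.java and ZipInfo.java.
class CityDatabaseSketch {

    static final class ZipInfo {
        final String zip, city, state;
        final float latitude, longitude;

        ZipInfo(String zip, String city, String state, float latitude, float longitude) {
            this.zip = zip;
            this.city = city;
            this.state = state;
            this.latitude = latitude;
            this.longitude = longitude;
        }
    }

    private final Map<String, ZipInfo> zipDB = new HashMap<>();

    // Normalizes the key so lookups ignore case and surrounding spaces (an assumption).
    private static String key(String city) {
        return city.trim().toLowerCase(Locale.ROOT);
    }

    // Stores the entry under the city name instead of the zip code.
    void add(String zip, String city, String state, float latitude, float longitude) {
        zipDB.put(key(city), new ZipInfo(zip, city, state, latitude, longitude));
    }

    // Returns the entry for the city, or null if the city is unknown.
    ZipInfo lookup(String city) {
        return zipDB.get(key(city));
    }

    public static void main(String[] args) {
        CityDatabaseSketch db = new CityDatabaseSketch();
        // Illustrative, approximate values only.
        db.add("60601", "chicago", "IL", 41.88f, -87.62f);
        db.add("60602", "chicago", "IL", 41.88f, -87.63f);
        ZipInfo info = db.lookup("Chicago");
        System.out.println(info.city + ", " + info.state + " " + info.zip);
    }
}
```

Because a city usually has many zip codes, a plain `put` keyed by city keeps only the last entry loaded for that city. That is acceptable if only the city's location is shown, but a map from city name to a list of entries would be needed to keep every zip code.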

  11. Done! • Let’s see what happens!

  12. Supplement • The grammar should be complete, and so should the dictionary. For now, the grammar contains only 10 city names. • The application can also recognize a word that is not a city name (the grammar includes "computer", for example).
