speech recognition n.
Download
Skip this Video
Loading SlideShow in 5 Seconds..
Speech Recognition PowerPoint Presentation
Download Presentation
Speech Recognition

Loading in 2 Seconds...

play fullscreen
1 / 18

Speech Recognition - PowerPoint PPT Presentation


  • 266 Views
  • Uploaded on

Speech Recognition. Introduction. What is Speech Recognition? - Voice Recognition? Where can it be used? - Dictation - System control/navigation - Commercial/Industrial applications - Hand held digital recorders. Contents:. Continuous/Discrete

loader
I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
capcha
Download Presentation

Speech Recognition


An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
    Presentation Transcript
    1. Speech Recognition

    2. Introduction • What is Speech Recognition? - Voice Recognition? • Where can it be used? - Dictation - System control/navigation - Commercial/Industrial applications - Hand held digital recorders

    3. Contents: • Continuous/Discrete • How does it work? • Recent improvements • Current software options • Future of SR

    4. Continuous or Discrete? • Continuous speech - dictation • Discrete speech - system controls

    5. How does SR work? • Recognition • Training • Correction • Command/Control

    6. Recognition (1) Voice Input Analog to Digital Acoustic Model Language Model Feedback Display Speech Engine

    7. Recognition (2) Acoustic Modeling • Spoken words: “I think there are…..” • Phonemes: ‘ ay th-in-nk-kd dh-eh-r aa-r’ • H.M.M.’s: 5 state representation • Speech Engine

    8. Recognition (3) Language Modeling • Word context • Word frequency • Transition possibilities

    9. Voice Training (1) Can be done by: • Predetermined text segments • Individual words Compare new acoustic with old and combines • More training = better recognition

    10. Voice Training (2) User specific Voice file • Voice qualities • Pronunciation • Patterns of word use • Preferred vocabulary

    11. Making Corrections • Move cursor by voice command • Memorize edit commands • List of possible alternatives • Make correction manually

    12. Command/Control • Desktop grid • Program or Link name/number • URL name • Memorized commands

    13. Recent Improvements in SR • Faster training ~10 min. • Better recognition ~95% • More compatible software • Better system control/command

    14. Current Software Options for PC • Dragon Systems – Naturally Speaking • Philips – FreeSpeech • IBM – ViaVoice • Lernout & Hauspie – Voice Xpress

    15. How well do the work?

    16. Future of SR • SUI – Speech-based User Interface • Improvements needed: - Greater accuracy - Greater system control/command - More compatible software

    17. Conclusion • SR Uses • How does it work? • Current Software • Problems of SR • More SR coming soon….

    18. References • 1. Alwang, Greg. “Speech Recognition,” PC Magazine, December 1 1999 • 2. Hauptmann, Alexander G. Jang, Photina Jaeyun. Carnegie Mellon University. “Learning to Recognize Speech by Watching Television,” IEEE Intelligent Systems, September/October 1999. • 3. Miastkowski, Stan. “Latest Speech Software Gets You Up and Running Faster,” PC World, November 1999.