1 / 6

Voice Recognition

Voice Recognition. Math 5 Professor Barnett Timothy G. McManus Anthony P. Pastoors. What is Voice Recognition.

dragon
Download Presentation

Voice Recognition

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Voice Recognition Math 5 Professor Barnett Timothy G. McManus Anthony P. Pastoors

  2. What is Voice Recognition • Voice recognition is "the technology by which sounds, words or phrases spoken by humans are converted into electrical signals, and these signals are transformed into coding patterns to which meaning has been assigned" • Concept could more generally be called "sound recognition", we focus here on the human voice because we most often and most naturally use our voices to communicate our ideas to others in our immediate surroundings.

  3. How does it work? “Template matching" • simplest technique and has the highest accuracy when used properly, but it also suffers from the most limitations. • The electrical signal from the microphone is digitized by an "analog-to-digital (A/D) converter", and is stored in memory. • To determine the "meaning" of this voice input, the computer attempts to match the input with a digitized voice sample, or template, that has a known meaning. • The program contains the input template, and attempts to match this template with the actual input using a simple conditional statement. “Feature analysis" • “Speaker-independent" voice recognition. Does not need to find an exact or near-exact match between the actual voice input and a previously stored voice template • Processes the voice input using "Fourier transforms" or "linear predictive coding (LPC)" • Attempts to find characteristic similarities between the expected inputs and the actual digitized voice input. • These similarities will be present for a wide range of speakers, and so the system need not be trained by each new user.

  4. Practical Applications • Military • Police • People with Disabilities • Health Care • Vehicle use • Security

  5. Downside/ Difficulty • The difficulty in using voice as an input to a computer simulation lies in the fundamental differences between human speech and the more traditional forms of computer input. While computer programs are commonly designed to produce a precise and well-defined response upon receiving the proper input, the human voice and spoken words are anything but precise. Each human voice is different, and identical words can have different meanings if spoken with different inflections or in different contexts.

More Related