A Study on Detection Based Automatic Speech Recognition - PowerPoint PPT Presentation

A study on detection based automatic speech recognition l.jpg
Download
1 / 13

  • 507 Views
  • Uploaded on
  • Presentation posted in: Pets / Animals

A Study on Detection Based Automatic Speech Recognition. Author : Chengyuan Ma Yu Tsao Professor: 陳嘉平 Reporter : 許峰閤. Outline. Introduction Word detector design Hypotheses combination Experiment. Introduction.

I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.

Download Presentation

A Study on Detection Based Automatic Speech Recognition

An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -

Presentation Transcript


A study on detection based automatic speech recognition l.jpg

A Study on Detection Based Automatic Speech Recognition

Author : Chengyuan Ma

Yu Tsao

Professor:陳嘉平

Reporter :許峰閤


Outline l.jpg

Outline

  • Introduction

  • Word detector design

  • Hypotheses combination

  • Experiment


Introduction l.jpg

Introduction

  • The current ASR system is top-down and this is a bottom-up system.

  • It include:

    1.word detector.

    2.word hypothesis verification and false

    alarm pruning.

    3.Hypothesis combination.


Word detector design l.jpg

Word detector design

  • We have separate detector for each lexical item in the vocabulary.

  • HMM model are used for detector design.

  • The key issue is how to choose an appropriate grammer network.


Word detector design5 l.jpg

Word detector design


Word verification and pruning l.jpg

Word verification and pruning


Word verification and pruning7 l.jpg

Word verification and pruning

  • It’s obvious that these detectors generate a lot of false alarms.

  • Here are three pruning strategies will be presented.


Word verification and pruning8 l.jpg

Word verification and pruning

  • Temporal information based pruning:

    For example, the duration of the word “one” should be greater than 150 ms.

  • Attributes model based pruning:

    Each word has its own attribute sequence pattern.

  • Signal based pruning:

    Signal feature based pruning.

    For example, we know the energy of a nasalsound is often concentrated on the low frequency region.


Hypotheses combination l.jpg

Hypotheses combination

  • We investigate hypothesis combination strategies using outputs from all detectors to generate a word string.

  • The weighted directed graph is one of the methods that can be used to combine the detector output into a digit string.


Hypotheses combination11 l.jpg

Hypotheses combination

  • Each node in the graph is a detected digit boundary.

  • The number in the node is the time stamp.

  • The number beside each edge is the frame average log-likelihood.

  • We can use the Dijkstra’s algorithm to find the shortest path.


Experiment l.jpg

Experiment

  • Conduct on the TIDIGITS corpus.

  • Digit vocabulary is made of 11 digits, one to nine, plus oh and zero.

  • 12-dimensional MFCC is used for frond-end processing.


Experiment13 l.jpg

Experiment


  • Login