A STUDY ON SPEECH RECOGNITION USING DYNAMIC TIME WARPING

A STUDY ON SPEECH RECOGNITION USING DYNAMIC TIME WARPING CS 525 : Project Presentation PALDEN LAMA and MOUNIKA NAMBURU

Goals • Learn how it works ! • Focus: • Pre-Processing • Dynamic Time Warping/Dynamic Programming • Verify using MATLAB • Build a simple Voice to Text Converter application.

How does it work? Record Extract a voice Feature Vectors Digitized Speech Signal (.wave file) Acoustic Preprocessing (DFT + MFCC) Speech Recognizer (Dynamic Time Warping)

Speech signal A time signal of vowel /a:/ (fs=11 kHz, length=100ms) • Voiced Excitation  fundamental frequency (Speaker dependent) • Loudness  signal amplitude • Vocal tract shape  spectral shaping (most important to recognize words) time

ACOUSTIC PRE-PROCESSING Log power spectrum of vowel /a:/ (fs=11 kHz, N=512) • DFT (Discrete Fourier Transform)  Spectral Coeff. • Inverse DFT on log power spectrum  CepstralCoeff. • Makes it easier to extract spectral shaping of the speech signal. frequency Power spectrum of the vowel /a:/ after cepstral smoothing

MFCC (Mel frequency cepstral coefficients) • Mel frequency scale reflects frequency resolution of human ear. • Coeff. Of power spectrum  Mel Spectral Coeff. (FEATURE VECTOR)

RECOGNIZER • One word spoken contains dozens of feature vectors. (preprocessing every 10 ms of signal) • Compute a ”distance” between this unknown sequence of vectors (unknown word) and known sequence of vectors (prototypes of words to recognize) • PROBLEM !! Unequal length of vector sequence

Dynamic time warping : Find optimal assignment path

DTW : Recognizing connected words

MATLAB FUNCTIONS PRE-PROCESSING • recordMelMatrix(3) • S = wavread(“speech.wav”) • C = Melfiltermatrix(S, N, K) • computeMelSpectrum( C,S); DISPLAY FEATURES • Featuredisp.m WORD RECOGNITION • dp_asym(vector1, vector2)

Results hello hello1

hello library

hello computer

3.0304e+003 3.5820e+003 3.4499e+003

Welcome home (male) Welcome home (female)

Welcome home Welcome back

Welcome home Computer Science

Welcome back Computer Science

2.6418e+003 2.9468e+003 3.8109e+003 4.6701e+003

THANKS ! • ANY QUESTIONS?

A STUDY ON SPEECH RECOGNITION USING DYNAMIC TIME WARPING

A STUDY ON SPEECH RECOGNITION USING DYNAMIC TIME WARPING

Presentation Transcript

Time Series and Dynamic Time Warping

Parallelizing Dynamic Time Warping

Using Speech Recognition for Speech Therapy

Using Speech Recognition

Speech recognition using HMM

Using Dynamic Time Warping for Sleep and Wake Discrimination

Instruction Set Extension for Dynamic Time Warping

Dynamic Time Warping for Automated Cell Cycle Labelling

ADHD indicators modelling based on Dynamic Time Warping from RGB data: A feasibility study

Technical Seminar presentation on Speech Recognition using DWT

Exact indexing of Dynamic Time Warping

Dynamic Time Warping Applications and Derivation

Exact Indexing of Dynamic Time Warping

Keyword Spotting Dynamic Time Warping

Dynamic Time Warping

DYNAMIC TIME WARPING IN KEY WORD SPOTTING

A Study on Detection Based Automatic Speech Recognition

Real-Time Speech Recognition

D ynamic Time Warping and Minimum Distance Paths for Speech Recognition

A Game Based on Speech Recognition

Dynamic Time Warping (DTW)