
g-SEDIK flow chart

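g-SEDIK flow chart. artis.jiang@gmail.com. File structure. Speech enhancement flow. Recorded audio file. Noise estimation (adaptive spatial filtering / adaptive beamforming). Preprocessing: noise estimation + VAD. MCRA (Minima Controlled Recursive Averaging). LTVAD. MS (Minimum Statistics). Speech enhancement / gain function (post filtering). Wiener filter: very sensitive to the noise estimate, less robust. SS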


Presentation Transcript


  1. g-SEDIK flow chart artis.jiang@gmail.com

  2. File structure

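  3. Speech enhancement flow. Recorded audio file. Noise estimation (adaptive spatial filtering / adaptive beamforming). Preprocessing: noise estimation + VAD. MCRA (Minima Controlled Recursive Averaging). LTVAD. MS (Minimum Statistics). Speech enhancement / gain function (post filtering). Wiener filter: very sensitive to the noise estimate, less robust. SS (Spectral Subtraction): more robust, with good results. LSA (Log-Spectral Amplitude). (Maximum likelihood): poorer results. OMLSA. Enhanced file. Best method.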
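
The two simplest gain functions named on slide 3 can be sketched per frequency bin as follows. This is a minimal illustration, not the g-SEDIK code: the function names `ss_gain`, `wiener_gain`, and `enhance_magnitudes` and the `floor_gain` parameter are hypothetical, and the noise power spectrum is assumed to come from a separate estimator such as MCRA or MS. The Wiener gain here uses the instantaneous a priori SNR estimate max(Y/N - 1, 0), which reduces to 1 - N/Y.

```cpp
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <vector>

// Power spectral subtraction gain for one bin (hypothetical helper).
// noisy_psd = |Y(k)|^2, noise_psd = estimated noise power for the same bin.
// floor_gain is an assumed lower limit that keeps some residual noise
// instead of zeroing the bin.
inline float ss_gain(float noisy_psd, float noise_psd, float floor_gain = 0.1f) {
    if (noisy_psd <= 0.0f) return floor_gain;
    const float ratio = 1.0f - noise_psd / noisy_psd;   // estimated |S|^2 / |Y|^2
    return std::max(std::sqrt(std::max(ratio, 0.0f)), floor_gain);
}

// Wiener gain xi / (1 + xi) with xi = max(noisy_psd / noise_psd - 1, 0),
// which simplifies to 1 - noise_psd / noisy_psd (hypothetical helper).
inline float wiener_gain(float noisy_psd, float noise_psd, float floor_gain = 0.1f) {
    if (noisy_psd <= 0.0f) return floor_gain;
    const float ratio = 1.0f - noise_psd / noisy_psd;
    return std::max(std::max(ratio, 0.0f), floor_gain);
}

// Apply the chosen gain to a vector of noisy spectral magnitudes.
// Bins beyond the shorter of the two inputs are left at zero.
std::vector<float> enhance_magnitudes(const std::vector<float>& noisy_mag,
                                      const std::vector<float>& noise_psd,
                                      bool use_wiener) {
    std::vector<float> out(noisy_mag.size(), 0.0f);
    for (std::size_t k = 0; k < noisy_mag.size() && k < noise_psd.size(); ++k) {
        const float noisy_psd = noisy_mag[k] * noisy_mag[k];
        const float g = use_wiener ? wiener_gain(noisy_psd, noise_psd[k])
                                   : ss_gain(noisy_psd, noise_psd[k]);
        out[k] = g * noisy_mag[k];
    }
    return out;
}
```

Both gains depend directly on the ratio of estimated noise power to observed power. With this instantaneous estimate the Wiener gain is the square of the spectral subtraction gain, so an error in the noise estimate moves it further.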

  4. input output function() data PESQ Winsound NB mic. Play Music Mp3/wav BF Recognizer Mp3 player song_search() VAD Mic array [2] Display song title %temp%\ raw.pcm MS SQL %temp%\ result.pcm PESQ pesqio fileIO pesqdsp VAD+filter dsp FFT+math pesqmain Show Score [0..5] Int PESQ_Score

  5. input output function() data Utility function mic_scan(): saves the speech under test as a multi-dimensional array Hmic_ASRDlg.cpp %temp%\ _sedik_ch_1.pcm _sedik_ch_1.txt Winsound NB mic. Short int pSamples[2][128] _sedik_ch_2.txt %temp%\ xplot() Utility function: saves the speech under test to a file Mic array [8] TF_GSC VAD vad_power_mean() %temp%\ result.pcm Float ch_buffer[8][128] BF Recognizer HSR_Start_Offline_Recognizer() Noise canceller hamming FFT Wave2Fea. Waveform[i] Display song title IFFT Play Music Mp3/wav Call Mp3 player song_search() result[] HSR_Recog_result(~) MS SQL
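
Slide 5 names a Hamming window and a `vad_power_mean()` step that operate on 128-sample frames. The sketch below shows one common way such steps are written; it is an assumption about their general shape, not the slide author's implementation. The names `make_hamming`, `frame_mean_power`, and `is_speech` and the `ratio_threshold` parameter are hypothetical.

```cpp
#include <array>
#include <cmath>
#include <cstddef>

constexpr std::size_t kFrameLen = 128;  // frame length taken from the slide's [128] buffers
using VadFrame = std::array<float, kFrameLen>;

// Hamming window: w[n] = 0.54 - 0.46 * cos(2*pi*n / (N - 1)).
VadFrame make_hamming() {
    const double pi = std::acos(-1.0);
    VadFrame w{};
    for (std::size_t n = 0; n < kFrameLen; ++n) {
        w[n] = static_cast<float>(
            0.54 - 0.46 * std::cos(2.0 * pi * static_cast<double>(n) / (kFrameLen - 1)));
    }
    return w;
}

// Mean power of one frame: the average of the squared samples.
float frame_mean_power(const VadFrame& frame) {
    double sum = 0.0;
    for (float s : frame) sum += static_cast<double>(s) * s;
    return static_cast<float>(sum / kFrameLen);
}

// Energy-based voice activity decision (assumed rule): a frame counts as
// speech when its mean power exceeds ratio_threshold times the noise power.
bool is_speech(const VadFrame& frame, float noise_power, float ratio_threshold = 4.0f) {
    return frame_mean_power(frame) > ratio_threshold * noise_power;
}
```

In a pipeline like the one on the slide, the window would be applied to each frame before the FFT, and the speech decision would determine when the noise estimate may be updated.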

  6. Hmic_ASRDlg.cpp::tf_gsc_recog() Mic array R:\temp\ IFFT ch_buffer[8] TF_GSC result[] HSR_Start_Offline_Recognizer() Initial winsock BF VAD Initial G[] Noise canceller Display song title Play music result.pcm Produce MBF weightings by A Recognizer Produce BM by A *.mdb access Call Mp3 player Recv data From MOD5270 Wave2Fea. Waveform[i] hamming MS SQL HSR_Recog_result(~) FFT FBF BM
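
The FBF and BM blocks on slide 6 are the first two stages of a generalized sidelobe canceller. The slide's TF_GSC works in the frequency domain with weights derived from "A"; the sketch below swaps in a much simpler time-domain version that assumes the eight channels are already time-aligned, purely to show what each stage contributes. The names `fixed_beamformer` and `blocking_matrix` are hypothetical, and the adaptive noise canceller stage is omitted.

```cpp
#include <array>
#include <cstddef>

constexpr std::size_t kChannels = 8;   // matches ch_buffer[8][128] on the slides
constexpr std::size_t kFrame = 128;
using Frame = std::array<float, kFrame>;
using MultiFrame = std::array<Frame, kChannels>;

// Fixed beamformer (simplified): average the time-aligned channels.
// The target adds coherently while uncorrelated noise is averaged down.
Frame fixed_beamformer(const MultiFrame& ch) {
    Frame y{};
    for (std::size_t n = 0; n < kFrame; ++n) {
        float sum = 0.0f;
        for (std::size_t m = 0; m < kChannels; ++m) sum += ch[m][n];
        y[n] = sum / static_cast<float>(kChannels);
    }
    return y;
}

// Blocking matrix (simplified): differences of adjacent channels.
// A target that is identical on all channels cancels, leaving
// kChannels - 1 noise-only reference signals for the noise canceller.
std::array<Frame, kChannels - 1> blocking_matrix(const MultiFrame& ch) {
    std::array<Frame, kChannels - 1> u{};
    for (std::size_t m = 0; m + 1 < kChannels; ++m) {
        for (std::size_t n = 0; n < kFrame; ++n) {
            u[m][n] = ch[m][n] - ch[m + 1][n];
        }
    }
    return u;
}
```

A full canceller would adaptively filter the blocking-matrix outputs and subtract the result from the fixed beamformer output, typically updating the filters only when the VAD reports no speech.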

  7. Mic Scan/Testing:: C:\Microphone_Testing\Direct_Out channel_1.txt channel_2.txt channel_8.txt

  8. VAD:: C:\Microphone_Testing\Direct_Out channel_1.txt channel_2.txt channel_8.txt

  9. Revision
