1 / 4

Speech material

Results from offline processing. Comparison of dynamic range compression using single-band, multiband, and sliding-band compression schemes. Speech material “you will mark ut please” concatenated with scaling factors of 0.1, 0.8, 0.1, 0.4, 0.1 Processing

muniya
Download Presentation

Speech material

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Results from offline processing Comparison of dynamic range compression using single-band, multiband, and sliding-band compression schemes Speech material “you will mark ut please” concatenated with scaling factors of 0.1, 0.8, 0.1, 0.4, 0.1 Processing Single-band, multiband, and sliding-band dynamic range compression using Win. len. = 25.6 ms, FFT len. = 512

  2. Distortions during spectral transitions: Example of swept sinusoidal input. Input: constant amplitude, 125 –250 Hz linearly swept frequency, 200 ms sweep duration Single-band compression output Multiband compression (18 auditory critical bands) output Sliding band compression output CR = 30, Ta = 6.4 ms, Tr = 192 ms. Time (s)

  3. Example: "you will mark ut please" concatenated with scaling factors for variation in the input level. CR = 2, Ta = 6.4 ms, Tr = 6.4 & 192 ms. Input waveform Scaling factor Unprocessed waveform Processed Tr= 6.4 ms, low Pmc Processed Tr= 192 ms, low Pmc Processed Tr= 6.4 ms, highPmc Processed Tr= 192 ms, highPmc Time (s) • Processing of different speech materials with varying levels: No audible roughness or distortion during informal listening.

  4. Results from real-time processing Example: "you will mark ut please" concatenated with scaling factors for variation in the input level. CR = 2, Ta = 6.4 ms, Tr = 192 ms, low Pmc. Unprocessed waveform Offline processed waveform Real-time processed waveform Time (s) Informal listening: real-time output perceptually similar to the offline output PESQ for real-time w.r.t. offline : 3.5 Signal delay = 36 ms Use of processing capacity: 41% (lowest proc. clock for satisfactory operation = 50 MHz, max. clock = 120 MHz)

More Related