1 / 10

Document Expansion for Improved Speech Retrieval

This study addresses the challenges of erroneous transcription files, vocabulary mismatches, and noisy sound files in speech recognition. Document expansion techniques are proposed to enhance retrieval accuracy by adding new terms and reweighing existing ones. Experiments show significant improvements in retrieval results.

selima
Download Presentation

Document Expansion for Improved Speech Retrieval

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Document Expansion forSpeech Retrieval(Singhal, Pereira) Teoman ToramanÇağrı ToramanBilkent University, 2010

  2. Problem Statement Reasonable Transcription File: news_today.rtf Speech File: news_today.wav Automatic (or Manual) Speech Recognition 2 / 10

  3. Problem Statement Aboutness: Fatal train crash in Italy Query Indexing Results:D1, D2 3 / 10

  4. Problem Statement Erroneous Transcription File Noisy / Dirty Sound File Corrupted / Erroneous Automatic (or Manual) Speech Recognition 4 / 10

  5. Problem Statement Same Query Erroneous Corrupted / Erroneous Indexing Results:D2 (Vocabulary Mismatch) 5 / 10

  6. Problem Statement Noisy / Dirty Sound File Automatic (or Manual) Speech Recognition Corrupted / Erroneous • Recognition Mistakes: • Deletions • Wrong term weighting • Insertions 6 / 10

  7. Solution Expanded Corrupted / Erroneous Document Expansion 7 / 10

  8. Solution What is Document Expansion ? Step 2) Step 3) Step 1) RELATED CORPUS Corrupted / Erroneous Reweighing & Adding New Terms ... 10 similar files 8 / 10

  9. Experiments & Results 9 / 10

  10. Experiments & Results %10-15loss %20-25loss 10 / 10

More Related