1 / 10

Phone Reader Project

Phone Reader Project. By: Hossein and Hadi Shayesteh Supervisor: Mr. James Connan. Introduction. Phone Reader. Converting Image to text using OCR Reading the text using TTS. Potential Users. Intended for Blinds and Illiterates People dealing with a F oreign L anguage.

latham
Download Presentation

Phone Reader Project

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Phone Reader Project By: Hossein and HadiShayesteh Supervisor: Mr. James Connan

  2. Introduction Phone Reader • Converting Image to text using OCR • Reading the text using TTS Potential Users • Intended for Blinds and Illiterates • People dealing with a Foreign Language

  3. HLD / LLD of OCR 1 0..1 1.. 0..1 1.. 0..1

  4. HLD / LLD of OCR Feature Extraction • Assigning PixelNum to each class • Calculating LociNum using Characteristic • Loci approach • Creating Feature Vector for each segment Classification • Creating 38 classes • Creating Feature Vector for each class • Applying a Binary Mask • Calculating Euclidean Distance: • Classifying the input

  5. HLD / LLD for TTS

  6. Low level Design / TTS Word Pronunciation(UnitConcatenator) • Accepts the text • Checks the database for the word pronunciation • Reverts to “letter to sound rules” If the word doesn’t exist • outputs a sequence of phonemes • Passes the pronunciation to prosody stage Play Audio • Engine receives the phoneme • Loads the digital audio from a database • Does some pitch, time, and volume changes • Sends it out to the sound card.

  7. Overall Software Architecture OCR Package TTS Package • Main Method • Captures text image • Invokes OCR package • Sends extracted text to TTS package • Reads out the text

  8. Project Plan Term 3 • Implementing OCR and TTS engines in emulator environment • Integrating OCR and TTS engines Term 4 • Porting the complete package to the mobile platform • Testing the final package

  9. Project Demo Functionality Demo • Demonstrating project functionality using a win32 application User Interface Demo • Demonstrating project User Interface using a mobile emulator

  10. Question and Answer

More Related