1 / 22

DAWN: Dynamic Aural Web Navigation

DAWN: Dynamic Aural Web Navigation. Gopal Gupta, S. Sunder Raman, Mike Nichols, H. Reddy, N. Annamalai Department of Computer Science University of Texas at Dallas. Introduction. The Web is intrinsically Visual. We need computers to access the Web.

felice
Download Presentation

DAWN: Dynamic Aural Web Navigation

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. DAWN: Dynamic Aural Web Navigation Gopal Gupta, S. Sunder Raman, Mike Nichols, H. Reddy, N. Annamalai Department of Computer Science University of Texas at Dallas

  2. Introduction • The Web is intrinsically Visual. • We need computers to access the Web. • People with visual disabilities cannot fully iiiinteract with the Web.

  3. Aural Web • What is the Aural Web? • Based on the traditional Web. • Voice/Audio for Input/Output. • Listeners have complete control over navigation.

  4. Aural Web • What is the Aural Web? • Why do we need an Aural Web? • Make the Web accessible to all. • Do away with the need to have a computer to access iiiithe Web.

  5. Aural Web • What is the Aural Web? • Why do we need an Aural Web? • How can we obtain an Aural Web? • Translate HTML to VoiceXML (however, translation is not enough). • Need to enhance VoiceXML to make it dynamicallyiiiinavigable.

  6. VoiceXML • W3C Standard for marking voice documents • VoiceXML documents are ‘played’ on voice iiibrowsers. • A VoiceXML document consists of various iiiforms. Form names are used to control iiinavigation. • Inputs are restricted to a set of pre-defined iiiwords specified via a grammar.

  7. Example VoiceXML Document • <vxml version="2.0"> <form> <field name="rich"> <grammar type=“application/x-gsl” mode = “voice”> <![CDATA[[ [(yes)]{<option “yes”>} [(no)]<option “no”>} ]]]> </grammar> <prompt>Would you like to get rich quick?</prompt> <filled>Gotcha. <if cond="rich==‘yes’"> You want to be rich! <goto next="rich.vxml" /> <else /> You don't want to be rich. <goto next="poor.vxml" /> </if> </filled> </field> </form> </vxml>

  8. Translating HTML to VoiceXML • A module to denotationally map HTML constructs iiito VoiceXML. • It is extendable and flexible.

  9. Translating HTML to VoiceXML Input Attributes Input Duration in Seconds for Text Box : Input Duration in Seconds for Text Area : HTML Tags Output Text <blockquote> </blockquote> Starting of text quoted from elsewhere. Ignore Interface Sheet

  10. HTML to VoiceXML Translator • A module that denotationally map HTML constructs iiito VoiceXML. • It is extendable and flexible. • It can handle Forms. It preserves information about iiithe submit type and target URL. • The translator imposes certain reasonable iiirestrictions on the input HTML.

  11. DAWN Architecture HTML to VoiceXML Translator. WHY…? VoiceXML Enhancer.

  12. Limitations of VoiceXML • Navigation is controlled by the author; the listener has iiivirtually no control. • The author has to hardcode every possible navigation iiipath (obviously not possible). • Poorly authored documents become difficult to iiibrowse. • Speech recognition technology allows an arbitrary iiiuser to speak only pre-determined phrases.

  13. Limitations of VoiceXML (cont’d) • These limitations result in VoiceXML’s being useful for only simple applications. • Thus, simple database lookup (e.g., American Airlines airline information system) are possible, • But advanced applications that require complex interaction are not (e.g., making air reservation). • What is needed is the ability for the listener to move around the VXML document at his/her will.

  14. Solution • We introduce the concept of Voice Anchors, allowing iiilisteners to dynamically tag and recall any dialog. • We modify the VoiceXML documents and generate iiinew VoiceXML documents dynamically at run time. • Support pre-defined keywords (e.g., pause).

  15. Dynamic Voice Anchors • Analogous to bookmarks or HTML anchors. • An anchor is a speech label that can be associated iiiwith a specific dialog. • These anchors can be then used to recall associated iiidialogs. • A single anchor name can be used to tag multiple iiidialogs (cumulative anchor). • Any word can be chosen for an anchor name. The iiiuser spells them out the first time only.

  16. The Enhancer Module • Enhances the VoiceXML file readying it to accept iiiDynamic Voice Anchors. • Modifies the VoiceXML document to add iiiinterfaces to server side CGIs. • Adds functionality for specific keywords which iiihave pre-determined semantics. • e.g. skip, repeat, pause, resume, back.

  17. The Big Picture

  18. DEMO

  19. Some Applications • EPlan – Is a Web based integrated contingency handling system. The Aural Web increases its accessibility. • MathML to VoiceXML – Allows complex Mathematical formulas to be broken down, tagged and recalled. • Searching for directions on the Web. • Make online airline reservations using the phone.

  20. Current & Future Work • Design of Voice Scripting Languages (Talk by Mike Nichols tomorrow in Tiberius 2 at 10:30am). • Intelligent Navigation Strategy for navigating Tables. • Finally, incorporate all these techniques into a Voice Browser.

  21. Contributions • An Aural Web based on the traditional Web that allows users to perform complex Web operations using the phone. • Developed techniques to give a listener maximum control (via dynamic voice anchors).

  22. ?

More Related