Synthetic Agents that Speak and Listen

Talking with Highbrow Avatars on Your Cell Phone

Prof. Matthew Nickerson, Southern Utah University


Automated Audio Tours

Analog: audio cassette player

Digital: audio CD player, digital audio player, multimedia player


Research issues

  • Frustration and complications

  • Player damage, loss, or theft

  • Patron anxiety

  • Updates and changes

  • Outdoor venues can be problematic

  • Patrons with limited mobility


Automated Audio Tours

Analog: audio cassette player

Digital: audio CD player, digital audio player, multimedia player

BYOP


Voice Extensible Markup Language

VXML is an XML-based markup language designed specifically to implement interactive voice dialogs.

[Diagram: the user's cell phone connects through a voice/telephony gateway to a web server that holds the VXML and the digital sound content]
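To make the definition concrete, here is a minimal sketch of the kind of VXML page a web server might return to the gateway for one tour stop. It assumes VoiceXML 2.1; the audio path `audio/welcome.wav` and the prompt wording are hypothetical and do not come from the presentation.

```xml
<?xml version="1.0" encoding="UTF-8"?>
<!-- Hypothetical tour stop: file names and wording are illustrative only. -->
<vxml version="2.1" xmlns="http://www.w3.org/2001/vxml" xml:lang="en-US">
  <form id="welcome">
    <block>
      <prompt>
        <!-- Prerecorded narration; the enclosed text is read by
             text-to-speech only if the audio file cannot be played. -->
        <audio src="audio/welcome.wav">
          Welcome to the historical photograph exhibit.
        </audio>
      </prompt>
    </block>
  </form>
</vxml>
```

The gateway fetches this page over HTTP, plays the audio to the caller, and ends the call when the form is finished.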


Historical photograph exhibit

A gallery exhibit featuring historic photographs covering 100 years of theater history in Cedar City, Utah, 1900-2000.


Benefits to developer

  • Low upfront costs, start slow

  • No check out/in, maintenance, personnel

  • Easily updated

  • Real-time usage statistics

  • Powerful evaluation tool


Benefits to users

  • Familiar device

  • No check-in or collateral required

  • Avoid hygiene concerns

  • BYOD


Platform?

Work with a Vendor

Do it yourself


Partner with a vendor

[Diagram: the user's cell phone connects through a voice/telephony gateway to a web server that holds the VXML and the digital sound content]


Bridging Worldwide Networks

[Diagram: on the telephony side, the user's cell phone reaches a voice server; on the Internet side, the voice server reaches a web server holding the VXML and the digital sound content]


Built-in VXML tools

  • Voice or DTMF input

  • Prerecorded or computer generated output

  • Audio system event handlers

  • Interrupt

  • Capture audio input
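The tools listed above can be sketched in a single VXML form. This is only an illustration, assuming VoiceXML 2.1: the audio path, the field names (`topic`, `comment`), the two topics, and all prompt wording are hypothetical, and platform behavior for recording and barge-in may differ.

```xml
<?xml version="1.0" encoding="UTF-8"?>
<!-- Illustrative only: audio paths, field names, and wording are assumptions. -->
<vxml version="2.1" xmlns="http://www.w3.org/2001/vxml" xml:lang="en-US">
  <form id="tour_stop">

    <!-- Voice or DTMF input: each option can be spoken or keyed in. -->
    <field name="topic">
      <!-- Interrupt: bargein lets the caller answer before the prompt ends. -->
      <prompt bargein="true">
        <!-- Prerecorded output; the enclosed text is synthesized
             only if the audio file cannot be played. -->
        <audio src="audio/topic_question.wav">
          For theater history, say history or press 1.
          For the photographer, say photographer or press 2.
        </audio>
      </prompt>
      <option dtmf="1" value="history">history</option>
      <option dtmf="2" value="photographer">photographer</option>

      <!-- Event handlers: silence and unrecognized input. -->
      <noinput>
        <prompt>Sorry, I did not hear you.</prompt>
        <reprompt/>
      </noinput>
      <nomatch>
        <prompt>Sorry, I did not understand.</prompt>
        <reprompt/>
      </nomatch>
    </field>

    <!-- Capture audio input: record a short visitor comment. -->
    <record name="comment" beep="true" maxtime="30s"
            finalsilence="3s" dtmfterm="true">
      <prompt>
        After the beep, tell us what you thought of this stop.
        Press any key when you are finished.
      </prompt>
    </record>

    <block>
      <prompt>Thank you. You chose <value expr="topic"/>.</prompt>
    </block>
  </form>
</vxml>
```

The form items are visited in document order, so the caller first picks a topic, then leaves a comment, then hears the closing prompt.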



Virtual conversation

A gallery exhibit featuring historic photographs covering 100 years of local theater history, 1900-2000.


Building Synthetic Agents

  • Voice or DTMF input

  • Prerecorded or computer generated output

  • Audio system event handlers

  • Interrupt

  • Capture audio input


Limiting Response Options

  • Ask questions

Do you want to know more about General Lee?

What artistic period are you interested in?

What area are you currently exploring?
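A closed question such as "Do you want to know more about General Lee?" narrows the caller's reply to yes or no. One possible sketch, assuming VoiceXML 2.1 and a platform that provides the builtin `boolean` field type (builtin grammar support varies by platform); the form ids, the audio path, and the reply wording are hypothetical.

```xml
<?xml version="1.0" encoding="UTF-8"?>
<!-- Illustrative only: ids, audio path, and wording are assumptions. -->
<vxml version="2.1" xmlns="http://www.w3.org/2001/vxml" xml:lang="en-US">
  <form id="general_lee">
    <!-- type="boolean" asks the platform for its builtin yes/no grammar;
         the field is filled with true or false. -->
    <field name="wants_more" type="boolean">
      <prompt>Do you want to know more about General Lee?</prompt>
      <filled>
        <if cond="wants_more">
          <goto next="#lee_details"/>
        <else/>
          <prompt>All right. Enjoy the rest of the exhibit.</prompt>
          <exit/>
        </if>
      </filled>
    </field>
  </form>

  <form id="lee_details">
    <block>
      <prompt>
        <audio src="audio/general_lee.wav">
          Here is more about General Lee.
        </audio>
      </prompt>
    </block>
  </form>
</vxml>
```

Because the recognizer only has to tell yes from no, a question like this is far easier to recognize reliably than an open-ended one.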


Limiting Response Options

  • Ask questions

  • Create grammars

<rule id="destination" scope="public">
  <one-of>
    <item> <tag> "new york" </tag> new york </item>
    <item> <tag> "new york" </tag> new york city </item>
    <item> <tag> "new york" </tag> big apple </item>
  </one-of>
</rule>
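The rule above is a fragment. A complete grammar file needs a `<grammar>` root element around it. The sketch below assumes the W3C SRGS 1.0 XML format with the `semantics/1.0-literals` tag format, in which the `<tag>` content is taken as a plain string, so the quotation marks from the slide are dropped. The file name `destination.grxml` is hypothetical.

```xml
<?xml version="1.0" encoding="UTF-8"?>
<!-- destination.grxml (hypothetical file name) -->
<grammar xmlns="http://www.w3.org/2001/06/grammar" version="1.0"
         xml:lang="en-US" mode="voice" root="destination"
         tag-format="semantics/1.0-literals">
  <!-- Three ways of saying the same thing all return "new york". -->
  <rule id="destination" scope="public">
    <one-of>
      <item>new york <tag>new york</tag></item>
      <item>new york city <tag>new york</tag></item>
      <item>big apple <tag>new york</tag></item>
    </one-of>
  </rule>
</grammar>
```

A VXML field could then load it with `<grammar src="destination.grxml" type="application/srgs+xml"/>`, so that any of the three phrases should fill the field with the same value.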


Limiting Response Options

  • Ask questions

  • Create grammars

  • Point of contact

PROXIMITY

GEOGRAPHY

SUBJECT


Location, location, location

WiFi, GPS, Bluetooth


Challenges to Cultural Heritage Applications… and others

  • Current policies

  • Photography

  • Limiting phone calls/conversations

  • No speaker phones, please!

  • Reception


Choosing a voice

Battle of the sexes among synthetic agents and avatars

BMW, Unisys, GMVoices


Modulated human voice…?

Some swear that synthetic agents are better… others just swear.

Clifford Nass, Stanford University; Sprint PCS


Nuance in “virtual” conversation

Affective interpretation of metaphorical utterances

Catherine Smith, et al.

School of Computer Science, University of Birmingham


