1 / 8

Speech Projects at ITC irst Roberto Gretter ITC irst

Speech Projects at ITC irst Roberto Gretter ITC irst AT&T Shannon Labs, Florham Park, NJ, 6/27/2000. ITCirst Trento, Italy. About 100 researchers in 5 divisions division SSI: vision, speech, statistics, software system analysis speech: 16 researchers in 3 internal projects:

kelli
Download Presentation

Speech Projects at ITC irst Roberto Gretter ITC irst

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Speech Projects at ITC irst Roberto Gretter ITC irst AT&T Shannon Labs, Florham Park, NJ, 6/27/2000

  2. ITCirstTrento, Italy • About 100 researchers in 5 divisions • division SSI: vision, speech, statistics, software system analysis • speech: 16 researchers in 3 internal projects: • MUNST - multilinguality, broadcast news, CORETEX • SHINE - recognition in car, acoustic localization • DITELO - telephone, dialogue

  3. Accessing tourism information • Local tourism agency provides info database about accomodation, structures and services, localities, events, sports, leisure time, art, nature, transportation, … (see http://www.provincia.tn.it/apt go) • Mixed initiative dialogue, explicit confirmation, no barge-in, over 2000 words vocabulary, information presented using a Mixed Representation approach (template-based + deep generation) • First prototype for data collection • Bootstrap system handles accommodation and place names • Bootstrap grammars

  4. <localita`> trento rovereto riva del garda cles <vorrei> vorrei dovrei voglio mi piacerebbe <stelle> ( a | con) ? (( 1|2|3|4|5 ) o ? )+ stelle <tipo> albergo campeggio residence affittacamere main_grammar Tourism information • Bootstrap grammars: <vorrei> andare in un <tipo>/TYPE <stelle>/STARS <vorrei> soggiornare vicino a <localita`> /CITY un <tipo>/TYPEa <localita`> /CITY <stelle>/STARS mi manda i dati <output>/CHAN al numero <tel>/FAX <vorrei> il <info>/INFO del <tipo>/TYPE <n>/NAME • Only 2 main grammars defined • Data collected without Wizard of Oz

  5. Tourism information: acquisitions • Users provided with some task to accomplish. • Users: domain expert / non expert • Acquisitions using the first prototype (May 2000): 2 hours 37 min speech - about 220 dialogues • Preliminary evaluation on 34 dialogues

  6. automatic routing agent automatic answering agent user human router human operator test-it test-de HMM-it 93.9 - HMM-de - 89.4 HMM-mix 94.3 89.5 Aurora (proposal) • Similar to HMIHY task: • call routing (How can I help you?) • natural language processing, information extraction, document analysis and retrieval, database access • German (DFKI), Italian (Offnet, IRST, CELI), Dutch (CTIT, Uni Twente), Spanish (DFE, Uni Barcelona) • Multilinguality in SpeeData • data-entry (Land-register)

  7. Kataweb • One of the most important Italian WEB portals go • Requirement: Fast Porting of new speech and NL technologies to WEB portals (Dec 2000) • Access information via written queries: • Who invented the electrical light? • Which is the capital of Alaska? • Information extraction from news • The Bank of Japan decided … the president said ... • Spoken dialogue by phone • call center solution • trading on line by phone, about 500 stocks • Future developments are possible • extend domains • integrate speech and natural language • computer vision

  8. Technology transfer to solution developers • What they are asking us, today • to include the recognizer in menu-based call centers • to build applications requiring dialogue • What they are going to ask us, tomorrow: • multilinguality • speaker verification • What we provide • Spinet server (recognizer, grammars, lexicon, ...) • API (c++, Java) • assistance in building prototypes/systems • assistance in verifying/improving systems

More Related