1 / 36

Signing for the Deaf using Virtual Humans

Signing for the Deaf using Virtual Humans. Ian Marshall Mike Lincoln J.A. Bangham S.J.Cox (UEA) M. Tutt M.Wells (TeleVirtual, Norwich). SignAnim. School of Information Systems, UEA Televirtual, Norwich Subtitles to Signing Conversion Funded by Independent Television Commission (UK).

keziah
Download Presentation

Signing for the Deaf using Virtual Humans

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Signing for the Deaf using Virtual Humans • Ian Marshall • Mike Lincoln • J.A. Bangham S.J.Cox • (UEA) • M. Tutt M.Wells • (TeleVirtual, Norwich)

  2. SignAnim School of Information Systems, UEA Televirtual, Norwich Subtitles to Signing Conversion Funded by Independent Television Commission (UK)

  3. Tessa • School of Information Systems, UEA • Televirtual, Norwich • Speech to Signing of Counter Clerk Turns in PO Transactions • funded by Post Office

  4. ViSiCAST School of Information Systems, UEA Televirtual, Norwich Independent Television Commission (UK) Post Office (UK) RNID (UK) IvD (Holland) University of Hamburg (Germany) IST (Germany) INT (France) EU funded 5th Framework Project

  5. Background – Deaf Community Deaf v Hard of Hearing Signing v. Subtitles 60,000 v. 1 in 8 of population 300 Level 3 signers

  6. Background – Sign Language Signed Sign Supported British Sign English English Language (SE) (SSE) (BSL) educated deaf community preferred first language

  7. SignAnim – Aims and Aspirations Exploration of (semi-)automatic conversion of subtitles to sign language … … to increase access for the Deaf ... … with a potential of providing access to up to 50/80% of TV broadcasts.

  8. SignAnim – Natural Language Processing Subtitle stream up to 180 words min -1 Sign rates typically 50% of speech rate (100 signs min-1) SE – too verbose to be signed in full SSE – elision of low information words BSL – translation to multi-modal signs

  9. Sign Dictionary Avatar SignAnim Components – Simon the Avatar Sign Stream Data Capture

  10. Motion Capture Cybergloves Magnetic Sensors Video face tracker

  11. P2 Pn Schematic of SignAnim system Audio/Video Stream TV Capture Card Eliser Avatar Software Mixer Teletext Stream … P1 D1 D2

  12. SignAnim Components – Eliser RequirementsResolution of Lexical Ambiguity Elision If @ receiver Timeliness of signing v If @ transmitter prioritising of parts of sign sequence

  13. CMU Parser Prioritiser Sign Dictionary Eliser - Summary Sign Stream Subtitle Frames Elision Level

  14. SignAnim – Natural Language Processing ‘Last night we brought you the tale of the duck that could not swim and had to learn while a guest of the RAF in Norfolk.’26 wordsin 2 subtitle framestime to speak / time subtitles on screen 7 secstime to sign in full 18 / 14 / 9 secsfinger spelling significant overhead

  15. SignAnim – Natural Language Processing ‘Last night we brought you the tale of the duck that could not swim and had to learn while a guest of the RAF in Norfolk.’Resolution of some lexical ambiguity by p.o.s. tagging - duck noun/ verb - had auxiliary/ verb - swim noun/ verb - in participle/ preposition to facilitate correct sign selection

  16. SignAnim – Natural Language Processing ‘Last night we brought you the tale of the duck that could not swim and had to learn while a guest of the RAF in Norfolk.’Potential elision determiners auxiliary verbs modifying phrases adjectives and adverbs in extreme cases jettison entire sentences

  17. SignAnim – Natural Language Processing ‘Last night we brought you the tale of the duck that could not swim and had to learn while a guest of the RAF in Norfolk.’ Additional problems structural ambiguity appropriate sign no sign for guest, default finger spell

  18. SignAnim – CMU link grammar Positive features Lexically driven sentence parser Robust Prioritorises multiple analyses On failure returns partially parsed word sequence Modifiability

  19. Xp Jp Wd Pv Dsu CO E MVp A Ds Ssi Perhaps the hen was actually reared by a broody duck ! SignAnim – CMU link grammar example

  20. CMU link grammar parser - a shell <noun> : ( {A-} & {D-} & Wd- & S+ ) or ( {A-} & {D-} & O- ) or ( {A-} & {D-} & PN- ); <adj> : A+ ; <det> : D+ ; <verb> : S- & O+ & {@PP+}; <prep> : PP- & PN+; book.n books.n report.n reports.n room person : <noun> ; yellow green : <adj> ; the a : <det> ; book.v books.v report.v reports.v brings : <verb> ; on in : <prep> ; CAPITALIZED-WORDS : <noun> or <adj> or <det> ; "." : FS- ; LEFT-WALL : (Wd+ & FS+) ;

  21. Wd O PN S O { D } S { D } { D } {@PP} { A } { A } { A } books books books books Wd O S S { D } {@PP} { A } books person CMU link Grammar Parser - link construction books .n : ( {A-} & {D-} & Wd- & S+ ) or ( {A-} & {D-} & O- ) or ( {A-} & {D-} & PN- ) or .v : (S- & O+ & {@PP+});

  22. linkparser> A person reports the book. Found 1 linkage (1 had no P.P. violations) Unique linkage, cost vector = (UNUSED=0 DIS=0 AND=0 LEN=7) +----------------FS---------------+ +---Wd---+ +------O-----+ | | +--D-+---S---+ +--D--+ | | | | | | | | ///// a person reports.v the book.n . ///// FS <---FS----> FS . (m) ///// Wd <---Wd----> Wd person (m) a D <---D-----> D person (m) person S <---S-----> S reports.v (m) reports.v O <---O-----> O book.n (m) the D <---D-----> D book.n

  23. Eliser - elision stategy Augment CMU dictionary with further p.o.s. information e.g. has.aux v. has.v Rules for word and path priorities #Link Weight Left Path Left Word Right Path Right Word CO 3 X X - - D 1 - X - - Ds 1 - X - - G 4 X X - - AN 4 - X - - A 4 - X - - RS 4 - X X X

  24. Xp Jp Wd Pv Dsu CO E MVp A Ds Ssi Perhaps the hen was actually reared by a broody duck ! Eliser - Prioritorising

  25. Xp Jp Wd Pv Dsu CO E MVp A Ds Ssi Eliser - Prioritorising Perhaps the hen was actually reared by a broody duck ! 10 10 10 10 10 10 10 10 10 10

  26. Xp Jp Wd Pv Dsu CO E MVp A Ds Ssi Eliser - Prioritorising Perhaps the hen was actually reared by a broody duck ! 3 10 10 10 10 10 10 10 10 10

  27. Xp Jp Wd Pv Dsu CO E MVp A Ds Ssi Eliser - Prioritorising Perhaps the hen was actually reared by a broody duck ! 3 1 10 10 10 10 10 10 10 10

  28. Xp Jp Wd Pv Dsu CO E MVp A Ds Ssi Eliser - Prioritorising Perhaps the hen was actually reared by a broody duck ! 3 1 10 10 2 10 10 10 10 10

  29. Xp Jp Wd Pv Dsu CO E MVp A Ds Ssi Eliser - Prioritorising Perhaps the hen was actually reared by a broody duck ! 3 1 10 10 2 10 9 9 9 9

  30. Xp Jp Wd Pv Dsu CO E MVp A Ds Ssi Eliser - Prioritorising Perhaps the hen was actually reared by a broody duck ! 3 1 10 10 2 10 9 1 9 9

  31. Xp Jp Wd Pv Dsu CO E MVp A Ds Ssi Eliser - Prioritorising Perhaps the hen was actually reared by a broody duck ! 3 1 10 10 2 10 9 1 4 9

  32. Xp Jp Wd Pv Dsu CO E MVp A Ds Ssi Eliser - Elision Perhaps the hen was actually reared by a broody duck ! 3 1 10 10 2 10 9 1 4 9

  33. TESSA - Overview Aim : To give access to Post Office services for those whose first language is not English.

  34. TESSA Input : Speech Recognition • Restricted Number of sentences (115) • Variable quantities (monetary amounts, days of the week) • Grammar defined as FSN • MLLR acoustic adaptation • Entropic recognition engine

  35. TESSA Output : BSL and Foreign Language • BSL sign sequences • Signs for variable quantities blended into standard phrases • Customer may ask for phrases to be repeated • Text translations into 4 languages for non-English speakers • English text for the hard of hearing

  36. Conclusions SignAnim and Tessa demonstrated + replay of motion captured sequences readable + usefulness of existing NLP and speech recognition technologies + desirability of BSL (rather than SSE)

More Related