1 / 17

Slides, other materials online: research.microsoft/research/graphics/kgreene/icad

Audio Taken Seriously; The present and future of audio at Microsoft Ken Greenebaum kgreene@microsoft.com Internet Platforms and tools Division Microsoft Corporation. Slides, other materials online: http://www.research.microsoft.com/research/graphics/kgreene/icad. Overview. Today

myrrh
Download Presentation

Slides, other materials online: research.microsoft/research/graphics/kgreene/icad

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Audio Taken Seriously;The present and future of audio at MicrosoftKen Greenebaum kgreene@microsoft.comInternet Platforms and tools DivisionMicrosoft Corporation ICAD Industry Panel

  2. Slides, other materials online: http://www.research.microsoft.com/research/graphics/kgreene/icad ICAD Industry Panel

  3. Overview • Today • Solid media foundations (DirectX, ActiveMovie) • Soon • Advanced media (ActiveAnimation, Whisper/Whistler) • Tomorrow • Conversational interfaces ICAD Industry Panel

  4. Today: DirectSoundhttp://www.microsoft.com/mediadev/audio/iaud.htm • Streaming audio • Reasonable latency • Input (soon) • Device independence • Multiple app’s audio mix • DSound3D ICAD Industry Panel

  5. Today: Active Movie • Graph based media architecture • Movie playback • Movie record (soon!) • Open filter API • Audio plugin technology ICAD Industry Panel

  6. Today: Netshowhttp://www.microsoft.com/netshow/ • Streaming network audio/video • Multicast audio using RTP (real-time protocol) • ASF file format, conversion, editing tools • NT server ICAD Industry Panel

  7. Today: Interactive Music(Formerly BlueRibbon’s AudioActive) • Intelligent interactive music • Composes/Delivers music • Based on expert system • Human composer ‘authors’ templates • Music always sounds fresh and original • Look for it: PowerPoint ‘97, MSN Riff ICAD Industry Panel

  8. Soon: DirectMusicContact: craighs@microsoft.com • Consistent Playback of MIDI Music • Internet support for Music • DLS downloadable sample sets • Optional software MIDI synth • Internet MIDI jamming? ICAD Industry Panel

  9. Soon: “Appelles” • Expect an announcement soon! • Animation Description Language • Functional Paradigm • Media Integration • Implicit Time • Language Integration (Java) • Enable sophisticated Web animation ICAD Industry Panel

  10. Appelles Audio Capabilities: • All audio types orthogonal • Parametric Synthesis • MIDI • Audio Active Music Synthesis • Streaming audio • PCM Audio • 3D Spatialized sound embedded in geometry ICAD Industry Panel

  11. Soon: “Talisman” Audiohttp://www.microsoft.com/hwdev/devdes/talisman.htm/ • Hardware acceleration of: • DSound/DSound3D • Echo Cancellation • Active Movie filter accelerator • 32bit mixer • DLS compatible synthesizer • MODEM/Telephony ICAD Industry Panel

  12. Soon: “Whisper”http://www.research.microsoft.com/research/srg/ • Windows Highly Intelligent Speech Recognizer • Based on SphinxII • Continuous speech recognition • Speaker independent • Context-free grammar decoding ICAD Industry Panel

  13. Soon: “Whistler”http://www.research.microsoft.com/research/srg/ • Trainable Text to Speech Synthesizer • Training from human speech; maintains: • Natural prosody • Characteristics of original human • Emotional control • Uses NLP technology to parse text ICAD Industry Panel

  14. Tomorrow: Conversational Interfaces • Motivation: • Given choice people communicate with speech • People prefer natural language over ‘command languages’ • anthropomorphism unavoidable w/spoken interaction ICAD Industry Panel

  15. Persona Projecthttp://www.research.microsoft.com/ui/persona/home.htm/ • Conversational Assistant as UI • Spoken conversation (voice recognition/synth) • Natural Language (in limited domains) • Assistant w/Rich visual presence • Simulates verbal and non-verbal cues ICAD Industry Panel

  16. Here’s Peedy and Gene: ICAD Industry Panel

  17. Conclusion: • Microsoft is: • Taking media very seriously • Offering a solid foundation today • Designing the future ICAD Industry Panel

More Related