1 / 10

LYU0302: PVCAIS-Personal Video-Conference Archive Indexing System

LYU0302: PVCAIS-Personal Video-Conference Archive Indexing System. Background Videoconference is becoming popular Participants wish to keep videoconference archives for the later reference Normal video and audio files are neither searchable nor helpful to recall their contents

junius
Download Presentation

LYU0302: PVCAIS-Personal Video-Conference Archive Indexing System

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. LYU0302: PVCAIS-Personal Video-Conference Archive Indexing System • Background • Videoconference is becoming popular • Participants wish to keep videoconference archives for the later reference • Normal video and audio files are neither searchable nor helpful to recall their contents • Indexing of videoconference archives has not been investigated till now

  2. Phase I Phase II Phase III LYU0302: PVCAIS • Objective: develop an experimental system with the following architecture.

  3. LYU0302: PVCAIS • Phase I: Media acquisition (3~4 months) - Study the mechanism and media format of a videoconferencing client to obtain media data.

  4. MCU MC MP0-n Architecture of H.323-Based Videoconference System PBN Terminal Terminal A/V, Data, Control Terminal Terminal

  5. UDP+RTP/IP Video CODEC H.261, H.263 Video I/O Audio CODEC G.711, G.723 Audio I/O PBN inter-face GUI Data application: Text chat & whiteboard User CODEC T.120 System control CODEC H.245, H.225 TCP/IP Structure of H.323 Terminal • A terminal (videoconferencing client) interacts with users, encode/decode/render media streams, and communicates with MCU.

  6. UDP+RTP/IP Video CODEC H.261, H.263 Video I/O Audio CODEC G.711, G.723 Audio I/O PBN inter-face GUI Data application: Text chat & whiteboard User data CODEC T.120 System control CODEC H.245, H.225 TCP/IP Text file Whiteboard file Video file Audio file What we’ll do to a terminal? • Intercept media streams and store them in files. • Video/Audio: intercept encoded streams • Text/Whiteboard: intercept decoded streams

  7. LYU0302: PVCAIS • Phase II: Media info retrieval (3~4 months) • Select a group of algorithms to implement the multimedia-indexing functions. • Face detection & recognition • Speech recognition • OCR • Title generation • Keyword selection

  8. LYU0302: PVCAIS • Phase II - Integrate these functions to generate an XML index file for each videoconference. - Create and manage the file system of indexed archives.

  9. Searching Playback Figure 4. An example GUI of PVCAIS LYU0302: PVCAIS • Phase III: GUI (2 months) • Implement a GUI of PVCAIS to: • Search in indexed videoconference archives. • Synchronized playback of a selected conference.

  10. Required Techniques for Phase I • Source code of H.323 videoconference system (outsourcing) • H.323 knowledge • Structure of terminal • H.263 video CODEC • Visual C++ • Multithread • DirectSound • GUI programming

More Related