[The Band SIG] MPEG7 - Audio

[The Band SIG] MPEG7 - Audio. 손우람, December 1, 2007. Why MPEG-7? MPEG standards. Compression: MPEG-1 (CD), MPEG-2 (DVD, DTV), MPEG-4 (Web, Mobile). Content Description: MPEG-7. Multimedia Framework: MPEG-21. Others: MPEG-A, B, C, D, E. MPEG-7 Multimedia Indexing and Searching.

Presentation Transcript


  1. [The Band SIG] MPEG7 - Audio • 손우람 • December 1, 2007

  2. Why MPEG-7?

  3. MPEG standards • Compression: MPEG-1 (CD), MPEG-2 (DVD, DTV), MPEG-4 (Web, Mobile) • Content Description: MPEG-7 • Multimedia Framework: MPEG-21 • Others: MPEG-A, B, C, D, E

  4. MPEG-7 Multimedia Indexing and Searching • MPEG-7 Indexing & Searching: • Semantics-based (people, places, events, objects, scenes) • Content-based (color, texture, motion, melody, timbre) • Metadata (title, author, dates) • MPEG-7 Access & Delivery: • Media personalization • Adaptation & summarization • Usage environment (user preferences, devices, context)

  5. MPEG-7 MDS: Free Text Annotation Example • The following example gives an MPEG-7 description of a car that is depicted in an image:

     <Mpeg7>
       <Description xsi:type="SemanticDescriptionType">
         <Semantics>
           <Label>
             <Name> Car </Name>
           </Label>
           <Definition>
             <FreeTextAnnotation>
               Four wheel motorized vehicle
             </FreeTextAnnotation>
           </Definition>
           <MediaOccurrence>
             <MediaLocator>
               <MediaUri> image.jpg </MediaUri>
             </MediaLocator>
           </MediaOccurrence>
         </Semantics>
       </Description>
     </Mpeg7>
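To make the car description on slide 5 concrete, here is a minimal sketch of reading it with Python's standard `xml.etree.ElementTree`. Two things are assumptions and not part of the slide: the `xmlns:xsi` declaration, added only so the `xsi:type` attribute parses, and the helper name `read_semantics`, which is hypothetical. A real MPEG-7 document would also declare the MPEG-7 namespace, which this sketch ignores.

```python
import xml.etree.ElementTree as ET

# The slide's description. The xmlns:xsi declaration is an assumption added
# here so that the xsi:type attribute parses; the slide shows no namespaces.
DOC = """
<Mpeg7 xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
  <Description xsi:type="SemanticDescriptionType">
    <Semantics>
      <Label>
        <Name> Car </Name>
      </Label>
      <Definition>
        <FreeTextAnnotation>
          Four wheel motorized vehicle
        </FreeTextAnnotation>
      </Definition>
      <MediaOccurrence>
        <MediaLocator>
          <MediaUri> image.jpg </MediaUri>
        </MediaLocator>
      </MediaOccurrence>
    </Semantics>
  </Description>
</Mpeg7>
"""


def read_semantics(xml_text):
    """Return (label, definition, media_uri) from the first Semantics element.

    Hypothetical helper for illustration; it assumes the un-namespaced
    element names used on the slide.
    """
    root = ET.fromstring(xml_text.strip())
    sem = root.find("./Description/Semantics")
    label = sem.findtext("./Label/Name").strip()
    definition = sem.findtext("./Definition/FreeTextAnnotation").strip()
    uri = sem.findtext("./MediaOccurrence/MediaLocator/MediaUri").strip()
    return label, definition, uri


label, definition, uri = read_semantics(DOC)
print(label, "|", definition, "|", uri)
```

The free-text definition and the media locator are read from the same `Semantics` element, which is what ties the annotation "Car" to `image.jpg`.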

  6. Starting with audio…

  7. Audio Fingerprint

  8. Genre Classification • …

  9. Audio Visualization

  10. Music Information Retrieval • Content-based querying and retrieval • Automatic classification • Music recommendation and play-list generation • Music summarization • Musical Feature Extraction • Harmony, chord and tonality • Melody and motives • Rhythm, beat, tempo and form

  11. MPEG 7 Audio • Low-Level Descriptors • Description Schemes • Description Definition Language (DDL) • BiM (Binary Format for MPEG-7)

  12. What is a Descriptor (D)? • Definition • The meaning of an audio feature vector or component • Examples • Audio Power • Audio Envelope • Audio Spectrum Flatness

  13. Description Schemes (DSs) • Definition • Simply put, a set of Descriptors (Ds) • Examples • Instrument Timbre • LogAttackTime • HarmonicSpectralCentroid • …

  14. Description Definition Language (DDL) • A language for defining Ds and DSs • Expressed in XML • …??...

  15. Scalable Series • [Figure: Original Series and Scaled Series; Index i = 1 … 8; ratio and numOfElements values 2 3 1 and 2 1 5; totalNumOfSamples = 12] • Scalar vs. Vector
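The scalable-series idea on slide 15 can be sketched as follows. The pairing `ratio = [2, 3, 1]` with `numOfElements = [2, 1, 5]` is only one reading of the figure labels, chosen because it is consistent with `totalNumOfSamples = 12` and an index running to 8; it is an assumption. Averaging each group is also an assumption, picked here as one simple scaling operation. The function name `scale_series` and the sample values are hypothetical.

```python
def scale_series(samples, ratios, num_of_elements):
    """Reduce a series by averaging groups of consecutive samples.

    Each (ratio, count) pair produces `count` scaled values, and each scaled
    value is the mean of `ratio` consecutive original samples.
    """
    scaled = []
    pos = 0
    for ratio, count in zip(ratios, num_of_elements):
        for _ in range(count):
            group = samples[pos:pos + ratio]
            if len(group) != ratio:
                raise ValueError("series is shorter than the ratios require")
            scaled.append(sum(group) / ratio)
            pos += ratio
    if pos != len(samples):
        raise ValueError("ratios do not cover the whole series")
    return scaled


# Hypothetical original series of 12 samples (totalNumOfSamples = 12).
original = list(range(1, 13))
ratios = [2, 3, 1]            # assumed reading of the figure
num_of_elements = [2, 1, 5]   # assumed reading of the figure

scaled = scale_series(original, ratios, num_of_elements)
print(len(scaled), scaled)
```

Under this reading the 12 original samples collapse to 8 scaled values: two pairs, one triple, and five samples kept at full resolution.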

  16. Low-Level Descriptors • Basic Descriptors • Basic Spectral Descriptors • Signal Parameter Descriptors • Timbral Temporal Descriptors • Timbral Spectral Descriptors • Spectral Basis Descriptors

  17. Basic structure of audio • Time Domain • n: index • S(n): signal • Fs: Sampling rate • L: index of time frames • [Figure: time axis starting at 0 with frames L0, L1, L2; labels Nw, Nhop, Lw, Hop size]
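The time-domain framing on slide 17 can be sketched in a few lines. This assumes that `Nw` is the frame (window) length in samples and `Nhop` is the hop size in samples, which is how the figure labels read; the function name `frame_signal`, the toy signal, and the sampling rate are hypothetical.

```python
def frame_signal(s, n_w, n_hop):
    """Split signal s into frames of n_w samples whose starts are n_hop apart.

    A trailing partial frame is dropped. Frame l starts at sample l * n_hop.
    """
    if n_w <= 0 or n_hop <= 0:
        raise ValueError("n_w and n_hop must be positive")
    return [s[start:start + n_w] for start in range(0, len(s) - n_w + 1, n_hop)]


fs = 8               # hypothetical sampling rate in Hz
s = list(range(10))  # hypothetical signal: s(n) = n for n = 0..9

frames = frame_signal(s, n_w=4, n_hop=2)

# Start time of each frame in seconds: l * Nhop / Fs.
frame_times = [l * 2 / fs for l in range(len(frames))]

for l, (t, frame) in enumerate(zip(frame_times, frames)):
    print(f"L{l}: t = {t:.2f} s, samples = {frame}")
```

With `n_hop` smaller than `n_w` the frames overlap, as in this example; setting the two equal gives back-to-back frames.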

  18. Basic Descriptors • Audio Waveform • Audio Power
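As a starting point for the two Basic Descriptors on slide 18, here is a rough sketch, not the normative MPEG-7 extraction. It assumes that Audio Waveform summarizes each non-overlapping block of `n_hop` samples by its minimum and maximum, and that Audio Power is the mean of the squared samples in each block. The function names and the test signal are hypothetical.

```python
def _blocks(s, n_hop):
    """Non-overlapping blocks of n_hop samples; a trailing partial block is dropped."""
    if n_hop <= 0:
        raise ValueError("n_hop must be positive")
    return [s[i:i + n_hop] for i in range(0, len(s) - n_hop + 1, n_hop)]


def audio_waveform(s, n_hop):
    """(min, max) of the signal in each block: a compact envelope for display."""
    return [(min(b), max(b)) for b in _blocks(s, n_hop)]


def audio_power(s, n_hop):
    """Mean of the squared samples in each block."""
    return [sum(x * x for x in b) / len(b) for b in _blocks(s, n_hop)]


# Hypothetical signal: one loud block followed by one quieter block.
s = [0.0, 1.0, 0.0, -1.0, 0.5, 0.5, -0.5, -0.5]

waveform = audio_waveform(s, n_hop=4)
power = audio_power(s, n_hop=4)
print(waveform)
print(power)
```

Both descriptors yield one value (or value pair) per block, so their output rate is `Fs / n_hop` rather than the sampling rate itself.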

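  19. Next time… • Take turns preparing one Descriptor each • About 20-40 minutes • Think through an algorithm for extracting each Descriptor • Implement it in code (template code to be provided) • Free-topic seminar by each member • About 10-20 minutes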
