ZemPod : A semantic web approach to podcasting

ZemPod: A semantic web approach to podcasting Journal Of Web Semantics 2008 Oscar Celma, Music Technology Group, Spain Yves Raimond, Centre for Digital Music, UK August 31th, 2009

Contents Introduction Background System architecture Usage scenario Conclusions

Introduction [1/2] • Podcast • Portmanteau of the “iPod” and “broadcast” • A media file distributed in Internet • Use syndication feeds • Explosion in popularity of mobile devices • Make syndication model more attractive • Thousands of audio podcasts are available on the net

Introduction [2/2] • There are some limitations of podcasting • No formal description • Only textual description available in HTML • No information about the contents of a podcast session • Consists of a single audio file • Difficult to seek into one of the music tracks • To overcome these limitations • Using traditional audio signal processing • Speech/audio segmentation • Audio identification • Adding semantics to the podcast

Contents • Introduction • Background • Multimedia web syndication • Speech/music segmentation • Audio identification • The music ontology • System architecture • Usage scenario • Conclusions

Multimedia web syndication [1/2] • File format used for syndication • RSS • Really Simple Syndication (RSS 2.0) • Rich Site Summary (RSS 0.91 and 1.0) • RDF Site Summary (RSS 1.0) • Atom • To standardize feeds notation and autodiscovery • Due to some limitations and incompatibility versions of the RSS family

Multimedia web syndication [2/2] Example of RSS

Feeds and the semantic web • Atom/Owl • Aims at capturing the semantics of the Atom syndication format • Feed • Attached metadata • Entry • Holds a text content

Speech/music segmentation • Discriminating between speech (or spoken content) versus music • Achieving an automatically and meaningful segmentation of a podcast session • Speech/music segmentation methods • Gaussian Mixture Models (GMM) • Support Vector Machines (SVM) classifiers • Combination of standard Hidden Markov Models and Multilayer Perceptrons

Audio identification • Allows identification of unknown music • Audio fingerprint • A unique, compact code derived from perceptually relevant aspects of a recording • Usages • Identification • Authentication • Content-based key generation • Content-based audio retrieval and processing • Hidden Markov Models (HMM) • Can precisely model temporal evolution of audio signals

Music ontology [1/2] • Create a formal framework • Describing music-related information • Covering complex editorial information • External Ontologies used by Music Ontology • OWL-Time ontology • Describing the temporal content of Web • Interval, Instant • FRBR • Functional Requirements for Bibliographic Records • Work, Expression, Manifestation, Item • FOAF • Friend Of A Friend • Person, Group, Organization

Music ontology [2/2] Describing a music production workflow

Contents • Introduction • Background • System architecture • RDFizing a podcast session • Access and workflow • Awareness of feeds • Resource identifiers • Usage scenario • Conclusions

System architecture • Main goal is • Analysing and decomposing a given podcast audio file • RDFizing the podcast information

The system segments the audio file into speech and music sections

Apply speech recognition to extract a list of textual terms

Weight terms’ relevance according to a dictionary of musical terms

Recognize music chunks using fingerprinting

Query a metadata repository to get basic information with the track

RDFizing a podcast session • To describe the semantics of a podcast • Using Atom-OWL and music ontology • “From 0 to 2 min, there is someone speaking about Michel Jackson, then there is a recording of a ‘Billie Jean’ in 1983” • Using 2 sub concept of the Event • MusicSegment • Temporal region holding music • SpeechSegment • Temporal region holding speech

Access and workflow • REST interface • Representational state transfer • Style of software architecture for distributed hypermedia systems such as WWW • Allow us to access the podcast service • http://zempod.net/ • Considering the podcast service is available

Access and workflow- Awareness of feeds POST USER http://zempod.net/feed 201 (Created) Location Identifier http://zempod.net/feed/4567 • Internal representation of this feed • Music ontology/AtomOWL • Can be queried through SPARQL

Access and workflow- Resource identifiers • MO/AtomOWL are designed as a hierarchical URI space • Feed • Supports a syndication • http://zempod.net/feed/{FEEDID} • Entry • Holds a text content • http://zempod.net/feed/{FEEDID}/entry{ENTRYID} • Item • Actual contents • http://zempod.net/feed/{FEEDID}/entry{ENTRYID}/item{ITEMID}

Contents • Introduction • Background • System architecture • Usage scenario • Submission of the original feed • Analysis of the new entries • Semantic description of the new entries • Conclusions

Submission of the original feed Original feed POST http://www.ourmedia.org/user/billy2rivers/mrss http://zempod.net/feed 201 (Created) Location Identifier http://zempod.net/feed/1234

Analysis of the new entries Processing a new podcast session

Semantic description of the new entries GET http://zempod.net/feed/1234 USER

Conclusions • To solve limitations of podcasting • No formal description of a podcast • Difficult to seek into one of the music tracks • Using traditional audio signal processing • Speech/music segmentation • Audio identification • Using semantic web techniques • Transform the current RSS to the Atom/OWL • It will ease some important music information retrieval tasks

Related Ontology – MO/Event • To express the production process of a piece of music • The main sub-classes of event • Performance, Recording, Arrangement, Composition

Related Ontology - FRBR • Functional Requirements for Bibliographic Records • 서지 레코드의 기능상 요건 • 목록규칙이나 목록의 완성을 의도하는 개체-관계 모델 • 서지레코드의 구조와 관계 • 목록규칙 제정과 시스템 디자인을 위한 정확한 어휘 제공

FRBR – Entities and Relationships (1) Entities and Primary Relationships

FRBR – Entities and Relationships (2) Entities and “Responsibility” Relationships

FRBR – Entities and Relationships (3) WORK has as subject has as subject has as subject WORK EXPRESSION PERSON CONCEPT OBJECT MENIFESTATION ITEM CORPORATE BODY EVENT PLACE Entities and “Subject” Relationships

MusicBrainz

ZemPod : A semantic web approach to podcasting

ZemPod : A semantic web approach to podcasting

Presentation Transcript

Semantic Web Services Tutorial

Psychology 439G/572B

Spectral Hashing

Open / Semantic / Social / Mobile

Knowledge Discovery over the Deep Web, Semantic Web and XML

Eric Neumann Clinical Semantic Group W3C HCLS chair, MIT Fellow

What Semantic Web researchers need to know about Machine Learning?

COMPSCI 732: Semantic Web Technologies

Unit 11 Semantic Development of Words for Each Sex and What It Shows

The Semantic Web Landscape

Internet Engineering Course

Semantic Web Services Systems and Tools 4th International Semantic Web Conference (ISWC 2005)

Tutorial AAAI 2006 Tutorial Forum

Chapter 1 The Semantic Web Vision

Approach to Adrenal Incidentalomas

COMPSCI 732: Semantic Web Technologies

Semantic Web

Semantic Web

Artificial Intelligence

Semantic Web Services

What Do You Want —Semantic Understanding?

Searching The Semantic Web