1 / 14

Using the NASA Thesaurus to Support the Indexing of Streaming Media

Using the NASA Thesaurus to Support the Indexing of Streaming Media. Gail Hodge Information International Associates, Inc. Janet Ormes & Patrick Healey NASA Goddard Space Flight Center Library. Historic Context.

lynna
Download Presentation

Using the NASA Thesaurus to Support the Indexing of Streaming Media

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Using the NASA Thesaurus to Support the Indexing of Streaming Media Gail Hodge Information International Associates, Inc. Janet Ormes & Patrick Healey NASA Goddard Space Flight Center Library

  2. Historic Context • The Library has collected and circulated the Center’s colloquia on audio or video since 1967 • A catalog of these holdings have been posted on the Library’s web site since 2001 • Patrons required to come to the Library, resulting in limited accessibility of recorded colloquia • Streaming Media Center Project began in 2001 as part of the Library’s response to Knowledge Management initiatives

  3. Introducing the GSFC Media Center

  4. Streaming Media • Streaming media • Video that is encoded for delivery across the internet/intranet • Encoding • Computer processing of video to a format for web casting • Web casting • The act of delivering audio and video content across the internet/intranet • Can be delivered live or on-demand

  5. The Goddard Library Streaming Media Center • The Streaming Media Center is now available from the Library website (http://library.gsfc.nasa.gov) • Can be included in personalized portals • Library has collected >350 hours of video • >100 hours indexed • Currently broadcasting 2 hours daily for the Earth Observing Systems Knowledge Management Pilot

  6. Access Issues • Current Needs • Need to know the overall topic of the video • More likely to remember the topic, presenter, date or series • Permanent Access • Less likely that users will remember the video’s metadata • More likely that users will want specific information • Terminology may change over time

  7. Indexing Video Content • Video indexing is similar to a back-of-the book index for specific information • Entering a keyword leads you to the specific location of the subject

  8. Features of Selected Software • Compares recognized speech with stored default terminology • Uses speaker inflection to identify meaningful intervals • Indexing and Search components included

  9. Incorporation of NASA Thesaurus • Added specific scientific terminology • Incorporated terms and their NTs, RTs and UF/USE relationships • Used text of Astrophysics Data System to provide terms in grammatical structures • Provides query expansion and improves relevancy

  10. Query Expansion “Saturn Moons” + Ios + Triton Or “Scatha Satellite” + P78-2 Satellite

  11. Query Expansion (Illustrated) Sample Search (aurora) on same one hour lecture entitled “Jupiter’s Aurora”. One file was indexed using the NASA thesaurus, the other was indexed using a more basic scientific word list. Benefits GREATER overall relevance understanding MORE relevant content found (2M+ VS 20 Sec’s) Ignores IRRELEVANT content (Speech Recognition Error)

  12. Relevance Interval Creation • Relevance Interval Creation links related concepts within media files, which drives Relevance Intervals • External knowledge from the thesaurus improves the accuracy of the Creation process because the explicit knowledge in text is incomplete

  13. Relevance Interval (Illustrated) Sample Search (aurora) on same one hour lecture entitled “Jupiter’s Aurora”. One file was indexed using the NASA thesaurus, the other was indexed using a more basic scientific word list. Benefits GREATER overall relevance understanding MORE relevant content found (2M+ VS 20 Sec’s) Ignores IRRELEVANT content (Speech Recognition Error)

  14. Benefits • Identify relevant pieces of content within a longer video • Stream more relevant, specific information intervals to users • Minimize manual processing • Ultimately improve reuse of information and increase opportunities for knowledge sharing

More Related