Pattern recognition in action for cataloging and metadata
Download
1 / 42

Pattern Recognition in Action for Cataloging and Metadata - PowerPoint PPT Presentation


  • 230 Views
  • Uploaded on

Pattern Recognition in Action for Cataloging and Metadata. 2006 OLC Technical Services Retreat Chris Grabenstatter April 25, 2006. Agenda. OCLC Cataloging/Metadata strategic directions Architecture to support strategy Examples of projects . Cataloging Environment.

loader
I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
capcha
Download Presentation

PowerPoint Slideshow about 'Pattern Recognition in Action for Cataloging and Metadata' - Jeffrey


An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript
Pattern recognition in action for cataloging and metadata l.jpg

Pattern Recognition in Action for Cataloging and Metadata

2006 OLC Technical Services Retreat

Chris Grabenstatter

April 25, 2006


Agenda l.jpg
Agenda

  • OCLC Cataloging/Metadata strategic directions

  • Architecture to support strategy

  • Examples of projects


Cataloging environment l.jpg
Cataloging Environment

  • Fewer catalogers, reduced budgets

  • Little growth in print materials acquisitions

  • E-resources increasing – cataloged?


Deliver more automatically l.jpg
Deliver more automatically

  • Build on PromptCat, Cataloging Partners program success

  • Partner with major materials providers

  • Cataloging tied to selection – possible new service


More scripts language support l.jpg
More Scripts/Language Support

  • Growing WorldCat

    • Supporting libraries’ diverse collections

    • Easier to get materials cataloged

  • Growing membership

    • One stop shopping

    • Both US and global libraries


Metadata support for e content l.jpg
Metadata support for e-content

  • Support automated metadata generation for e-resources

  • Facilitate storage and discovery of digital content

  • Support new metadata schemes - crosswalks

  • Enrich WorldCat with e-serials records and holdings


Continue to deliver value l.jpg
Continue to deliver value

  • Ongoing Connexion maintenance

  • Standards

  • Simplify pricing


Lego era l.jpg

“The Internet is entering its Lego era. Indeed, blocks of interchangeable software components are proliferating … and developers are joining them together to create a potentially infinite array of useful new programs.”

--John Markoff, The New York Times, April 5, 2006

Lego Era


Library 2 0 l.jpg
Library 2.0 interchangeable software components are proliferating … and developers are joining them together to create a potentially infinite array of useful new programs.”

“Library 2.0 is about small pieces of software loosely joined, … requires business models where multiple vendors bring value to consumers together … to reduce duplication of effort and reduce barriers to innovation…”

--Paul Miller, “Library 2.0: the challenge of disruptive innovation.”

http://www.talis.com/resources/documents/447_Library_2_prf1.pdf


Slide10 l.jpg

OCLC Metadata Management Service interchangeable software components are proliferating … and developers are joining them together to create a potentially infinite array of useful new programs.”

Connexion Digital Archive Content Coop ILS PICA NetLibrary Material Vendors Publishers

OAI Repositories

Local DB’s

Web Services/Portal/API Layer

Local

Holdings

(MFHD)

OAI Harvest

Validate

DA Ingest

DA Extract

DA Access

Metadata Creation

Z39.50

(authorities

Non-roman

Format

Crosswalks

Acquisitions/

Selection

Terminologies

Pan/Zoom

Language

Service

SRW/Zing Update

Shelf Ready

Reports &

Stats

Profiling

Metadata Capture

Usage

Stats

Profiling

Data

Digital

Archive


Projects metadata support for e content l.jpg
Projects interchangeable software components are proliferating … and developers are joining them together to create a potentially infinite array of useful new programs.”Metadata support for e-content

  • Extraction/Creation Web Service

  • Crosswalk Web service

  • OCLC Terminologies Service

  • Content Cooperative Pilot

  • OCLC eSerials Holdings Service


Extraction creation web service l.jpg
Extraction/Creation Web Service interchangeable software components are proliferating … and developers are joining them together to create a potentially infinite array of useful new programs.”

Extract metadata from Web sites, PDF files, and Word files

  • Re-implementing and enhancing functionality currently available in Connexion browser

    • Connexion browser – May 2006

    • Connexion client 1.60 – June 2006


Connexion extract metadata l.jpg
Connexion extract metadata interchangeable software components are proliferating … and developers are joining them together to create a potentially infinite array of useful new programs.”

  • Enter URL or path to extract metadata

    • Supported file types .htm, .doc, .pdf

  • Create multiple records from Web sites linked to the parent URL

  • Specify to display or save created workforms, apply default constant data, and define My Status value

  • Future – add tools to “create” metadata


Crosswalk web service l.jpg
Crosswalk Web Service interchangeable software components are proliferating … and developers are joining them together to create a potentially infinite array of useful new programs.”

  • Batchload/PromptCat

    • ONIX to MARC

    • OAI harvesting – Dublin Core to MARC

  • Future

    • Import and export Dublin Core data from Connexion client

    • Support for other metadata schemes both browser and client interfaces


Oclc terminologies service l.jpg
OCLC Terminologies service interchangeable software components are proliferating … and developers are joining them together to create a potentially infinite array of useful new programs.”

  • Introduction, June 2006

  • Add more access points using other controlled vocabularies, e.g., MeSH, GSAFD

  • Available to all OCLC Cataloging subscribers

  • Subscriptions available for non-Cataloging users

  • Use with a variety of metadata editors, e.g., Connexion browser and client


List of terminologies in initial release l.jpg
List of Terminologies in initial release interchangeable software components are proliferating … and developers are joining them together to create a potentially infinite array of useful new programs.”

  • aat - Art & Architecture Thesaurus (J. Paul Getty Trust)

  • dct - Dublin Core Metadata Initiative Type Vocabulary (Dublin Core Metadata Initiative)

  • gmgpc - Thesaurus of Graphic Materials, TGM I (Library of Congress)

  • gsafd - Guidelines On Subject Access To Individual Works Of Fiction, Drama, Etc. (American Library Association)

  • lctgm - Thesaurus of Graphic Materials, TGM II (Library of Congress)

  • mesh - Medical Subject Headings (MeSH®) (National Library of Medicine)

  • ngl - Newspaper Genre List (University of Washington)

  • tgn - Thesaurus of Geographic Names (J. Paul Getty Trust)

  • ulan - Union List of Artists' Names (J. Paul Getty Trust


Slide22 l.jpg

Terminology interchangeable software components are proliferating … and developers are joining them together to create a potentially infinite array of useful new programs.”

Pane

A separate application

1

2


Content cooperative pilot l.jpg
Content Cooperative Pilot interchangeable software components are proliferating … and developers are joining them together to create a potentially infinite array of useful new programs.”

  • Upload content objects to the OCLC Digital Archive from Connexion browser and client interfaces

    • Digital image, thesis & dissertation, oral history, e-book, video, etc.

  • Replace WorldCat records to automatically add a URL pointing to the content object

  • Access digital content from FirstSearch, Group Catalogs, and OpenWorldCat


Slide30 l.jpg

Planned Open WorldCat Page for Digital Image interchangeable software components are proliferating … and developers are joining them together to create a potentially infinite array of useful new programs.”


Slide31 l.jpg

Planned interchangeable software components are proliferating … and developers are joining them together to create a potentially infinite array of useful new programs.” Zoom & Pan


Oclc eserials holdings service l.jpg
OCLC eSerials Holdings service interchangeable software components are proliferating … and developers are joining them together to create a potentially infinite array of useful new programs.”

  • Automatically updates eSerials holdings in WorldCat

  • Access to eSerials via WorldCat Resource Sharing

  • Access to eSerials through OCLC discovery platform

  • Compare electronic and print serials collections

    No additional work for the library!


Slide33 l.jpg

P interchangeable software components are proliferating … and developers are joining them together to create a potentially infinite array of useful new programs.”

P

P

E

E

P

E

P

P

E

E

P

E

E

E

Digital Collections

D

P

E

P

E

P

E

Print Collections

P

P

P

E

P

E

E

Links to online Full Text

Resource Sharing Services

Links to OPACs

P

P

E

P

E

E

OCLC Libraries

Vendors

OCLC

OCLC FirstSearch

WorldCat Resource Sharing

WorldCat Collection Analysis

Open WorldCat

Resolver /

A-Z serials

list

Resolver /

A-Z serials

list

P

E

P

P


Pilot partners l.jpg
Pilot Partners interchangeable software components are proliferating … and developers are joining them together to create a potentially infinite array of useful new programs.”

  • 35 Pilot libraries

  • EBSCO

  • Ex Libris

  • Serials Solutions

  • TDNet

  • More to come


Benefits to the library l.jpg
Benefits to the library interchangeable software components are proliferating … and developers are joining them together to create a potentially infinite array of useful new programs.”

  • Increased operational efficiencies in ILL

    • Filling where possible

    • A revenue opportunity for some

    • You control requests via automatic deflection

  • Increased visibility at the point of need

  • Leverages investment in services


Progress l.jpg
Progress interchangeable software components are proliferating … and developers are joining them together to create a potentially infinite array of useful new programs.”

  • Initial production system available late June 2006

  • Web-based registration form

  • No charge to participate in the eSerials holdings service

  • Future enhancements projected to include options for local holdings data, MARC record update service, and additional deflection choices


Projects deliver more automatically l.jpg
Projects interchangeable software components are proliferating … and developers are joining them together to create a potentially infinite array of useful new programs.”Deliver more automatically

  • Improve shelf-ready cataloging

    • PromptCat/Cataloging Partners – 100% goal

    • Partner with major vendors

  • Selection

    • Possible future service

    • OCLC partnering with materials vendors to help with notification slip selection process

    • Cataloging a by product of selection

    • Watch for more information in the future!


Projects more scripts language support l.jpg
Projects interchangeable software components are proliferating … and developers are joining them together to create a potentially infinite array of useful new programs.”More Scripts/Language support

  • New scripts support

    • Cyrillic, Greek and Hebrew – July 2005

    • Thai and Tamil scripts for use with Connexion client 1.50 (investigating Devanagari, Sinhala, and Bengali next)

  • Connexion interface translations

    • Client

      • Chinese (Traditional and Simplified) and Japanese – July 2005

      • German and Korean – Nov. 2005

    • CatExpress – French (Nov. 2006)

  • Unicode export – Nov. 2005

  • Automatic transliteration Web service – June 2006


Questions answers sharing l.jpg
Questions, Answers interchangeable software components are proliferating … and developers are joining them together to create a potentially infinite array of useful new programs.”& Sharing

?

?

?


Cataloging future directions contact us l.jpg
Cataloging future directions interchangeable software components are proliferating … and developers are joining them together to create a potentially infinite array of useful new programs.”Contact us

  • Eric Childress: eric_childress@oclc.org

  • Chris Grabenstatter: c_grabenstatter@oclc.org