pattern recognition in action for cataloging and metadata
Download
Skip this Video
Download Presentation
Pattern Recognition in Action for Cataloging and Metadata

Loading in 2 Seconds...

play fullscreen
1 / 42

Pattern Recognition in Action for Cataloging and Metadata - PowerPoint PPT Presentation


  • 229 Views
  • Uploaded on

Pattern Recognition in Action for Cataloging and Metadata. 2006 OLC Technical Services Retreat Chris Grabenstatter April 25, 2006. Agenda. OCLC Cataloging/Metadata strategic directions Architecture to support strategy Examples of projects . Cataloging Environment.

loader
I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
capcha
Download Presentation

PowerPoint Slideshow about 'Pattern Recognition in Action for Cataloging and Metadata' - Jeffrey


An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript
pattern recognition in action for cataloging and metadata

Pattern Recognition in Action for Cataloging and Metadata

2006 OLC Technical Services Retreat

Chris Grabenstatter

April 25, 2006

agenda
Agenda
  • OCLC Cataloging/Metadata strategic directions
  • Architecture to support strategy
  • Examples of projects
cataloging environment
Cataloging Environment
  • Fewer catalogers, reduced budgets
  • Little growth in print materials acquisitions
  • E-resources increasing – cataloged?
deliver more automatically
Deliver more automatically
  • Build on PromptCat, Cataloging Partners program success
  • Partner with major materials providers
  • Cataloging tied to selection – possible new service
more scripts language support
More Scripts/Language Support
  • Growing WorldCat
    • Supporting libraries’ diverse collections
    • Easier to get materials cataloged
  • Growing membership
    • One stop shopping
    • Both US and global libraries
metadata support for e content
Metadata support for e-content
  • Support automated metadata generation for e-resources
  • Facilitate storage and discovery of digital content
  • Support new metadata schemes - crosswalks
  • Enrich WorldCat with e-serials records and holdings
continue to deliver value
Continue to deliver value
  • Ongoing Connexion maintenance
  • Standards
  • Simplify pricing
lego era
“The Internet is entering its Lego era. Indeed, blocks of interchangeable software components are proliferating … and developers are joining them together to create a potentially infinite array of useful new programs.”

--John Markoff, The New York Times, April 5, 2006

Lego Era
library 2 0
Library 2.0

“Library 2.0 is about small pieces of software loosely joined, … requires business models where multiple vendors bring value to consumers together … to reduce duplication of effort and reduce barriers to innovation…”

--Paul Miller, “Library 2.0: the challenge of disruptive innovation.”

http://www.talis.com/resources/documents/447_Library_2_prf1.pdf

slide10

OCLC Metadata Management Service

Connexion Digital Archive Content Coop ILS PICA NetLibrary Material Vendors Publishers

OAI Repositories

Local DB’s

Web Services/Portal/API Layer

Local

Holdings

(MFHD)

OAI Harvest

Validate

DA Ingest

DA Extract

DA Access

Metadata Creation

Z39.50

(authorities

Non-roman

Format

Crosswalks

Acquisitions/

Selection

Terminologies

Pan/Zoom

Language

Service

SRW/Zing Update

Shelf Ready

Reports &

Stats

Profiling

Metadata Capture

Usage

Stats

Profiling

Data

Digital

Archive

projects metadata support for e content
Projects Metadata support for e-content
  • Extraction/Creation Web Service
  • Crosswalk Web service
  • OCLC Terminologies Service
  • Content Cooperative Pilot
  • OCLC eSerials Holdings Service
extraction creation web service
Extraction/Creation Web Service

Extract metadata from Web sites, PDF files, and Word files

  • Re-implementing and enhancing functionality currently available in Connexion browser
    • Connexion browser – May 2006
    • Connexion client 1.60 – June 2006
connexion extract metadata
Connexion extract metadata
  • Enter URL or path to extract metadata
    • Supported file types .htm, .doc, .pdf
  • Create multiple records from Web sites linked to the parent URL
  • Specify to display or save created workforms, apply default constant data, and define My Status value
  • Future – add tools to “create” metadata
crosswalk web service
Crosswalk Web Service
  • Batchload/PromptCat
    • ONIX to MARC
    • OAI harvesting – Dublin Core to MARC
  • Future
    • Import and export Dublin Core data from Connexion client
    • Support for other metadata schemes both browser and client interfaces
oclc terminologies service
OCLC Terminologies service
  • Introduction, June 2006
  • Add more access points using other controlled vocabularies, e.g., MeSH, GSAFD
  • Available to all OCLC Cataloging subscribers
  • Subscriptions available for non-Cataloging users
  • Use with a variety of metadata editors, e.g., Connexion browser and client
list of terminologies in initial release
List of Terminologies in initial release
  • aat - Art & Architecture Thesaurus (J. Paul Getty Trust)
  • dct - Dublin Core Metadata Initiative Type Vocabulary (Dublin Core Metadata Initiative)
  • gmgpc - Thesaurus of Graphic Materials, TGM I (Library of Congress)
  • gsafd - Guidelines On Subject Access To Individual Works Of Fiction, Drama, Etc. (American Library Association)
  • lctgm - Thesaurus of Graphic Materials, TGM II (Library of Congress)
  • mesh - Medical Subject Headings (MeSH®) (National Library of Medicine)
  • ngl - Newspaper Genre List (University of Washington)
  • tgn - Thesaurus of Geographic Names (J. Paul Getty Trust)
  • ulan - Union List of Artists\' Names (J. Paul Getty Trust
slide22

Terminology

Pane

A separate application

1

2

content cooperative pilot
Content Cooperative Pilot
  • Upload content objects to the OCLC Digital Archive from Connexion browser and client interfaces
    • Digital image, thesis & dissertation, oral history, e-book, video, etc.
  • Replace WorldCat records to automatically add a URL pointing to the content object
  • Access digital content from FirstSearch, Group Catalogs, and OpenWorldCat
oclc eserials holdings service
OCLC eSerials Holdings service
  • Automatically updates eSerials holdings in WorldCat
  • Access to eSerials via WorldCat Resource Sharing
  • Access to eSerials through OCLC discovery platform
  • Compare electronic and print serials collections

No additional work for the library!

slide33

P

P

P

E

E

P

E

P

P

E

E

P

E

E

E

Digital Collections

D

P

E

P

E

P

E

Print Collections

P

P

P

E

P

E

E

Links to online Full Text

Resource Sharing Services

Links to OPACs

P

P

E

P

E

E

OCLC Libraries

Vendors

OCLC

OCLC FirstSearch

WorldCat Resource Sharing

WorldCat Collection Analysis

Open WorldCat

Resolver /

A-Z serials

list

Resolver /

A-Z serials

list

P

E

P

P

pilot partners
Pilot Partners
  • 35 Pilot libraries
  • EBSCO
  • Ex Libris
  • Serials Solutions
  • TDNet
  • More to come
benefits to the library
Benefits to the library
  • Increased operational efficiencies in ILL
    • Filling where possible
    • A revenue opportunity for some
    • You control requests via automatic deflection
  • Increased visibility at the point of need
  • Leverages investment in services
progress
Progress
  • Initial production system available late June 2006
  • Web-based registration form
  • No charge to participate in the eSerials holdings service
  • Future enhancements projected to include options for local holdings data, MARC record update service, and additional deflection choices
projects deliver more automatically
ProjectsDeliver more automatically
  • Improve shelf-ready cataloging
    • PromptCat/Cataloging Partners – 100% goal
    • Partner with major vendors
  • Selection
    • Possible future service
    • OCLC partnering with materials vendors to help with notification slip selection process
    • Cataloging a by product of selection
    • Watch for more information in the future!
projects more scripts language support
Projects More Scripts/Language support
  • New scripts support
    • Cyrillic, Greek and Hebrew – July 2005
    • Thai and Tamil scripts for use with Connexion client 1.50 (investigating Devanagari, Sinhala, and Bengali next)
  • Connexion interface translations
    • Client
      • Chinese (Traditional and Simplified) and Japanese – July 2005
      • German and Korean – Nov. 2005
    • CatExpress – French (Nov. 2006)
  • Unicode export – Nov. 2005
  • Automatic transliteration Web service – June 2006
ad