OCLC Update
BIBCO/CONSER OpCo 2010
May 6
Robert Bremer
Cynthia M. Whitacre
WorldCat Quality Management Division, OCLC

Topics Covered
• MARC Update
• DDR: Duplicate Detection and Resolution
• ISSN-L
• Provider Neutral Monographs for E-Books

MARC Update 2010
• Expected Date of Install: May 16, 2010
• Contents: Most elements of MARC Updates Nos. 10 & 11
• Documentation: Technical Bulletin 258 (to be released any day now)
• Not included:
  • Implementation of Computer Files 008/23, 006/06 (Form of item)
  • Implementation of subfields $3 in Bibliographic & Authority fields 034, and subfields $5 in Bibliographic 800-830 fields

MARC Update 2010: Bibliographic Records
• New codes for Form of Item for use in 006 & 008 fields
  • o = online
  • q = direct electronic
• 040 subfield $e code ‘rda’ authorized; $e becomes repeatable
• New fields 336, 337, & 338 for content type, media type and carrier type

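As a rough illustration of the new Form of Item codes, the sketch below reads one character from an 008 field and maps it to a label. It assumes a Books-style 008 layout, where Form of Item sits at character position 23. The function name `form_of_item` and the sample 008 string are hypothetical, and only the two codes listed on the slide are included.

```python
# Form of Item codes added in this update (taken from the slide).
NEW_FORM_OF_ITEM_CODES = {
    "o": "online",
    "q": "direct electronic",
}

# Assumption: Books-style 008, where Form of Item is character position 23 (0-based).
FORM_OF_ITEM_POSITION = 23


def form_of_item(field_008: str) -> str:
    """Return a label for a new Form of Item code, or 'other/unknown' otherwise."""
    if len(field_008) <= FORM_OF_ITEM_POSITION:
        raise ValueError("008 field is too short to contain Form of Item")
    code = field_008[FORM_OF_ITEM_POSITION]
    return NEW_FORM_OF_ITEM_CODES.get(code, "other/unknown")


# Hypothetical 40-character 008 with every position blank except Form of Item.
sample_008 = " " * 23 + "o" + " " * 16
print(form_of_item(sample_008))
```

Codes that predate this update fall through to "other/unknown" here; a fuller table would list them as well.
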
MARC Update 2010: Bibliographic Records Continued
• New subfields for 033, 518
• New 38X fields for attributes of musical works:
  • Field 380: Form of Work
  • Field 381: Other Distinguishing Characteristics of Work or Expression
  • Field 382: Medium of Performance
  • Field 383: Numeric Designation of Musical Work
  • Field 384: Key

MARC Update 2010: Authority Records
• 040 subfield $e code ‘rda’ authorized; $e becomes repeatable
• New 38X fields for attributes of musical works:
  • Field 380: Form of Work
  • Field 381: Other Distinguishing Characteristics of Work or Expression
  • Field 382: Medium of Performance
  • Field 383: Numeric Designation of Musical Work
  • Field 384: Key

MARC Update 2010: Authority Records Continued
• New subfields for 046
• Fields added for entity attributes:
  • Field 336: Content Type
  • Field 370: Associated Place
  • Field 371: Address
  • Field 372: Field of Activity
  • Field 373: Affiliation
  • Field 374: Occupation
  • Field 375: Gender
  • Field 376: Family Information
  • Field 377: Associated Language

DDR: Duplicate Detection & Resolution
• New DDR has been running small batches since the middle of 2009. Began running in production at the end of January 2010.
• Two methods of attack:
  • Entire WorldCat database, starting with Record #1
  • Daily journal files

DDR: More details
• All formats are included, not just books
• Statistics as of early May 2010:
  • Walking the database: 21 million records examined, 1.25 million records merged
  • Daily journal files: 7.5 million records examined, 250,000 records merged

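To put the two sets of figures side by side, the merge rate for each method can be worked out from the numbers on the slide. This is only a back-of-the-envelope sketch: it assumes the "merged" counts are directly comparable to the "examined" counts, and the names `ddr_stats` and `merge_rate` are illustrative.

```python
# Figures as reported on the slide (early May 2010).
ddr_stats = {
    "Walking the database": {"examined": 21_000_000, "merged": 1_250_000},
    "Daily journal files": {"examined": 7_500_000, "merged": 250_000},
}


def merge_rate(examined: int, merged: int) -> float:
    """Percentage of examined records that were merged."""
    return 100.0 * merged / examined


for method, counts in ddr_stats.items():
    rate = merge_rate(counts["examined"], counts["merged"])
    print(f"{method}: {rate:.1f}% of examined records merged")
```

On these figures, roughly 6% of records examined while walking the database were merged, against about 3.3% of records coming through the daily journal files.
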
ISSN-L
• ISSN-L enables collocation or linking among different media versions of a continuing resource
• Field 022 subfields $l and $m implemented in 2009
• ISSN International Centre assigned a corresponding ISSN-L to all existing ISSNs and provided a listing
• Resulting list processed against WorldCat to identify records where ISSN-L was missing
• ISSN-L added in field 022 subfield $l by macro

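The matching step described above might look something like the following sketch. It is not the OCLC macro: the record representation (a 022 field as a list of subfield code/value pairs), the function `add_issn_l`, and the ISSN values in the lookup table are all hypothetical placeholders standing in for the ISSN International Centre listing.

```python
from typing import Dict, List, Tuple

Subfield = Tuple[str, str]  # (subfield code, value)

# Hypothetical excerpt of an ISSN -> ISSN-L listing; the values are placeholders.
ISSN_TO_ISSN_L: Dict[str, str] = {
    "1234-5679": "1234-5679",
    "2434-561X": "1234-5679",
}


def add_issn_l(field_022: List[Subfield], table: Dict[str, str]) -> bool:
    """Append $l to a 022 field when it is missing and the ISSN in $a is listed.

    Returns True if the field was changed.
    """
    codes = [code for code, _ in field_022]
    if "l" in codes:
        return False  # ISSN-L already present; leave the field alone
    for code, value in field_022:
        if code == "a" and value in table:
            field_022.append(("l", table[value]))
            return True
    return False  # no $a, or the ISSN is not in the listing


# Two media versions with different ISSNs end up sharing one ISSN-L.
online_version = [("a", "2434-561X")]
changed = add_issn_l(online_version, ISSN_TO_ISSN_L)
print(changed, online_version)
```

Because both placeholder ISSNs map to the same ISSN-L, records for the different media versions can then be collocated on the shared $l value.
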
ISSN-L continued
• About 1 million WorldCat records were changed in January through April
• Changed CONSER records were redistributed
• Over 8100 CONSER records deferred for later processing
• List of records to be changed grew to 1.5 million shortly after starting the process
• Additional records will be modified in a subsequent pass through the database later this year

Provider Neutral e-Books
• Conversion of provider-specific e-book records to provider-neutral is being accomplished via use of a macro
• A set-by-set approach is used to deal with specific fields that may be common to records within various sets
• Initial pass through ebrary, MyiLibrary, and NetLibrary completed, but a second pass is needed to deal with various issues and problems with data and coding
• Finding lots of hybrid records: print or online?
• Conversion of Google Books likely to begin in May

Thank You! Questions?