
Archiving microdata: Standards and good practices

United Nations Statistics Commission, New York, February 26, 2009. Olivier Dupriez, World Bank, Development Data Group and International Household Survey Network. odupriez@worldbank.org


Presentation Transcript


  1. Archiving microdata: Standards and good practices. United Nations Statistics Commission, New York, February 26, 2009. Olivier Dupriez, World Bank, Development Data Group and International Household Survey Network. odupriez@worldbank.org

  2. The value of data
     • Surveys and censuses
     • High cost → High value?
     • Data have value beyond the purpose for which they were originally collected (“repurposing” of data)
     • Large under-exploited potential
     • Condition: proper archiving
     • Documentation, dissemination, preservation

  3. Data archiving – Two models
     • By a specialized data center (“trusted repository”) (US, Canada, Europe)
        • Often academic
        • High level of expertise
        • Infrastructure
        • Standards and best practices for documentation
        • Formal dissemination and preservation policies and procedures
        • Support to users
     • By the data producer (most developing countries)
        • Not seen as a key role
        • Lack of expertise
        • Inappropriate infrastructure
        • Ad hoc practices
        • No compliance with international standards
        • Unclear policies and procedures

  4. Sharing good practices
     Objective: transfer data archiving good practices and standards to data producers
     International Household Survey Network (IHSN)
     • A network of international agencies (coordinated by World Bank/PARIS21)
     • Develops tools, guidelines, training materials
     • Advocates compliance with good practices and international standards
     www.ihsn.org

  5. Microdata documentation
     Good documentation is needed to:
     • Properly analyze the data
     • Increase credibility of derived indicators and analysis
     • Allow replication of data collection or analysis
     • Build institutional memory
     DDI + Dublin Core metadata standards (XML)
     A checklist of everything you need to know
     • Study description
     • File description
     • Variable description
     • Related materials
     www.ddialliance.org
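
The four checklist sections on this slide can be pictured as a small XML document. The following is a minimal sketch only, not the output of the IHSN editor: the element names (`codeBook`, `stdyDscr`, `fileDscr`, `dataDscr`, `var`, `otherMat`) are taken from the DDI Codebook vocabulary as I understand it, the tree is heavily simplified and not validated against the DDI schema, and the survey title, file name, and variables are hypothetical.

```python
import xml.etree.ElementTree as ET


def build_codebook(title, file_name, variables, related):
    """Build a simplified, DDI-style codebook tree (illustrative, unvalidated)."""
    root = ET.Element("codeBook")

    # Study description: here only the study title.
    stdy = ET.SubElement(root, "stdyDscr")
    citation = ET.SubElement(stdy, "citation")
    titl_stmt = ET.SubElement(citation, "titlStmt")
    ET.SubElement(titl_stmt, "titl").text = title

    # File description: here only the data file name.
    file_dscr = ET.SubElement(root, "fileDscr")
    file_txt = ET.SubElement(file_dscr, "fileTxt")
    ET.SubElement(file_txt, "fileName").text = file_name

    # Variable description: name, label, and literal question text.
    data_dscr = ET.SubElement(root, "dataDscr")
    for name, label, question in variables:
        var = ET.SubElement(data_dscr, "var", name=name)
        ET.SubElement(var, "labl").text = label
        qstn = ET.SubElement(var, "qstn")
        ET.SubElement(qstn, "qstnLit").text = question

    # Related materials: questionnaires, manuals, reports.
    for label in related:
        other = ET.SubElement(root, "otherMat")
        ET.SubElement(other, "labl").text = label

    return root


# Hypothetical survey used only for illustration.
codebook = build_codebook(
    title="Example Household Survey 2008",
    file_name="household.csv",
    variables=[
        ("hhsize", "Household size", "How many people live in this household?"),
        ("region", "Region of residence", "In which region is the household located?"),
    ],
    related=["Household questionnaire", "Interviewer manual"],
)
xml_text = ET.tostring(codebook, encoding="unicode")
print(xml_text)
```

Keeping the study, file, variable, and related-material sections in one structured document is what lets the same metadata feed a catalogue, a PDF codebook, or a web page.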

  6. IHSN DDI Metadata Editor Documenting the study: sampling, data collection, scope and coverage, etc.

  7. IHSN DDI Metadata Editor Documenting files and variables: formulation of question, interviewer’s instructions, computation of variables, etc.

  8. IHSN DDI Metadata Editor Metadata in XML format can be “transformed” into HTML, PDF, or other formats
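
To illustrate what “transforming” XML metadata can mean, here is a hand-written sketch that renders a variable list as an HTML table. It is not how the IHSN editor works (the slide does not say which transformation technology is used), and the sample XML is a hypothetical, simplified DDI-style fragment.

```python
import html
import xml.etree.ElementTree as ET

# Hypothetical, simplified DDI-style metadata (not a schema-valid DDI file).
SAMPLE_XML = """<codeBook>
  <dataDscr>
    <var name="hhsize"><labl>Household size</labl></var>
    <var name="region"><labl>Region of residence</labl></var>
  </dataDscr>
</codeBook>"""


def variables_to_html(xml_text):
    """Render the variable names and labels found in the XML as an HTML table."""
    root = ET.fromstring(xml_text)
    rows = []
    for var in root.findall("dataDscr/var"):
        # Escape values so that metadata text cannot break the HTML markup.
        name = html.escape(var.get("name", ""))
        label = html.escape(var.findtext("labl", default=""))
        rows.append(f"<tr><td>{name}</td><td>{label}</td></tr>")
    header = "<tr><th>Variable</th><th>Label</th></tr>"
    return "<table>\n" + "\n".join([header] + rows) + "\n</table>"


page = variables_to_html(SAMPLE_XML)
print(page)
```

Because the metadata is stored once in a structured form, other outputs (a PDF codebook, a catalogue entry) could be produced from the same source without re-entering anything.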

  9. Microdata cataloguing XML/DDI metadata is web-ready, “browsable and searchable”

  10. Microdata dissemination
     • Growing demand for microdata
     • Potential to add much value to existing data
     • But requires:
        • Enabling legislation
        • Formal policy/procedures (IHSN guidelines)
        • Technical capacity to prepare data for dissemination
        • Documenting, cataloguing
        • Anonymizing (IHSN tools being tested)
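
As a rough illustration of the kind of check that anonymizing involves, the sketch below flags records whose combination of identifying characteristics is shared by fewer than `k` records in the file. This is a generic frequency check, not the IHSN tools mentioned on the slide; the function name, the threshold `k`, and the sample records are all hypothetical.

```python
from collections import Counter


def risky_records(records, quasi_identifiers, k=3):
    """Return the indexes of records whose key combination occurs fewer than k times."""
    keys = [tuple(record[q] for q in quasi_identifiers) for record in records]
    counts = Counter(keys)
    return [index for index, key in enumerate(keys) if counts[key] < k]


# Hypothetical records: the last respondent is unique in the file.
sample = [
    {"region": "North", "age_group": "30-39", "sex": "F"},
    {"region": "North", "age_group": "30-39", "sex": "F"},
    {"region": "North", "age_group": "30-39", "sex": "F"},
    {"region": "South", "age_group": "80+", "sex": "M"},
]
flagged = risky_records(sample, ["region", "age_group", "sex"], k=3)
print(flagged)
```

Records flagged this way would typically be candidates for recoding or suppression before a file is released; the appropriate threshold and treatment depend on the producer's dissemination policy.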

  11. Data and metadata preservation
     Situation in many countries: documents in hard copy only, outdated storage media, multiple versions of datasets, much information lost (or never generated).
     Goal: Data and documentation remain readable, meaningful, understandable, accessible → manage hardware, software and storage media (not only backups; also “migration”)
     On-going: IHSN-ICPSR guidelines (Open Archival Information System - OAIS; ISO 14721)
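
One common way to support “migration” from old to new storage media is to compare checksums before and after the copy. The sketch below is an assumption-laden illustration, not a procedure taken from the IHSN-ICPSR guidelines or from OAIS: it uses SHA-256 from the Python standard library, and the folder and file names are hypothetical.

```python
import hashlib
import shutil
import tempfile
from pathlib import Path


def sha256_of(path):
    """Compute the SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(folder):
    """Map each file's relative path to its checksum."""
    folder = Path(folder)
    return {
        str(path.relative_to(folder)): sha256_of(path)
        for path in sorted(folder.rglob("*"))
        if path.is_file()
    }


def verify_migration(source, target):
    """List source files that are missing from, or differ in, the target."""
    src = build_manifest(source)
    dst = build_manifest(target)
    return [name for name, digest in src.items() if dst.get(name) != digest]


# Demonstration with temporary folders standing in for old and new media.
with tempfile.TemporaryDirectory() as tmp:
    old_media = Path(tmp) / "old_media"
    new_media = Path(tmp) / "new_media"
    old_media.mkdir()
    (old_media / "survey.csv").write_text("hhid,hhsize\n1,4\n")
    shutil.copytree(old_media, new_media)
    problems = verify_migration(old_media, new_media)

    # Simulate damage on the new media and check again.
    (new_media / "survey.csv").write_text("corrupted")
    damaged = verify_migration(old_media, new_media)

print(problems, damaged)
```

A checksum only shows that the bytes survived; keeping the data meaningful and understandable still depends on the documentation stored alongside it.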

  12. Conclusions and recommendations
     • NSOs do not need to have all the features of advanced data centers, but data archiving is part of their mandate
     • Documentation and preservation are a MUST, even if you don’t disseminate
     • Good practices and standards are relatively easy to implement
     • Good documentation of past surveys helps improve the quality of future surveys
