1 / 10

IPUMS-International Integration Process

IPUMS-International Integration Process. Matt Sobek Minnesota Population Center sobek@umn.edu. June 2011 Data Release. Input material. Pre-processing. Standardization. Integration. Data files. Reformat data Donation Draw sample Confidentiality. Code clean-up Verify data.

fordon
Download Presentation

IPUMS-International Integration Process

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. IPUMS-InternationalIntegration Process Matt SobekMinnesota Population Centersobek@umn.edu

  2. June 2011 Data Release

  3. Input material Pre-processing Standardization Integration Data files Reformat data Donation Draw sample Confidentiality Code clean-up Verify data Harmonize codes Variable programming Constructed variables GIS boundary files Data dictionary Questionnaires Enum instructions Sample information Translate to English Images to editable files Ipums data dictionary Tag enumeration text Document sourcevariables Variable descriptions Sample design

  4. End Matt SobekMinnesota Population Centersobek@umn.edu

  5. Confidentiality Measures • Swap a small percentage of cases between geographic areas. • Suppress low-level geographic variables. • Recode geographic units to ensure small localities cannot be identified (typically those with fewer than 20,000 persons). • For recent censuses: • Recode cells representing very small numbers of persons in the population (into a residual or combined with a larger category). • Top- or bottom-code continuous variables with a thin tail. • Suppress specific categories of variables as requested by the NSO. • Suppress entire variables as requested by the NSO.

  6. Harmonize Codes: Translation Matrix for Marital Status China 1982 Colombia 1973 Kenya 1989 Mexico 1970 U.S.A. 1990

  7. Constructed “Pointer” Variables (Simple household) Spouse’s 2 1 0 0 0 0 Mother’s Father’s 0 0 0 0 0 0 2 1 2 1 2 1 (Colombia 1985)

  8. Census Questionnaire Image (Mexico 2000) Water Access

  9. Text of Census Questionnaire (Mexico 2000)

  10. XML-Tagged Census Questionnaire (Mexico 2000) Water access

More Related