1 / 62

Census 2010: Accessing Census Data

Census 2010: Accessing Census Data . THURSDAY, July 21, 2011 10-11:30am. Quick Review. 2010 decennial data is “short-form” only – demographic characteristics; ACS now source of “long-form” type of data Census Data released in two “flavors” – Aggregate Data Microdata

varuna
Download Presentation

Census 2010: Accessing Census Data

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Census 2010: Accessing Census Data THURSDAY, July 21, 2011 10-11:30am

  2. Quick Review 2010 decennial data is “short-form” only – demographic characteristics; ACS now source of “long-form” type of data Census Data released in two “flavors” – Aggregate Data Microdata A third type of data product identifies geographic boundaries Aggregate Data released in a variety of products, differing in content, geographic specificity and temporal coverage Microdata has flexibility of individual level information, but balances this by only gross geographic detail

  3. Access and Resources Aggregate data resources Microdata resources Geography resources Documentation resources Visualization resources Local resources

  4. Access and Resources Aggregate resources: Current: American Factfinder, DataFerrett, Uexplore/Dexter Historical: NHGIS, ICPSR, Historical Census Browser Microdata resources Online Analysis: SDA & IPUMS Extract/Download: IPUMS, ICPSR, NBER, DataFerrett, Census Geographic: Census, MABLE/Geocorr, IPUMS Documentation: IPUMS, AFF2, ICPSR Visualization: Social Explorer Local Resources: DOF/DRU, SDC’s, UC DATA, DataLab

  5. Aggregate Data Resources

  6. The “old” American Factfinder

  7. The “new” American Factfinder

  8. Alternative to AFF: FTP Full Files

  9. Same FTP Options for ACS

  10. Historical Census Data Browser

  11. Micro-data Resources

  12. Survey Documentation and Analysis (SDA)and the Integrated Public Use Microdata Samples (IPUMS)

  13. The Integrated Public Use Microdata Samples

  14. The Integrated Public Use Microdata Samples www.ipums.org at the Minnesota Population Center IPUMS-USAHarmonized data on people in the U.S. census and American Community Survey, from 1850 to the present. IPUMS-CPSHarmonized data on people in the Current Population Survey, every March from 1962 to the present Important! Harmonized: Questions asked change over time: How to make data comparable? Integrated: Multiple data collections & surveys simultaneously available Microdata: The underlying individual-level data is available, not just pre-defined tables.

  15. The American Community Survey and the Current Population Survey CPS – Long-running monthly survey (dating back to the 1940’s) focused on labor force characteristics (unemployment, earnings, hours worked). ~ 55,000 sample HH’s, multiple interviews, personal In addition to the basic monthly questions, additional modules are “piggy-backed” onto the survey to provide more depth on particular topics. Most widely used supplement is the Annual Social and Economic Supplement (ASEC) - aka Annual Demographic Survey or the March Files. (~100,000 HH’s) In-depth survey – lots of detail about sources of income, work, occupational, hours, etc. (as well as core demographic information on race/ethnicity, nativity, age, sex, educataion)

  16. The American Community Survey and the Current Population Survey ACS – “New” continuous survey, replaces the long form of the decennial census, first fully implemented in 2005 (non-institutionalized) and 2006 (institutionalized). ~ 2,000,000 HH’s annually, mixed mail-in/personal interviews Substantial overlapping content with CPS Broader range of content, somewhat less detail Larger sample sizes allow for greater geographic detail

  17. The American Community Survey and the Current Population Survey Microdata vs. Aggregate

  18. The Integrated Public Use Microdata Samples www.ipums.org at the Minnesota Population Center Strengths: Tremendous centralized documentation Many “value-added” data items Wonderful extraction engine (if downloading data) Multiple statistical Packages supported Online Analysis also possible

  19. The Integrated Public Use Microdata Samples Online Analysis Links

  20. The Basics of SDA • What is SDA? • What can you do with SDA? • The parts of the SDA interface • Menu • Variable List • Active variables • Analysis Specification

  21. But… Before we go live…. Part II. Working with SDA 1.Parts of the SDA interface 2.Finding data/variables/subjects - search - documentation 3. Analysis - Components - rows, columns, selection, controls Procedures - crosstabs, means, correlations 4. Aids in Analysis Recoding Saving new variables Downloading

  22. The Basics of SDA What is SDA? SDA (Survey Documentation and Analysis) is a set of programs for the documentation and Web-based analysisof survey data. It was developed and is maintained by the Computer-assisted Survey Methods Program (CSM) at UC Berkeley. It was developed as a companion program with CASES (Computer Assisted Survey Execution Program), a package for collecting survey data based on structured questionnaires, using a variety of modes of data collection. It operates on a transposed file structure, which makes analysis of datasets, especially large datasets, extremely fast.

  23. Part I. The Basics of SDA What is SDA? SDA (Survey Documentation and Analysis) is a set of programs for the documentation and Web-based analysisof survey data. It was developed and is maintained by the Computer-assisted Survey Methods Program (CSM) at UC Berkeley. It was developed as a companion program with CASES (Computer Assisted Survey Execution Program), a package for collecting survey data based on structured questionnaires, using a variety of modes of data collection. It operates on a transposed file structure, which makes analysis of datasets, especially large datasets, extremely fast.

  24. Part I. The Basics of SDA • What data is available in SDA? • LOTS! • Many popular social science datasets (e.g. the GSS, the ANES, the PUMS from the Decennial Census, the ACS, the CPS Annual Demographic Files,…… can be found in SDA format. • Many archives (ICPSR, IPUMS, CPANDA, Roper, SDA, UCDATA….) provide at least some of their holdings in SDA format.

  25. Multiple Census Samples at IPUMS (http://usa.ipums.org/usa/sda/)

  26. And CPS (March files) data, as well (http://cps.ipums.org/cps/sda/)

  27. The Basics of SDA • What can you do with SDA? • SDA can be used to: • learn about a dataset (metadata, paradata) • search for variables of interest • investigate sample sizes and variable distributions • perform statistical analyses • transform, manipulate and create variables for each unit • extract and download subsets or full datasets

  28. Part I. The Basics of SDA • The four parts of the SDA interface • Action Menu • Variable List • Active Variable • Analysis Specification

  29. Action Menu

  30. Collapsed Variable Tree

  31. Active Variables

  32. Analysis Specification

  33. 2. Finding data/variables/subjects Online SDA codebook IPUMS detailed documentation

  34. Working with SDA Analysis – Components - rows, columns, selection, controls Procedures -crosstabs, means, correlations Screens will vary depending upon what procedure you are using. Start with exploratory – frequencies, cross-tabulations

  35. The variables you are interested in Who to include in the table

  36. Part II. Working with SDA Aids in Analysis Recoding Saving new variables Downloading

  37. Recoding variables – on the fly Can be used in row, column, control (Crosstabs) age (5-18)Selects, but does not collapse age (r: 5-18) Selects AND Collapses age (d: 5-18) Collapses, but does not select age (c:13,5) Collapses into categories of width w age (c:st,w) starting with value st Recoding variables – Web interface

  38. Question 1: Use the CPS or ACS? Question 2: What is the desired level of analysis (person, family, household)? Question 3: Who should be excluded? (How to limit to family households, or only particular age groups, or….?

  39. DataFerrett Content

More Related