Data Reference (the very, very basics)



Presentation Transcript


  1. Data Reference (the very, very basics)

  2. Data-reference: what do we need? • Tools • Strategies • Terminology • Understanding of what we are looking for: not books or articles -- or facts.

  3. Data-reference: what do we need? • Understanding of what we are looking for: not books or articles -- or facts. • Terminology • Strategies • Tools

  4. La trahison des images, The treachery of images, Rene Magritte

  5. Ceci n’est pas les “data.” C’est les statistiques! (This is not the “data.” It’s the statistics!)

  6. Data vs. Statistics
      Data: raw (for analysis); intended for use by computer; computer-readable; collected based on social science methodologies or administrative procedures.
      Statistics: cooked (facts); for human use (eye-readable: charts, tables, graphs); can be print, micro, or computer-readable; produced from data.

  7. Data

  8. Statistics

  9. + = Where do statistical babies come from?

  10. Data or Statistics: Why does it matter? • Different search strategies and tools. • Defines your goal. • Helps you know when you've found it!

  11. Tip: Data or Statistics? • Determine if the user wants (needs) statistics or data. • Do you want one number? • Are you looking for a fact or figure? • Do you want to know “how many?”

  12. Tip: Data or Statistics? • Determine if the user wants (needs) statistics or data. • Or… do you want a series of numbers? • Do you want to identify trends, make comparisons, model relationships? • Will you be using statistical software (not Excel)?

  13. http://factfinder.census.gov/

  14. http://www.census.gov/compendia/statab/elections/election.pdf

  15. http://www.census.gov/compendia/statab/tables/06s0405.xls

  16. ftp://ftp.bls.gov/pub/special.requests/lf/aat44.txt

  17. http://www.bls.gov/webapps/legacy/cpsatab7.htm

  18. From survey to data to statistics…
      Survey instrument
      Q1. [enter zip code]
      Q2. [enter R’s first name]
      Q3. [enter sex of R]
      Q4. What was your major in College?
      Q5. What was your income last year?
      Q6. Did you go to church last week?

  19. Answers to Questions
      Zip    Name     Sex  Major    income  church
      29002  Wilma    F    lit      0       y
      99005  Barney   M    engin    10      n
      99005  Betty    F    .        0       n
      92005  Ethel    F    theater  1000    y
      12534  Fred M.  M    PE       10000   y
      12534  Lucy     F    lit      700     y
      25000  Ricky    M    music    11000   y
      20000  Fred A.  M    dance    10500   n
      15000  Ginger   F    math     9500    y

  20. Must anonymize the data!
      Zip    Name     Sex  Major    income  church
      29002  Wilma    F    lit      0       y
      99005  Barney   M    engin    10      n
      99005  Betty    F    .        0       n
      92005  Ethel    F    theater  1000    y
      12534  Fred M.  M    PE       10000   y
      12534  Lucy     F    lit      700     y
      25000  Ricky    M    music    11000   y
      20000  Fred A.  M    dance    10500   n
      15000  Ginger   F    math     9500    y

  21. Must anonymize the data!
      Zip    Name     Sex  Major    income  church
      29002  001      F    lit      0       y
      99005  002      M    engin    10      n
      99005  003      F    .        0       n
      92005  004      F    theater  1000    y
      12534  005      M    PE       10000   y
      12534  006      F    lit      700     y
      25000  007      M    music    11000   y
      20000  008      M    dance    10500   n
      15000  009      F    math     9500    y
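
The anonymization step on the slides above (names replaced by 001, 002, ...) could be sketched in Python roughly as follows. This is only an illustration: the function name `anonymize` and the tuple layout are assumptions, not part of the presentation, and only the first three respondents are included.

```python
# Respondent rows copied from the slide: (zip, name, sex, major, income, church).
rows = [
    ("29002", "Wilma", "F", "lit", 0, "y"),
    ("99005", "Barney", "M", "engin", 10, "n"),
    ("99005", "Betty", "F", ".", 0, "n"),
]

def anonymize(rows):
    """Replace each name with a zero-padded sequential ID (001, 002, ...).

    Hypothetical helper for illustration; the slides do not say how the IDs
    were assigned beyond showing the result.
    """
    out = []
    for i, (zip_code, _name, *rest) in enumerate(rows, start=1):
        out.append((zip_code, f"{i:03d}", *rest))
    return out

anonymized = anonymize(rows)
for row in anonymized:
    print(row)
```

Note that replacing names alone may not fully anonymize real data, since other variables (such as zip code) can still help identify a respondent.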

  22. Change Text to Numeric Codes
      Zip    Name     Sex  Major    income  church
      29002  001      F    lit      0       y
      99005  002      M    engin    10      n
      99005  003      F    .        0       n
      92005  004      F    theater  1000    y
      12534  005      M    PE       10000   y
      12534  006      F    lit      700     y
      25000  007      M    music    11000   y
      20000  008      M    dance    10500   n
      15000  009      F    math     9500    y

  23. Change Text to Numeric Codes
      Zip    Name     Sex  Major    income  church
      29002  001      1    lit      0       y
      99005  002      2    engin    10      n
      99005  003      1    .        0       n
      92005  004      1    theater  1000    y
      12534  005      2    PE       10000   y
      12534  006      1    lit      700     y
      25000  007      2    music    11000   y
      20000  008      2    dance    10500   n
      15000  009      1    math     9500    y

  24. Change Text to Numeric Codes
      The “codebook” must document the numeric codes used! For example:
      Variable: “sex”  1 = female  2 = male
      Zip    Name     Sex  Major    income  church
      29002  001      1    lit      0       y
      99005  002      2    engin    10      n
      99005  003      1    .        0       n
      92005  004      1    theater  1000    y
      12534  005      2    PE       10000   y
      12534  006      1    lit      700     y
      25000  007      2    music    11000   y
      20000  008      2    dance    10500   n
      15000  009      1    math     9500    y

  25. Change Text to Numeric Codes
      Zip    Name     Sex  Major    income  church
      29002  001      1    0075     0       y
      99005  002      2    0070     10      n
      99005  003      1    .        0       n
      92005  004      1    0076     1000    y
      12534  005      2    0001     10000   y
      12534  006      1    0075     700     y
      25000  007      2    0077     11000   y
      20000  008      2    0078     10500   n
      15000  009      1    0050     9500    y

  26. Change Text to Numeric Codes
      Zip    Name     Sex  Major    income  church
      29002  001      1    0075     0       1
      99005  002      2    0070     10      2
      99005  003      1    .        0       2
      92005  004      1    0076     1000    1
      12534  005      2    0001     10000   1
      12534  006      1    0075     700     1
      25000  007      2    0077     11000   1
      20000  008      2    0078     10500   2
      15000  009      1    0050     9500    1
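
As a rough Python sketch of the recoding shown on these slides: a codebook can be held as a lookup table per variable. Only the “sex” codes (1 = female, 2 = male) are stated in the slides' codebook text; the major and church codes below are read off the before/after tables, and the names `CODEBOOK` and `recode` are made up for this example.

```python
# Code lists per variable. Only "sex" is documented on the slides; the
# "major" and "church" codes are inferred from the tables (an assumption).
CODEBOOK = {
    "sex": {"F": "1", "M": "2"},
    "major": {"lit": "0075", "engin": "0070", "theater": "0076", "PE": "0001",
              "music": "0077", "dance": "0078", "math": "0050"},
    "church": {"y": "1", "n": "2"},
}

def recode(record):
    """Return a copy of record with text values replaced by their codes.

    Values without a code (for example the missing-value marker ".") and
    variables without a code list pass through unchanged.
    """
    return {var: CODEBOOK.get(var, {}).get(value, value)
            for var, value in record.items()}

record = {"zip": "29002", "id": "001", "sex": "F",
          "major": "lit", "income": "0", "church": "y"}
coded = recode(record)
print(coded)
```

Keeping the code lists in one place mirrors the role of the codebook: the same table that drives the recoding also documents it.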

  27. Change Text to Numeric Codes
      Zip    Name     Sex  Major    income  church
      29002  001      1    lit      0       y
      99005  002      2    engin    10      n
      99005  003      1    .        0       n
      92005  004      1    theater  1000    y
      12534  005      2    PE       10000   y
      12534  006      1    lit      700     y
      25000  007      2    music    11000   y
      20000  008      2    dance    10500   n
      15000  009      1    math     9500    y

  28. Change Text to Numeric Codes
      Zip    Name     Sex  Major    income  church
      29002  001      1    0075     0       y
      99005  002      2    engin    10      n
      99005  003      1    .        0       n
      92005  004      1    theater  1000    y
      12534  005      2    PE       10000   y
      12534  006      1    0075     700     y
      25000  007      2    music    11000   y
      20000  008      2    dance    10500   n
      15000  009      1    math     9500    y

  29. Change Text to Numeric Codes
      Zip    Name     Sex  Major    income  church
      29002  001      1    0075     0       y
      99005  002      2    0070     10      n
      99005  003      1    .        0       n
      92005  004      1    0076     1000    y
      12534  005      2    0001     10000   y
      12534  006      1    0075     700     y
      25000  007      2    0077     11000   y
      20000  008      2    0078     10500   n
      15000  009      1    0050     9500    y

  30. Change Text to Numeric Codes
      Sometimes, even numeric variables are encoded in ranges. For example:
      Variable: “income”  1 = less than 1000  2 = 1000 - 4999  3 = 5000 - 10000  4 = more than 10000  9 = not reported
      Zip    Name     Sex  Major    income  church
      29002  001      1    0075     0       1
      99005  002      2    0070     10      2
      99005  003      1    .        0       2
      92005  004      1    0076     1000    1
      12534  005      2    0001     10000   1
      12534  006      1    0075     700     1
      25000  007      2    0077     11000   1
      20000  008      2    0078     10500   2
      15000  009      1    0050     9500    1

  31. Change Text to Numeric Codes
      Sometimes, even numeric variables are encoded in ranges. For example:
      Variable: “income”  1 = less than 1000  2 = 1000 - 4999  3 = 5000 - 10000  4 = more than 10000  9 = not reported
      Zip    Name     Sex  Major    income  church
      29002  001      1    0075     1       1
      99005  002      2    0070     1       2
      99005  003      1    .        1       2
      92005  004      1    0076     2       1
      12534  005      2    0001     3       1
      12534  006      1    0075     1       1
      25000  007      2    0077     4       1
      20000  008      2    0078     4       2
      15000  009      1    0050     3       1
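
The income ranges from the codebook on these slides can be expressed as a small Python function. This is a sketch under two assumptions not stated in the presentation: incomes are whole numbers, and an unreported income is represented as `None`. The function name `income_code` is invented for the example.

```python
def income_code(income):
    """Map a raw income to the range code from the slide's codebook.

    1 = less than 1000, 2 = 1000 - 4999, 3 = 5000 - 10000,
    4 = more than 10000, 9 = not reported (None here, by assumption).
    """
    if income is None:
        return 9
    if income < 1000:
        return 1
    if income < 5000:
        return 2
    if income <= 10000:
        return 3
    return 4

# Raw incomes in the order they appear in the table.
incomes = [0, 10, 0, 1000, 10000, 700, 11000, 10500, 9500]
codes = [income_code(x) for x in incomes]
print(codes)
```

The resulting codes match the income column of the recoded table. Once incomes are stored as ranges, the exact values cannot be recovered from the data file.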

  32. Data Files do not need “headers”
      Zip    Name     Sex  Major    income  church
      29002  001      1    0075     1       1
      99005  002      2    0070     1       2
      99005  003      1    .        1       2
      92005  004      1    0076     2       1
      12534  005      2    0001     3       1
      12534  006      1    0075     1       1
      25000  007      2    0077     4       1
      20000  008      2    0078     4       2
      15000  009      1    0050     3       1

  33. Data Files do not need “headers”
      29002 001 1 0075 1 1
      99005 002 2 0070 1 2
      99005 003 1 .    1 2
      92005 004 1 0076 2 1
      12534 005 2 0001 3 1
      12534 006 1 0075 1 1
      25000 007 2 0077 4 1
      20000 008 2 0078 4 2
      15000 009 1 0050 3 1

  34. Data Files do not need extra space
      29002 001 1 0075 1 1
      99005 002 2 0070 1 2
      99005 003 1 .    1 2
      92005 004 1 0076 2 1
      12534 005 2 0001 3 1
      12534 006 1 0075 1 1
      25000 007 2 0077 4 1
      20000 008 2 0078 4 2
      15000 009 1 0050 3 1

  35. Data Files do not need extra space
      290020011 0075 1 1
      990050022 0070 1 2
      990050031 .    1 2
      920050041 0076 2 1
      125340052 0001 3 1
      125340061 0075 1 1
      250000072 0077 4 1
      200000082 0078 4 2
      150000091 0050 3 1

  36. Data Files do not need extra space
      2900200110075 1 1
      9900500220070 1 2
      990050031.    1 2
      9200500410076 2 1
      1253400520001 3 1
      1253400610075 1 1
      2500000720077 4 1
      2000000820078 4 2
      1500000910050 3 1

  37. Data Files do not need extra space
      29002001100751 1
      99005002200701 2
      990050031.   1 2
      92005004100762 1
      12534005200013 1
      12534006100751 1
      25000007200774 1
      20000008200784 2
      15000009100503 1

  38. Data Files do not need extra space
      290020011007511
      990050022007012
      990050031.   12
      920050041007621
      125340052000131
      125340061007511
      250000072007741
      200000082007842
      150000091005031
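
To illustrate how the fields end up packed into a single fixed-width record, here is a minimal Python sketch. The field widths are inferred from the records on the slides (only the width of “sex” is documented later in the deck), and left-justifying the missing-value marker “.” within its field is likewise an assumption. The names `LAYOUT` and `pack` are hypothetical.

```python
# (variable, width) pairs in file order. Widths are inferred from the
# records shown on the slides, not taken from a published codebook.
LAYOUT = [("zip", 5), ("id", 3), ("sex", 1),
          ("major", 4), ("income", 1), ("church", 1)]

def pack(record):
    """Concatenate the fields with no separators.

    Each value is left-justified and padded to its field width so that
    every variable always starts in the same column.
    """
    return "".join(str(record[name]).ljust(width) for name, width in LAYOUT)

r1 = {"zip": "29002", "id": "001", "sex": 1, "major": "0075", "income": 1, "church": 1}
r3 = {"zip": "99005", "id": "003", "sex": 1, "major": ".", "income": 1, "church": 2}
line1 = pack(r1)
line3 = pack(r3)
print(line1)
print(line3)
```

Padding short values is what keeps the columns aligned; without it, a record with a missing major would shift every later variable to the left.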

  39. Codebook must document locations
      290020011007511
      990050022007012
      990050031.   12
      920050041007621
      125340052000131
      125340061007511
      250000072007741
      200000082007842
      150000091005031
      For example:
      Variable: “sex”  location: column 9  width: 1

  40. Codebook must document locations
      123456789
      290020011007511
      990050022007012
      990050031.   12
      920050041007621
      125340052000131
      125340061007511
      250000072007741
      200000082007842
      150000091005031
      For example:
      Variable: “sex”  location: column 9  width: 1

  41. Codebook documents question, location, codes.
      290020011007511
      990050022007012
      990050031.   12
      920050041007621
      125340052000131
      125340061007511
      250000072007741
      200000082007842
      150000091005031
      For example:
      Q3. [enter sex of R]
      Variable: “sex”  location: column 9  width: 1
      Variable: “sex”  1 = female  2 = male
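
Reading a record back requires exactly the information the codebook provides. The Python sketch below slices one record using column locations; only the “sex” entry (column 9, width 1, 1 = female, 2 = male) comes from the slides, while the other locations are inferred from the record layout and the names `LOCATIONS`, `SEX_LABELS`, and `parse` are invented for illustration.

```python
# variable -> (start column, width). Columns are 1-indexed, as in the
# codebook. Only "sex" is documented on the slide; the rest are inferred.
LOCATIONS = {
    "zip": (1, 5), "id": (6, 3), "sex": (9, 1),
    "major": (10, 4), "income": (14, 1), "church": (15, 1),
}
SEX_LABELS = {"1": "female", "2": "male"}  # from the slide's codebook

def parse(line):
    """Slice a fixed-width record into named fields using LOCATIONS."""
    return {name: line[start - 1:start - 1 + width].strip()
            for name, (start, width) in LOCATIONS.items()}

rec = parse("990050022007012")
print(rec)
print(rec["sex"], "=", SEX_LABELS[rec["sex"]])
```

Without the locations the record is just a string of digits, which is why the data file and the codebook have to travel together.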

  42. To Use Data You Need 3 Things • Data: the datafile (the raw numbers) • Metadata: the “codebook” (where the numbers are and what they mean) • Statistical Software (for reading the datafile and analyzing the data)

  43. Data
      290020011007511
      990050022007012
      990050031.   12
      920050041007621
      125340052000131
      125340061007511
      250000072007741
      200000082007842
      150000091005031
      + Codebook
      Q3. [enter sex of R]
      Variable: “sex”  location: column 9  width: 1
      Variable: “sex”  1 = female  2 = male
      + Statistical software

  44. Student writes SPSS program (SPSS commands) to analyze data…
      SPSS reads the program.
      SPSS reads the data:
      290020011007511
      990050022007012
      990050031.   12
      920050041007621
      125340052000131
      125340061007511
      250000072007741
      200000082007842
      150000091005031
      And produces charts, tables, analysis, etc.
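
The slides use SPSS for this last step. Purely as an illustration of the same idea (data file + codebook + software = statistics), here is a Python stand-in that counts church attendance by sex. It is not SPSS syntax. The church codes (1 = yes, 2 = no) and the church column (15) are inferred from the earlier tables, and the third record is padded to full width by assumption.

```python
from collections import Counter

# The nine fixed-width records from the slides. The third record is padded
# so that every variable stays in its column (an assumption about the file).
DATA = """\
290020011007511
990050022007012
990050031.   12
920050041007621
125340052000131
125340061007511
250000072007741
200000082007842
150000091005031
"""

# Codebook: "sex" is documented on the slides (column 9); the "church"
# codes and column are inferred from the tables.
SEX = {"1": "female", "2": "male"}
CHURCH = {"1": "yes", "2": "no"}

counts = Counter()
for line in DATA.splitlines():
    sex = SEX[line[8]]         # column 9, width 1
    church = CHURCH[line[14]]  # column 15, width 1
    counts[(sex, church)] += 1

# A small cross-tabulation: these counts are the "statistics".
for (sex, church), n in sorted(counts.items()):
    print(f"{sex:<7} went to church: {church:<4} {n}")
```

The printed counts are statistics in the sense used throughout the deck: human-readable figures produced from the raw data.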
