Dli boot camp 2011 finding statistics tools and techniques
This presentation is the property of its rightful owner.
Sponsored Links
1 / 43

DLI Boot Camp 2011 Finding Statistics: Tools and Techniques PowerPoint PPT Presentation


  • 47 Views
  • Uploaded on
  • Presentation posted in: General

DLI Boot Camp 2011 Finding Statistics: Tools and Techniques. SDA. Jean Blackburn Vancouver Island University Library [email protected] [email protected]: Microdata Analysis & Subsetting.

Download Presentation

DLI Boot Camp 2011 Finding Statistics: Tools and Techniques

An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -

Presentation Transcript


Dli boot camp 2011 finding statistics tools and techniques

DLI Boot Camp 2011Finding Statistics: Tools and Techniques

SDA

Jean Blackburn

Vancouver Island University Library

[email protected]


Sda@chass microdata analysis subsetting

[email protected]: Microdata Analysis & Subsetting

  • University of Toronto CHASS Data Centre runs a web-based statistical package called SDA (Survey Documentation & Analysis) developed at UC Berkeley

  • The [email protected] service links SDA to microdata files from Statistics Canada surveys and more.

  • IP authenticated access to [email protected] is available to DLI institutions for an annual fee


Key features

Key Features

  • Variable-level searching

  • Extraction / subsetting capabilities

  • Access to codebooks

  • Web-based statistical analysis capabilities

  • Ability to recode and compute new variables


Http sda chass utoronto ca

http://sda.chass.utoronto.ca


Dli boot camp 2011 finding statistics tools and techniques

There are both French and English language versions of [email protected], but the French language data catalogue is limited as of the time of this presentation…


Dli boot camp 2011 finding statistics tools and techniques

The [email protected] English language data catalogue contains not only DLI microdata sets, but also open-access data and those restricted to University of Toronto users.


Dli boot camp 2011 finding statistics tools and techniques

Variable-level searching across SDA data sets


Dli boot camp 2011 finding statistics tools and techniques

You can specify a specific data set or cluster of data sets to search, and whether to search survey OR variable-level metadata (but not both)

Click the + to expand the data file clusters and select specific data sets to search


Dli boot camp 2011 finding statistics tools and techniques

You can specify specific fields to search as well…


Dli boot camp 2011 finding statistics tools and techniques

The View button shows you the codebook definition for the variable; “Study” titles link to the SDA interface...


Dli boot camp 2011 finding statistics tools and techniques

…the SDA interface (we’ll come back to it in more detail later).


Dli boot camp 2011 finding statistics tools and techniques

As well as searching, you can browse the [email protected] data catalogues…


Dli boot camp 2011 finding statistics tools and techniques

The French language data catalogue has 3 survey titles available; there are several data sets available for the Recensement de la population de Canada.


Dli boot camp 2011 finding statistics tools and techniques

Click on a data set link to access the SDA interface.


Dli boot camp 2011 finding statistics tools and techniques

You can access the codebook for this data set from the menu bar…


The codebook opens in a separate browser window or tab

The codebook opens in a separate browser window or tab…


Dli boot camp 2011 finding statistics tools and techniques

Use the Variable Selection tool to browse variables in the data set by category.


Dli boot camp 2011 finding statistics tools and techniques

Expand variable categories by clicking the +

Click on variables to select them


Dli boot camp 2011 finding statistics tools and techniques

You can view the codebook definitions for selected variables by clicking the View button beside the “Selected” field.


Dli boot camp 2011 finding statistics tools and techniques

Codebook definition for “immstat” variable

Codebooks include numeric values and labels for responses, record layouts, and unweighted frequencies.


You can also search variables by keyword within a data set

You can also search variables by keyword within a data set


Dli boot camp 2011 finding statistics tools and techniques

Follow the “Search Techniques Help” link for tips on wildcards and field searching. N.B. The SDA help files are English language only.

Click the variable button to select the variable for manipulation in SDA. For example, we can use SDA to get a weighted frequency distribution for the “immstat” variable…


Dli boot camp 2011 finding statistics tools and techniques

To run a weighted frequency for the “Statut d’immigrant” variable (“immstat”)…

1. Copy the selected variable “immstat” to the Row field in the Frequencies/Crosstab analysis tool, using the “Row” button…

2. Adjust table and display options as desired. SDA defaults to weighted cases. I have selected a pie chart type.

3. Click the “Run the Table” button.


The weighted frequency distribution table and chart will open in a new browser window or tab

The weighted frequency distribution table and chart will open in a new browser window or tab…


Charts can be saved as image files to insert into word documents etc

Charts can be saved as image files to insert into Word documents, etc.


Dli boot camp 2011 finding statistics tools and techniques

  • What if we want to see immigration status…

  • for a particular province?

  • for people of a particular age?

  • for men and women separately?

SDA allows you to apply filter and control variables to your analysis.


Dli boot camp 2011 finding statistics tools and techniques

To run a weighted frequency for the “Statut d’immigrant” variable (“immstat”), for British Columbia only, we’ll use the Province variable (“pr”) as a filter…

Select the “pr” variable and click the View button to see the codebook definition…

(Note that the “immstat” variable remains in the Frequencies/

Crosstab program “Row” field. )


Dli boot camp 2011 finding statistics tools and techniques

The codebook definition for the Province variable (“pr”) tells us that the value for British Columbia is 59…


Dli boot camp 2011 finding statistics tools and techniques

Click the Filter button to move the “pr” variable to the Selection Filter(s) field in the SDA Frequencies/Crosstab Program

Enter the value for British Columbia, 59, within the Selection Filter parentheses, and run the table…


Dli boot camp 2011 finding statistics tools and techniques

Filtering for British Columbia results in a different distribution of immigrant status.

Let’s try filtering further – for children under 15 years…


Dli boot camp 2011 finding statistics tools and techniques

Select the Age Group variable (“agegrp”) and click the View button to see the codebook definition…

In the codebook, values

1 to 5 represent children under 15 years…


Dli boot camp 2011 finding statistics tools and techniques

Because we want to filter for both Province and Age Group, it’s important to select “Append” rather than “Replace” before clicking the Filter button to move the variable to the Selection Filter(s) field.

For the agegrp filter, we need to enter a range of values representing children under 15 years (1-5).


Dli boot camp 2011 finding statistics tools and techniques

Here’s the distribution for immigrant status, filtered for province and age.

Now let’s remove the age group filter, and instead compare the immigrant status distribution in British Columbia for men and women…


Dli boot camp 2011 finding statistics tools and techniques

To see distributions for all values of a particular variable (e.g. sex), select the variable and use the Ctrl button to move it to the Control field in the SDA Frequencies/Crosstab program, and run the table…


Dli boot camp 2011 finding statistics tools and techniques

Controlling for sex, we get three different frequency distributions: one for women, one for men and one for all valid cases.

There is not much variation between these distributions!


Dli boot camp 2011 finding statistics tools and techniques

Crosstabs in SDA

To look for a relationship between 2 variable frequency distributions, you can run a crosstabulation in SDA by copying the dependent variable to the Row field and the independent variable to the Column field and running a table.

Let’s use the 2006 Aboriginal Peoples Survey: Adults to see whether living in urban or rural environments affects ability to speak an Aboriginal language…


Dli boot camp 2011 finding statistics tools and techniques

Ability to speak an Aboriginal language (“bg01”) is the dependent variable and gets copied to the Row field. Geography (“geo”) is the independent variable and gets copied to the Column field. I’ve chosen the Stacked Bar Chart chart type.


Dli boot camp 2011 finding statistics tools and techniques

The crosstabulation table and chart show that Aboriginal peoples living in Census Metropolitan Areas are less likely to speak an Aboriginal language than those living in rural areas.

The stacked bar chart effectively portrays the dramatic difference between the Arctic and the other Geography values.


Dli boot camp 2011 finding statistics tools and techniques

Other analysis methods are available in SDA, under the Analysis menu


Dli boot camp 2011 finding statistics tools and techniques

In SDA, you can download custom data sets with the

Download  Customized Subset command


Dli boot camp 2011 finding statistics tools and techniques

Specify the data and syntax (e.g. SPSS) file formats you want…

Add filtering criteria for cases to include, if desired…

Select All, Some or None of the variable categories to be included in your custom data set. If you select Some for any category, you’ll be able to select specific variables on the next screen, after clicking the Continue button…


Dli boot camp 2011 finding statistics tools and techniques

Hold down the Shift or Ctrl key to select contiguous or non-contiguous variables from the lists provided, and click Continue…

Check over your selections and click the “Create the Files” button…


Dli boot camp 2011 finding statistics tools and techniques

Right-click the links to the files to download them to your local computer or network!


  • Login