Overview of sample surveys
Download
1 / 63

OVERVIEW OF SAMPLE SURVEYS - PowerPoint PPT Presentation


  • 229 Views
  • Updated On :

OVERVIEW OF SAMPLE SURVEYS. Mehdi Nassirpour,Ph.D. Illinois Department of Transportation. This presentation was part of the Applied Sampling Workshop at the Annual TRB Conference in Washington DC in January 2004. HOW GOOD MUST THE SAMPLE BE?.

loader
I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
capcha
Download Presentation

PowerPoint Slideshow about 'OVERVIEW OF SAMPLE SURVEYS' - royal


An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript
Overview of sample surveys

OVERVIEW OF SAMPLE SURVEYS

Mehdi Nassirpour,Ph.D.

Illinois Department of Transportation

This presentation was part of the Applied Sampling Workshop at the

Annual TRB Conference in Washington DC in January 2004.


How good must the sample be
HOW GOOD MUST THE SAMPLE BE?

  • There is no uniform standard of quality that must be reached by every sample.

  • The quality of the sample depends entirely on the stage of the research and how the information will be used.

Division of Traffic Safety at IDOT


Current population survey
CURRENT POPULATION SURVEY

  • CPS is a monthly survey of households.

  • It provides data on the labor force, employment, unemployment, and persons not in the labor force.

  • This is a precise and controlled sample since it is the only source of monthly estimates of total employment and unemployment.

  • The sampling error for this kind of sample is about 0.1 percent

Division of Traffic Safety at IDOT


Public perception of illinois safety belt use
PUBLIC PERCEPTION OF ILLINOIS SAFETY BELT USE

  • A sample of 500 Illinois residents over 18 years of age were selected. Although to achieve equal sample reliability, the sample size for a state or local geographic area would need to be virtually as large as if the study were a national sample of the US, one generally finds that local samples are smaller. That is, although the public attitudes toward safety belt issues are as important, the level of research funds available is smaller for a state than for a national study.

Division of Traffic Safety at IDOT


Inappropriate sample design
INAPPROPRIATE SAMPLE DESIGN

  • Whether or not a sample design is appropriate depends on how it is used and the resources available. It may be fair to say that the sample generalizations made from the sample go too far.

Division of Traffic Safety at IDOT


What is the appropriate sample design
WHAT IS THE APPROPRIATE SAMPLE DESIGN?

  • DEGREE OF ACCURACY

  • RESOURCES

  • TIME

  • ADVANCED KNOWLEDGE OF THE POPULATION

  • NATIONAL VERSUS LOCAL

  • NEED FOR STATISTICAL ANALYSIS

Division of Traffic Safety at IDOT


Small scale sample with limited resources
SMALL-SCALE SAMPLE WITH LIMITED RESOURCES

  • Generalizability

  • Sample size

    • Too small for a meaningful analysis

    • Adequate for some but not all major analyses

    • Adequate for the purpose of study

  • Sample Execution

    • Poor response rate

    • Careless field work

  • Use of resources

Division of Traffic Safety at IDOT


Stages in the

Selection

of a Sample

Define the target Population

Select a sampling frame

Determine if probability or non-probability sampling will be chosen

Plan procedures for selecting sampling units

Determine sample size

Select actual sampling units

Conduct field work


Target population
TARGET POPULATION

  • RELEVANT POPULATION

  • OPERATIONALY DEFINE

Division of Traffic Safety at IDOT


Defining population
DEFINING POPULATION

1.DEFINITION OF TARGET POPULATION

  • Complete set of individuals from which information is collected

  • TARGET AREA

    • Entire region or set of locations from which information is collected

  • Example: define population for a study of elderly in Springfield, IL

  • How will you distinguish the elderly from the non- elderly?

  • Will the elderly be defined by occupational categories? Do you want retired people? Or do you want persons over 65 and retired?

  • Division of Traffic Safety at IDOT


    Sampling frame
    SAMPLING FRAME

    • A LIST OF ELEMENTS FROM WHICH SAMPLE MAY BE DRAWN

    • WORKING POPULATION

    • MAILING LIST--DATABASE

    • SAMPLING FRAME ERROR

    Division of Traffic Safety at IDOT


    Sampling frame examples
    SAMPLING FRAME (Examples)

    CONSTRUCTION OF OPERATIONAL SAMPLING FRAME

    • List of all subjects in the population

    • Specific definition of population

    • Wish to have a sampling frame that is almost or exactly identical to the entire population

    • Example: use of telephone surveys of voter preferences for political parties

    • Population of interest: all voters

      • Sampling frame: all voters with a telephone and who answer it

    • SAMPLED POPULATION – set of all individuals contained in the sampling frame, from which the sample is actually taken.

    • SAMPLED AREA – set of all locations within the study area boundary line that delimits the spatial sampling frame, from which the sample is drawn

    Division of Traffic Safety at IDOT


    Sampling units
    SAMPLING UNITS

    • GROUP SELECTED FOR THE SAMPLE

    • PRIMARY SAMPLING UNIT (PSU)

    • SECONDARY SAMPLING UNIT

    • TERTIARY SAMPLING UNIT

    Division of Traffic Safety at IDOT


    Sampling errors
    SAMPLING ERRORS

    • SAMPLING FRAME ERROR (STUDY DESIGN)

    • RANDOM SAMPLING ERROR (SAMPLING VARIABILITY)

    • NONRESPONSE ERROR (MEASUREMENT BIASES)

    Division of Traffic Safety at IDOT


    Random sampling error
    RANDOM SAMPLING ERROR

    • DIFFERENCE BETWEEN THE SAMPLE RESULT AND THE RESULT OF A CENSUS CONDUCTED USING IDENTICAL PROCEDURES

    • STATISTICAL FLUCTUATION DUE TO CHANCE VARIATIONS

    Division of Traffic Safety at IDOT


    Systematic errors
    SYSTEMATIC ERRORS

    • NONSAMPLING ERRORS

    • UNREPRESENTATIVE SAMPLE RESULTS

    • NOT DUE TO CHANCE

    • DUE TO STUDY DESIGN OR IMPERFECTIONS IN EXECUTION

    Division of Traffic Safety at IDOT


    Sources of n0n sampling errors
    SOURCES OF N0N-SAMPLING ERRORS

    • Under-representation

      • poor, homeless, prison inmates

      • opinion polls over telephones will miss 6% of population that do not have phones

    • Non-response

      • when selected individuals are not contacted or do not respond

      • usually 30%

      • results in bias

    • Interviewing skills - important not to introduce bias

      • types of questions asked

      • attitude during interviewing

      • wording of questions - confusing, misleading, intimidating


    Sources of sampling error
    SOURCES OF SAMPLING ERROR

    • Inadequate sample size

    • The smaller the sample, the more difficult it will be for that sample to truly capture the characteristics of a population

      • Imprecise sample/results

    • The larger the sample, the better

    • But, collecting large samples costs money and resources

    • In reality, a balance needs to be struck between collecting extensive samples and spending a lot of money and resources and saving money but not having enough data to draw conclusions from

    Division of Traffic Safety at IDOT


    Relationship between total error and sampling and non sampling errors
    Relationship Between Total Error and Sampling and Non-Sampling Errors

    Sampling

    Error

    Total Error

    Non-sampling Error

    Division of Traffic Safety at IDOT


    Two types of sampling
    TWO TYPES OF SAMPLING Non-Sampling Errors

    • PROBABILITY SAMPLING

    • NONPROBABILITY SAMPLING

    Division of Traffic Safety at IDOT


    Nonprobability sampling
    NONPROBABILITY SAMPLING Non-Sampling Errors

    • CONVENIENCE

    • JUDGMENT

    • QUOTA

    • SNOWBALL

    Division of Traffic Safety at IDOT


    Probability sampling
    PROBABILITY SAMPLING Non-Sampling Errors

    • SIMPLE RANDOM SAMPLE

    • SYSTEMATIC RANDOM SAMPLE

    • STRATIFIED SAMPLE

    • CLUSTER SAMPLE

    • MULTISTAGE RANDOM SAMPLE

    Division of Traffic Safety at IDOT


    Convience sampling
    CONVIENCE SAMPLING Non-Sampling Errors

    • Obtaining a sample of people or units that are most convenient.

    Division of Traffic Safety at IDOT


    Judgment sampling
    JUDGMENT SAMPLING Non-Sampling Errors

    • Selecting a sample based on judgment of an individual about some appropriate characteristics required from the sample member.

    Division of Traffic Safety at IDOT


    Quota sampling
    QUOTA SAMPLING Non-Sampling Errors

    • Requires that the various subgroups in a population are represented .

    • It should not be confused with stratified sampling.

    Division of Traffic Safety at IDOT


    Snowball sampling
    SNOWBALL SAMPLING Non-Sampling Errors

    • Requires additional respondents are obtained from information provided by the initial sample of respondents.

    Division of Traffic Safety at IDOT


    Judgment sampling1
    JUDGMENT SAMPLING Non-Sampling Errors

    • Selecting a sample based on judgment of an individual about some is appropriate.

    Division of Traffic Safety at IDOT


    Simple random sample
    SIMPLE RANDOM SAMPLE Non-Sampling Errors

    • A sampling procedure that ensures that each element in the population will have an equal chance of being included in the sample.

    Division of Traffic Safety at IDOT


    How to choose random sample
    HOW TO CHOOSE RANDOM SAMPLE Non-Sampling Errors

    • Assign each element within the sampling frame a unique number (1-99).

    • Identify a random start from the random number table.

    • Determine how the digits in the random number table will be assigned to the sampling frame.

    • Select the sample elements from the sampling frame.

    Division of Traffic Safety at IDOT


    Systematic random sample
    SYSTEMATIC RANDOM SAMPLE Non-Sampling Errors

    • Identify the total number of elements in the population

    • Identify the sampling ratio K/n (K=total population size/n=size of desired sample)

    • identify the random start.

    • Draw a sample by choosing every kth entry

    Division of Traffic Safety at IDOT


    EXAMPLE OF Non-Sampling Errors

    SYSTEMATIC

    RANDOM SAMPLE

    Division of Traffic Safety at IDOT


    Stratified random sample
    STRATIFIED RANDOM SAMPLE Non-Sampling Errors

    • Sub-samples are drawn within different strata.

    • Each stratum in more or less equal on some characteristics.

    Division of Traffic Safety at IDOT


    Reasons for stratified random sample
    REASONS FOR STRATIFIED RANDOM SAMPLE Non-Sampling Errors

    • Make a sample more efficient since variance differs between the strata.

    • Reduce sampling error between strata.

    • Reduce number of cases required in order to achieve a given degree of accuracy.

    Division of Traffic Safety at IDOT


    Types of stratified random sample
    TYPES OF STRATIFIED RANDOM SAMPLE Non-Sampling Errors

    • Proportionate Stratified Random Sample

    • Disproportionate Stratified Random Sample

    Division of Traffic Safety at IDOT


    Proportionate stratified random sample
    PROPORTIONATE STRATIFIED RANDOM SAMPLE Non-Sampling Errors

    • It is used to get a more representative sample than might be expected under SRS.

    • Reduces sampling errors between strata with respect to the relative numbers selected. This is true when we have homogeneous groups.

    • Population strata must be known in order to draw a proportionate stratified sample.

    Division of Traffic Safety at IDOT


    Disproportionate stratified random sample
    DISPROPORTIONATE STRATIFIED RANDOM SAMPLE Non-Sampling Errors

    • It is used to manipulate the number of cases selected in order to improve efficiency of the design.

    • The main interest is to study separate sub-populations represented by the strata rather than on the entire population

    Division of Traffic Safety at IDOT


    Typical examples of stratified random sample
    TYPICAL EXAMPLES OF STRATIFIED RANDOM SAMPLE Non-Sampling Errors

    • More popular examples are demographics, Age, Gender, Race, Region, Road type, Urban/Rural.

    Division of Traffic Safety at IDOT


    Weighting the sample
    WEIGHTING THE SAMPLE Non-Sampling Errors

    • Reason for weighting is to correct problems associated with sample bias (sampling and non-sampling ).

    • Known Sampling biases, such as household selected by random digit dialing will have more than one phone number.

    Division of Traffic Safety at IDOT


    Weighting process
    WEIGHTING PROCESS Non-Sampling Errors

    • Assign a weight that is equal to the inverse of its probability of selection. In this case, where all sample elements have had the same chance of selection, given the same weight: 1. (This is called self-weighting sample)

    Division of Traffic Safety at IDOT


    Weighting example
    WEIGHTING EXAMPLE Non-Sampling Errors

    Nonwhite Female weight =7.2/12.3=0.59

    Nonwhite Male weight =3.8/9.8=0.39

    White Female weight = 57.7/56.7=1.02

    White Male weight = 31.2/21.2=1.47

    Division of Traffic Safety at IDOT


    Computation estimates of means and standard errors for stratified sample
    Computation (Estimates of Means, and standard Errors) for Stratified Sample

    • Compute values for each strata and then weight them based on the relative size of the stratum in the population.

    Division of Traffic Safety at IDOT


    WEIGHTING FORMULA Stratified Sample

    Division of Traffic Safety at IDOT



    Estimated standard errors
    Estimated Standard Errors Samples

    County 1:

    County 2:

    County 3:

    Division of Traffic Safety at IDOT


    Estimated mean and variance
    Estimated Mean and Variance Samples

    2

    Division of Traffic Safety at IDOT


    Cluster sampling
    CLUSTER SAMPLING Samples

    • Divide population into a large number of groups, called clusters and then sample among clusters. Finally select all individuals within those clusters.

    • The main reason for cluster sampling is to sample economically while retaining the characteristics of a probability sample.

    Division of Traffic Safety at IDOT


    Types of cluster sampling
    TYPES OF CLUSTER SAMPLING Samples

    • Single -Stage Cluster sampling--Divide population into several hundred census tracts and then select 40 tracts as a sample. Then select every individuals within selected census tracts.

    • Multistage Cluster Sampling--Take a random sample of census tracts within a city. Then within each selected census tract we take a simple random sample of blocks (smaller clusters). Finally we might select every third house and interview every second adult within each of these households

    Division of Traffic Safety at IDOT


    Cluster sampling probability proportionate to size pps
    CLUSTER SAMPLING SamplesProbability Proportionate to Size (PPS)

    • Arrange clusters in a desire order (not necessarily by size)

    • Obtain the size data

    • Sum up the size measures over clusters

    • Determine sampling interval

    • Select a random start

    Division of Traffic Safety at IDOT


    Difference between cluster sampling and stratified sampling
    Difference Between Cluster Sampling and Stratified Sampling Samples

    • Although both types of sample involve divide population into groups, they involve in a opposite sampling operations.

    • In a stratified sample, we sample individuals within every stratum. The sampling errors involve variability within strata. Strata are supposed to be homogeneous as possible and as different as possible from each other.

    • In (single-stage ) cluster sampling, we have no source of sampling error within the clusters because every case is being used. The variability is between the clusters.

    Division of Traffic Safety at IDOT


    Difference between cluster sampling and simple random sample
    Difference Between Cluster Sampling and Simple Random Sample Samples

    • Cluster sample is less efficient than the simple random samples of the same size. But it may cost considerably less.

    • The efficiency can be measured in terms of the size of standard error of estimate, a small standard error indicates high efficiency.

    Division of Traffic Safety at IDOT


    Comparing cluster sampling and simple random sample
    Comparing Cluster Sampling and Simple Random Sample Samples

    These are the variances of the means for cluster and simple random

    samples, Pi represents the population intra-class correlation, and

    the mean number of cases selected from each of the cluster


    Muti stage cluster sampling
    MUTI-STAGE CLUSTER SAMPLING Samples

    • Stratification techniques within the clusters will be used to refine and improve the sample. Examples of this kind of sampling Census, National Safety Belt Survey.

    Division of Traffic Safety at IDOT


    Principal steps to consider in choosing a sample size

    PRINCIPAL STEPS TO CONSIDER IN CHOOSING A SAMPLE SIZE Samples

    Mehdi Nassirpour, Ph.D.

    Illinois Department of Transportation


    After sample design is selected
    AFTER SAMPLE DESIGN IS SELECTED Samples

    • DETERMINE SAMPLE SIZE

    • SELECT ACTUAL SAMPLE UNIT

    • CONDUCT FIELD WORK

    Division of Traffic Safety at IDOT


    Steps in determining sample size
    STEPS IN DETERMINING SAMPLE SIZE Samples

    • Importance of the research or the gains and losses associated with alternative decisions

    • Previous example of sample sizes used in social sciences

    • Confidence Level to be used

    • Degree of accuracy within which we wish to estimate the parameter.

    • Some reasonable estimate of the values of any parameters that may appear in the formula.

    Division of Traffic Safety at IDOT


    Data elements needed to determine sample size
    DATA ELEMENTS NEEDED TO DETERMINE SAMPLE SIZE Samples

    • Mean Value

    • Standard Error

    • Accuracy level

    • Confidence Level

    Division of Traffic Safety at IDOT


    Formula for determining sample size with an example
    Formula for Determining Sample Size with an Example Samples

    .1

    Accuracy Level =

    (Standard deviation of Population)

    Confidence level = 95%

    Example: Determining a sample size to

    estimate the mean number of schooling

    completed by persons with foreign-born

    Parents.

    Division of Traffic Safety at IDOT


    Formula for determining sample size for a categorical variable with an example
    Formula for Determining Sample Size for a Categorical Variable with an Example

    Accuracy Level plus or

    minus 5 percent (95%

    confidence level)

    Steps:

    A. .05/1.96=.0255102

    B. (.0255102)2 =.0006507

    Division of Traffic Safety at IDOT


    Point and interval estimations
    Point and Interval Estimations Variable with an Example

    • Point Estimation: Estimating Population mean using Sample Mean

      • Bias: Estimate is unbiased if the mean of its sampling distribution is equal to value of the parameter being estimated

      • Efficiency of an Estimate: It refers to the degree to which the sampling distribution is clustered about the true value of the parameter. The smaller the the standard error, the greater the efficiency of the estimate.

    • Interval Estimation: It refers to interval estimation of population parameter.

      • Actual procedure used to obtain an interval estimate is Confidence Interval.


    Confidence interval formula
    Confidence Interval Formula Variable with an Example

    Interval would run between 14.02 to 15.98 using 95 percent Confidence level

    Division of Traffic Safety at IDOT


    Confidence interval formula for sample
    Confidence Interval Formula For Sample Variable with an Example

    Interval would run between 45.15 to 58.85 using 99 percent Confidence level

    Division of Traffic Safety at IDOT


    Confidence interval formula for proportions
    Confidence Interval Formula For Proportions Variable with an Example

    Interval would run between 45.15 to 58.85 using 99 percent Confidence level

    Division of Traffic Safety at IDOT


    ad