1 / 35

Sampling

Sampling. Time Series. ● Time Series: One observational unit is observed many times Examples: Mostly macro, finance – The daily price of a stock over a month – Brunei's exports from 1980-2005 – The monthly unemployment rate from January 2000 – December 2004. Advantages:

ernie
Download Presentation

Sampling

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Sampling

  2. Time Series ● Time Series: One observational unit is observed many times Examples: Mostly macro, finance • – The daily price of a stock over a month • – Brunei's exports from 1980-2005 – The monthly unemployment rate from January 2000 – December 2004

  3. Advantages: • – We have many observations of the same thing • – This means we can use inferential statistics for large samples based on one entity Disadvantages: 1.Samples are usually small – Examples: Yearly economic growth for.... 20 years? 2.Observations are not independent and identically distributed – Examples: If economic growth was higher than average in 1997, it was probably higher than average in 1996

  4. 3. Spurious correlation and regression can arise – Examples: • ● We have a time series of the world's annual economic output from 1940 to 2000 • ● We have a time series of the number of countrieswith nuclear weapons from 1940 to 2000 • ● Will the two be correlated? Does it mean anything?

  5. Longitudinal Data ● Longitudinal or Panel Data: Several observational units are surveyed several times Examples: • – We have output per person for every district in Brunei from 1990-2000 • – We observe 2,000 households every year for 10 years. During that time they have births, deaths, job changes, etc.

  6. Advantage: We can control for individual characteristics of the observational unit • Example: We observe people over five years. Some of them move from place to place. • We measure the income difference when a person moves from city to country or country to city Disadvantage:Observations are not independent and identically distributed – Example: ● People who have high incomes in one place will have high incomes in another place ● This means you can predict the person's income one year if you know it in another year

  7. Using Secondary Data Be sure you have: A codebook, which describes the relationship between the survey questions and the data on the computer – Examples: • How are industries categorized? • How are data coded? • Are all questions asked of all people? • A description of the sampling technique, i.e., how the sample was collected

  8. Why do sampling? To learn about the characteristics of a group of people or objects without having to collect information about all of the people or objects of interest. To save money and time. To increase internal validity. Use of multiple data collectors or the passage of large amounts of time can negatively impact internal validity . Well conducted samples can actually be more accurate than collecting the desired data from all of the people or objects of interest.

  9. Sampling Methods

  10. In Class Exercise This exercise will demonstrate the power of random sampling. It involves the following steps: A survey question will be distributed to everyone in class asking for your position on an important current issue. All the responses will be tabulated. A random sample of the responses will be drawn. The results of the sample will be compared to the results from the universe of people in class.

  11. In Class Exercise Survey Questions: 1. Age? A. less than 20 B. 21-30 C. 31-40 D. 41-50 E. Above 50 2. Gender? A. Male B. Female 3. Highest Academic Qualification? A. O Level B. A Level C. National Diploma D. Higher National Diploma E. Bachelors Degree F. Masters Degree. 3. Do you own a facebook account? A. Yes B. No 4. If yes, how often do you update your fb account? A. Everyday B. Once a week C.2-4 days/week D. 5-6days/weeks E. Not applicable F. Others:___________pls specify. 5. What activities do you use your facebook for? A. Personal Use B. Selling C. Advertising D. Others:__________pls specify ID Number ____

  12. Form of the Sample ● Inperson interviews – Advantages: • ● Higher response rate • ● May be the only way to reach some people, especially poor people • ● Can obtain precise answers to technical questions (income, occupation, etc.) – Disadvantages: • ● Expensive • ● People may refuse to answer private questions

  13. Form of the Sample ● InPerson Questionnaires – Advantages: • ● Higher response rate • ● May be the only way to reach some people, especially poor people • ● Good way to ask very private questions – Disadvantages: • ● Expensive • ● Assumes literacy

  14. Solicited responses (by mail, handout, etc.) – Advantages: • ● Relatively inexpensive • ● May get interesting responses – Disadvantages: • ● Low response rate; biased toward extreme views • ● Assumes literacy

  15. Telephone interviews – Advantages: • ● Relatively inexpensive • ● Respondents feel some privacy • ● Reasonable response rate – Disadvantages: • ● Biased toward wealthy in Ethiopia • ● In all countries: Landlines biased toward old, cellphones toward young

  16. Online interviews – Advantages: • ● Very inexpensive; saves inputting costs as well • ● Respondents feel privacy • ● Response rate varies by method of solicitation – Disadvantages: • ● Very biased toward wealthy in some countries • ● Biased toward young everywhere; very poor have less online access in industrialized world

  17. Types of Sampling The following three types of samples are based on the use of probability theory. These types of samples increase external validity (i.e., they produce results which can to some extent be generalized to a broader group). Simple random sample Stratified random sample Cluster samples

  18. Two ways of making a probability sample more representative of the population being studied: Make sure that every unit picked for the sample has the same chance of being picked as any other unit (randomness). Increase the sample size (less important that (1) above).

  19. Proper Size of the Sample Factors that affect what the size of the sample needs to be: The heterogeneity of the population (or strata or clusters) from which the units are chosen. How many population subgroups (strata) you will deal with simultaneously in the analysis. How accurate you want your sample statistics (parameter estimates) to be. How common or rare is the phenomenon you are trying to detect. How much money and time you have.

  20. Calculating Sample Size X2NP(1-P) n = ________________ C2(N-1)+X2P(1-P) Where: n = the required sample size X2 = is the chi-square value for 1 degree of freedom at some desired probability level N = is the size of the population universe (which gets more important as N gets smaller) P = is the population parameter of the variable (set=.5 which is the worst case scenario, meaning maximally heterogeneous for a dichotomous variable) C = the chosen confidence interval Important note: This formula is good for dichotomous variable (yes/no type variable), not more complex variables.

  21. Stratified Sampling Is done whenever it is likely than an important subpopulation will be under represented in a simple random sample. Must know independent variables upon which to stratify Must know the sizes of the strata subpopulations Is complex and more costly Each strata has it's own sampling error. But the aggregate sampling error of the total population is reduced. There is proportionate and disproportionate random sampling

  22. Cluster Sampling Is a way to sample a population when there is no convenient lists or frames (e.g., homeless in shelters or soup kitchens).

  23. Self-Selection Bias Is caused by the unit of observation (e.g., person) choosing whether or not to be a respondent in a survey. If the self-selection process itself is random, it will not compromise the randomness of the selection process. If the self-selection process is not random (is systematic), it will compromise the randomness of the selection process.

More Related