1 / 25

Sampling Distributions of Proportions

Sampling Distributions of Proportions. Parameter. A number that describes the population Symbols we will use for parameters include m - mean s – standard deviation p – proportion (p) a – y-intercept of LSRL b – slope of LSRL. Statistic.

siran
Download Presentation

Sampling Distributions of Proportions

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Sampling Distributions of Proportions

  2. Parameter • A number that describes the population • Symbols we will use for parameters include m - mean s – standard deviation p – proportion (p) a – y-intercept of LSRL b – slope of LSRL

  3. Statistic • A number that that can be computed from sample data without making use of any unknown parameter • Symbols we will use for statistics include x – mean s– standard deviation p– proportion a– y-intercept of LSRL b– slope of LSRL

  4. A distribution is all the values that a variable can be.

  5. The dotplot is a partial graph of the sampling distribution of all sample proportions of sample size 20. If I found all the possible sample proportions – this would be approximately normal! • Toss a penny 20 times and record the number of heads. • Calculate the proportion of heads & mark it on the dot plot on the board. What shape do you think the dot plot will have?

  6. Where x is the number in the sample & n is the sample size Sampling Distribution • Is the distribution of possible values of a statistic from all possible samples of the same size from the same population • In the case of the pennies, it’s the distribution of all possible sample proportions (p) We will use: p for the population proportion and p-hat for the sample proportion

  7. Suppose we have a population of six people: Alice, Ben, Charles, Denise, Edward, & Frank We are interested in the proportion of females. This is called What is the proportion of females? Draw samples of two from this population. How many different samples are possible? The parameter of interest 1/3 6C2 =15

  8. Alice & Ben .5 Alice & Charles .5 Alice & Denise 1 Alice & Edward .5 Alice & Frank .5 Ben & Charles 0 Ben & Denise .5 Ben & Edward 0 Ben & Frank 0 Charles & Denise .5 Charles & Edward 0 Charles & Frank 0 Denise & Edward .5 Denise & Frank .5 Edward & Frank 0 Find the 15 different samples that are possible & find the sample proportion of the number of females in each sample. How does the mean of the sampling distribution (mp-hat) compare to the population parameter (p)? mp-hat = p Find the mean & standard deviation of all p-hats.

  9. Suppose we have a population of six people: Alice, Ben, Charles, Denise, Edward, & Frank Draw samples of three from this population. How many different samples are possible? Find the mean & standard deviation of all p-hats. What do you notice about the means & standard deviations? 6C3 = 20

  10. Formulas: These are found on the formula chart!

  11. Correction factor – multiply by Does the standard deviation of the sampling distribution equal the equation? NO - So – in order to calculate the standard deviation of the sampling distribution, we MUST be sure that our sample size is less than 10% of the population! WHY? We are sampling more than 10% of our population! If we use the correction factor, we will see that we are correct.

  12. Assumptions (Rules of Thumb) • Sample size must be less than 10% of the population (independence) (Population > 10n) • Sample size must be large enough to insure a normal approximation can be used. np> 10 & n (1 – p) > 10

  13. Why does the second assumption insure an approximate normal distribution? What would happen if the fixed number was 100? Remember back to binomial distributions Suppose n = 10 & p = 0.1(probability of a success), a histogram of this distribution is strongly skewed right!

  14. Suppose a binomial distribution has n = 100 and p = 0.1. What is the mean and standard deviation of this distribution? Graph a histogram of this binomial distribution. What shape to do you expect this to be? Notice that these bars are extremely small and extend out to 100 – so this distribution is skewed right However, when n is large enough, the tail will spread into an approximate normal curve m = 10 & s = 3 Since p = .1, we would expect this distribution to be skewed right

  15. Why do we need to also check n(1 – p)? Consider what the histogram looks like when n = 10 and p = .9. We must also check that the upper tail will spread out into an approximate normal curve.

  16. Assumptions (Rules of Thumb) • Sample size must be less than 10% of the population (independence) (Population > 10n) • Sample size must be large enough to insure a normal approximation can be used. np > 10 & n (1 – p) > 10

  17. Chip Activity: • Select three samples of size 5, 10, and 15 and record the number of blue chips. • Place your proportions on the appropriate dotplots. What do you notice about these distributions?

  18. n = 10 n = 20 n = 50 n = 100 0.2 0.2 0.2 0.2 Some proportion distributions where  = 0.2 Let p be the proportion of successes in a random sample of size n from a population whose proportion of S’s (successes) is p.

  19. Based on past experience, a bank believes that 7% of the people who receive loans will not make payments on time. The bank recently approved 200 loans. What are the mean and standard deviation of the proportion of clients in this group who may not make payments on time? Are assumptions met? What is the probability that over 10% of these clients will not make payments on time? Yes – np = 200(.07) = 14 n(1 - p) = 200(.93) = 186 Ncdf(.10, 10^99, .07, .01804) = .0482

  20. Suppose one student tossed a coin 200 times and found only 42% heads. Do you believe that this is likely to happen? Find the probability that a coin would land heads less than 42% of the time. np = 200(.5) = 100 & n(1-p) = 200(.5) = 100 Since both > 10, I can use a normal curve! Find m & s using the formulas. No – since there is approximately a 1% chance of this happening, I do not believe the student did this.

  21. Assume that 30% of the students at SHS wear contacts. In a sample of 100 students, what is the probability that more than 35% of them wear contacts? Check assumptions! mp-hat = .3 & sp-hat = .045826 np = 100(.3) = 30 & n(1-p) =100(.7) = 70 Ncdf(.35, 10^99, .3, .045826) = .1376

  22. Example If the true proportion of defectives produced by a certain manufacturing process is 0.08 and a sample of 400 is chosen, what is the probability that the proportion of defectives in the sample is greater than 0.10? Since n = 400(0.08) = 32 > 10 and n(1-) = 400(0.92) = 368 > 10, it’s reasonable to use the normal approximation.

  23. Example (continued)

  24. Example Suppose 3% of the people contacted by phone are receptive to a certain sales pitch and buy your product. If your sales staff contacts 2000 people, what is the probability that more than 100 of the people contacted will purchase your product? Clearly  = 0.03 and p = 100/2000 = 0.05 so

  25. Example - continued If your sales staff contacts 2000 people, what is the probability that less than 50 of the people contacted will purchase your product? Now  = 0.03 and p = 50/2000 = 0.025 so

More Related