1 / 67

Confidence Intervals and Hypothesis Testing in Statistics

Learn how to compute confidence intervals for population means, dealing with unknown standard deviation, sample size calculation, significance levels, errors, and data transformations in statistics.

Download Presentation

Confidence Intervals and Hypothesis Testing in Statistics

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Sections 9.1 – 9.3

  2. If we want to compute a confidence interval (CI) for a population mean, which formula would we use?

  3. If we want to compute a confidence interval (CI) for a population mean, which formula would we use? It depends! Think!!

  4. If we know the population standard deviation, σ, we would use

  5. Suppose σ is unknown What changes would decrease the width of a confidence interval (CI) for a population mean?

  6. Suppose σ is unknown

  7. Suppose σ is unknown

  8. If we want to compute the sample size needed for a given confidence level and given margin of error, how would we proceed?

  9. If we want to compute the sample size needed for a given confidence level and given margin of error, how would we proceed? Think!!! 2 possible cases

  10. If we want to compute the sample size needed for a given confidence level and given margin of error, how would we proceed? Is population standard deviation, σ: • known • or unknown?

  11. What does P-value mean?

  12. P-value is the probability of getting a result as extreme or more extreme than the result (test statistic) we got from our sample, given the null hypothesis is true.

  13. What is α?

  14. α is the level of significance.

  15. α is the level of significance. How does α relate to P-value?

  16. α is the level of significance. α is the maximum P-value for which the null hypothesis will be rejected.

  17. α is the level of significance. α is the maximum P-value for which the null hypothesis will be rejected. Reject null hypothesis because the P-value of 0.## is less than the significance level of α = 0.05

  18. How does α compare for a two-sided test versus a one-sided test?

  19. How does α compare for a two-sided test versus a one-sided test? It’s the same for both. For example, for a 95% confidence level, α is 0.05 for a two-sided test and a one-sided test.

  20. Remember Errors? What is: • a Type I error? (b) a Type II error?

  21. Errors Type I error is rejecting a true null hypothesis. (b) a Type II error?

  22. Errors Type I error is rejecting a true null hypothesis. Type II error is failing to reject a false null hypothesis.

  23. Errors Type I error is rejecting a true null hypothesis. P(Type I error) = ?

  24. Errors Type I error is rejecting a true null hypothesis. P(Type I error) = α, the level of significance

  25. Why do we transform data?

  26. Why do we transform data? To change skewed data into more normal data.

  27. 15/40 Guideline?

  28. 15/40 Guideline? 15/40 guideline is a set of rules that helps us know when it is appropriate to use a t-interval or t-test for the population mean.

  29. 15/40 Guideline Page 608

  30. If our sample size is 40 or more, do we need to plot the sample data?

  31. If our sample size is 40 or more, do we need to plot the sample data? Yes!! Why?

  32. If our sample size is 40 or more, do we need to plot the sample data? Yes!! Why? Need to check for outliers.

  33. Page 610, E41 Pretend that each data set described is a random sample and that you want to do a significance test or construct a confidence interval for the unknown mean. Use the sample size and the shape of the distribution to decide which of these descriptions (I–IV) best fits each data set

  34. Page 610, E41 • There are no outliers, and there is no evidence of skewness. Methods based on the normal distribution are suitable. • The distribution is not symmetric, but the sample is large enough that it is reasonable to rely on the robustness of the t-procedure and construct a confidence interval, without transforming the data to a new scale

  35. Page 610, E41 III. The shape suggests transforming. With a larger sample, this might not be necessary, but for a skewed sample of this size transforming is worth trying. IV. It would be a good idea to analyze this data set twice, once with the outliers and once without.

  36. Page 610, E41 • weights, in ounces, of bags of potato chips

  37. Page 610, E41 • weights, in ounces, of bags of potato chips n = 15; fairly symmetric with outlier

  38. Page 610, E41 • weights, in ounces, of bags of potato chips n = 15; fairly symmetric with outlier IV. It would be a good idea to analyze this data set twice, once with the outliers and once without.

  39. Page 610, E41 B. Per capita gross domestic product (GNP) for various countries

  40. Page 610, E41 B. Per capita gross domestic product (GNP) for various countries n = 34, strongly skewed right

  41. Page 610, E41 B. Per capita gross domestic product (GNP) for various countries n = 34, strongly skewed right III. The shape suggests transforming. With a larger sample, this might not be necessary, but for a skewed sample of this size transforming is worth trying.

  42. Page 610, E41 • Batting averages of American League players

  43. Page 610, E41 • Batting averages of American League players n > 40, fairly symmetric, no outliers

  44. Page 610, E41 • Batting averages of American League players n > 40, fairly symmetric, no outliers I. There are no outliers, and there is no evidence of skewness. Methods based on the normal distribution are suitable.

  45. Page 610, E41 D. self-reported grade-point averages of 67 students

More Related