1 / 31

Chapter 8

Chapter 8 . Hypothesis Testing I . Significant Differences. Hypothesis testing is designed to detect significant differences : differences that did not occur by random chance. This chapter focuses on the “one sample” case: we compare a random sample against a population.

kobe
Download Presentation

Chapter 8

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Chapter 8 Hypothesis Testing I

  2. Significant Differences • Hypothesis testing is designed to detect significant differences: differences that did not occur by random chance. • This chapter focuses on the “one sample” case: we compare a random sample against a population. • We compare a sample statistic to a population parameter to see if there is a significant difference.

  3. Example • The education department at a large state university has been accused of “grade inflation” such that education majors have higher GPAs than students in general. • GPAs of all education majors should be compared with the GPAs of all students. • There are 1000s of education majors, far too many to interview. • How can the dispute be investigated without interviewing all education majors?

  4. Example • The average GPA for all students is 2.70 and the standard deviation is 0.70.. • The box reports the statistical information for a random sample of education majors Mu = 2.70

  5. Example • There is a difference between the parameter (2.70) and the statistic (3.00). • It seems that education majors do have higher GPAs, at least in our sample. • However, we are working with a random sample (not all education majors). • The observed difference may have been caused by random chance.

  6. Two Explanations for the Difference • The mean for all education students is the same as the pop. mean (2.70). • The difference in the sample mean is caused by random chance. • The difference is real (significant). • Education majors have higher GPAs than students in general

  7. Hypotheses • Null Hypothesis (H0) • “The difference is caused by random chance”. • The H0 always states there is “no significant difference.” • Alternative hypothesis (H1) • The mean GPA for all education majors is higher than the mean GPA for all students in the college. • (H1) always contradicts the H0. • If we can reject the null hypothesis, our data support the alternative hypothesis (but don’t prove it)

  8. Testing the two explanations • Assume that H0 is true. • What is the probability of getting the sample mean (3.0) if the H0 is true and all education majors really have a mean of 2.7? • If the probability is less than 0.05, reject the null hypothesis.

  9. Test the Hypotheses • This will be a one-tailed test since our alternative hypothesis is not just that education majors’ GPAs are different but that they are higher. • At a 95% confidence level, we put the whole .05 in the upper tail of the normal sampling distribution. • Using the Z score formula and Appendix A to determine the probability of getting the observed difference, we find that Z(critical) = +1.65. • If Z(obtained) is at or above 1.65, we can reject the null hypothesis. The zone of rejection begins at 1.65 and includes any z-score higher than that

  10. Test the Hypotheses • Substituting the values into formula 8.1, we calculate a Z score of 4.6. For a large N, we can use the sample standard deviation • This is above 1.65. and a difference this large would be very rare. In fact, if the null hypothesis were true, the probability of such an extreme outcome is p = .0001. • Therefore we can reject the null hypothesis and say that our result tends to support the alternative hypothesis, that GPAs in the education department are higher than for the university as a whole. .

  11. Interpreting results: significance • This difference is significant (note the technical meaning of significant—namely, that it probably didn’t happen by chance). • “Significant” doesn’t necessarily mean large or even important. • If N is very large, even a small difference could be significant.

  12. Review: five step model for testing hypotheses • Make Assumptions and meet test requirements. • State the null and alternative hypotheses. • Select the sampling distribution and establish the critical region. • Compute the test statistic. • Make a decision and interpret results.

  13. The Five Step Model: Do the steps more formally Is there a grade inflation problem in the education dept?

  14. Step 1 Make Assumptions and Meet Test Requirements • Random sampling • Hypothesis testing assumes samples were selected according to EPSEM. • The sample of 117 was randomly selected from all education majors. • LOM is Interval-Ratio • GPA is I-R so the mean is an appropriate statistic. • Sampling Distribution is normal in shape • This is a “large” sample (N>100).

  15. Step 2 State the Null Hypothesis Null Hypothesis: μ(education department) = 2.7, the same as the mean for the university as a whole • The sample of 117 comes from a population that has a GPA of 2.7. • The difference between 2.7 and 3.0 is trivial and caused by random chance.

  16. Step 2 State the Alternative Hypothesis Alternative (or research) hypothesis: Mean in education is higher than 2.7 • The sample of 117 probably comes from a population that has a higher GPA than 2.7. • The difference between 2.7 and 3.0 probably reflects an actual difference between education majors and other students.

  17. Step 3 Select Sampling Distribution and Establish the Critical Region • Sampling Distribution= Z • Alpha (α) = .05 • Any difference with a probability less than α is rare and will cause us to reject the H0. • Critical Region begins at +1.65 • This is the critical Z score associated withα= .05, eon-tailed test. • If the obtained Z score falls in the C.R., reject the H0.

  18. Step 4 Compute the test statistic • Using formula 8.1, Z (obtained) = 4.6

  19. Step 5 Make a Decision and Interpret Results • The obtained Z score fell in the C.R., so we reject the H0. • If the H0 were true, a sample outcome of 3.00 would be unlikely. • Therefore, the H0 is false and must be rejected. • GPAs in the Education Department are most likely significantly different than GPAs in the college as a whole.

  20. The Five Step Model: Summary • In hypothesis testing, we try to identify statistically significant differences that did not occur by random chance. • In this example, the difference between the parameter 2.70 and the statistic 3.00 was large and unlikely (p < .05) to have occurred by random chance.

  21. The Five Step Model: Summary • We rejected the H0 and concluded that the difference was significant. • It is very likely that Education majors have GPAs higher than the general student body

  22. One-tailed vs. two-tailed hypothesis testing • What if our alternative hypothesis were not that education majors have higher GPAs but simply that their GPAs are significantly different than for the student body as a whole • In that case, we do a two-tailed test, and we split the alpha=.05 into both tails (.025 in each) • In that case, z(crit) = +/- 1.96 • We’d still use formula 8.1 to calculate Z(obt)

  23. One-tailed vs. two-tailed hypothesis testing • What if our null hypothesis were that education majors had lower mean GPAs than the student body as a whole. • In that case, the whole alpha (.05) goes into the negative tail of the distribution. • Z(crit) = -1.65 • We’d still use formula 8.1 to calculate z(obt)

  24. Student’s t distribution • What if N is smaller than 100? Is there still a way to test for significance • Yes, the student’s t distribution does the trick. • What’s more, as N gets to 100 or more, the t-distribution behaves just like the Z-distribution, so we can actually just use the t-distribution all the time

  25. What if we had a smaller sample of education majors? • N = 65 • Sample mean = 2.9 • Sample standard deviation = 0.4

  26. Step 1 Make Assumptions and Meet Test Requirements • Random sampling • Hypothesis testing assumes samples were selected according to EPSEM. • The sample of 65 was randomly selected from all education majors. • LOM is Interval-Ratio • GPA is I-R so the mean is an appropriate statistic. • Sampling Distribution is a t-distribution • This is a small sample (N<100).

  27. Step 2 State the Null and Alternative Hypothess Null hypothesis: The overall GPA for Education majors = 2.7, the same as in the college a a whole. Alternative hypothesis: The mean GPA in education is actually higher than 2.7

  28. Step 3 Select Sampling Distribution and Establish the Critical Region • Sampling Distribution= student’s t • Alpha (α) = .05 • Degrees of freedom = N-1 (64), 1-tailed test • Critical Region begins at +1.671 • If the obtained t score falls in the Critical Region, at or above 1.671, reject the H0 and support the alternative or research hypothesis

  29. Step 4 Compute the test statistic • Using formula 8.2, t (obtained) = 4.

  30. Step 5 Make a Decision and Interpret Results • The obtained t score fell in the Critical Region, so we reject the H0. • If the H0 were true, a sample outcome of t = 4 would be unlikely. • Therefore, the H0 must be rejected. • Our supports the proposition that education majors have a GPA that is significantly significantly higher than the GPA of the general student body

  31. 2 types of error • Type 1 or alpha error: the probability of rejecting the null hypothesis when it is actually true. We set up the problem in a way that minimizes (but does not eliminate) this possibility. • Type 2 or beta error: the probability of failing to reject a null hypothesis when it is actually false. • The two types of error are inversely related; lowering the risk of type 1 error raises the risk of type 2 error

More Related