1 / 18

Introduction to Inference Ch. 9-1

Introduction to Inference Ch. 9-1. Hypothesis Testing: Drawing Conclusions. Confidence intervals are one of the two most common types of statistical inference. The second type of inference is called a Test of Significance (also called Hypothesis Test ) .

leala
Download Presentation

Introduction to Inference Ch. 9-1

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Introduction to Inference Ch. 9-1 Hypothesis Testing: Drawing Conclusions

  2. Confidence intervals are one of the two most common types of statistical inference. The second type of inference iscalled aTest of Significance (also calledHypothesis Test). Use a confidence intervalwhen your goal is to estimate a population parameter. Use a hypothesis testwhen you wish to assess a claim concerning a population using the evidence provided. Tests Of Significance

  3. Statistical tests are based on determining the probability of obtaining a particular outcome if we repeated an experiment many times using a specified sample size – sampling distributions. The underlying question posed by a test of significance is: What’s the probability that I received such data just by chance? If a hypothesis seem consistent with what we would expect from natural sampling variability, we retain our original hypothesis; however, if the results are far from what we would expect, we reject the hypothesis. The Reasoning of Tests of Significance

  4. Of course, when the probability is very low, it may be that the hypothesis is actually true and we just witnessed something that is extremely unlikely by mere chance. When it comes to data though, statisticians don’t believe in miracles. Instead, they say that if the data is unlikely enough, then it is OK to reject the hypothesis. The Reasoning of Tests of Significance

  5. Consider the following example: A manufacturer creates huge pieces of metal called ingots. Unfortunately, the ingots must be produced completely crack-fee. 80% of the ingots that they make are crack-free – this is unacceptable because of the high cost of recycling the defective ingots. Engineers develop a new process that is suppose to reduce the cracking proportion slightly. Using the new process, an SRS of 400 ingots were created and only 17% of them cracked. Should the manufacturer declare victory? Has the cracking rate really decreased or was 17% just pure luck? Significance tests try to answer questions like this. Others may be: Has the president’s approval rating changed since last month? Has teenage smoking decreased in the last 5 years? Is the global temperature really increasing? The Reasoning of Tests of Significance

  6. A hypothesis is something that is not proved, but assumed to be true for the purpose of argument. In order to begin a hypothesis test, we need a hypothesis that we consider to be true – we call this the Null Hypothesis. The null hypothesis, which we denote H0, specifies a population model parameter of interest and proposes a value for that parameter. When we create a hypothesis test, we develop a dichotomy where either the hypothesis is true or the Alternative Hypothesis, Ha,is true. The following demonstrates the notation that we will use: The Null Hypothesis:H0: p =p0 (original proportion) orH0 :μ=μ0 Alternative Hypothesis:Ha: p > p0or Ha: p < p0or Ha: p ≠ p0 or Ha: μ> μ0or Ha: μ< μ0or Ha: μ≠μ0 The Null and Alternative Hypothesis’

  7. Describe the effect you are searching for in terms of a population parameterlike µ or p. Never state a hypothesis in terms of a sample statistic like The null hypothesis H0 is the statement that this effect is not present in the population, whereas the alternative hypothesisHastates that it is. If the sample statistic is far from the parameter value stated by H0, then we have evidence to REJECT H0and if the sample is relatively close to the parameter value, we FAIL TO REJECT H0 The p-value says how unlikely (or likely) a result at least as extreme as the one observed would be if H0 were true. Results with small P-values would rarely occur if the H0 were true. Outline of a test

  8. P-value & Statistical Significance • P-value - ( for an upper critical value, z*) is the probability (computed assuming that H0 is true) that the test statistic would take a value as extreme or more extreme than that actually observed. • The smaller the P-value is, the stronger the evidence against H0provided by the data. Sampling distribution of p when p = 0 ^ P-value p = 0

  9. Significance Tests and OJ Simpson • If you think about the OJ Simpson case, it brings to mind several ideas that are parallel to significance tests. First, OJ was considered innocent until proven guilty beyond a reasonable doubt. For significance, our Null hypothesis is true until rejected beyond a “reasonable doubt.” Second, it was up to the prosecutor to have enough evidence to convict OJ. Similarly, it is our burden to provide enough evidence to reject the Null Hypothesis. However, since the prosecutor was not able to provide enough evidence to convict him, they declared him “not guilty.” Likewise, if we do not have enough “evidence” to reject the Null Hypothesis, we “fail to reject the Null Hypothesis.” Note: Just as the jury did not say that OJ was innocent, we do not say that the Null Hypothesis is true!!!

  10. The decisive value of P is called the Significance level  (alpha). The value of  is related to the confidence level C by  = 1 – C Therefore, a 5% significance level corresponds to a 95% Confidence level. i.e  = 1 - 0.95 = 0.05 If the P-value is less than or equal to , then the result is considered significant at level , P-value ≤, the result is significant. Significance Level, 

  11. What does the p-value have to do with it? • At which p-value would you reject Hₒ in favor of Hₐ with α=.05? • p =0.10 • p = 0.055 • p = 0.02 Correct Answer: C • At a .05 significance level, anything below .05 is considered so unlikely that we would reject the null hypothesis.

  12. What does the p-value have to do with it? • You and your friends are having a competition. You want to see who can get the best quiz score, simply by guessing (multiple choice quiz of course) . You determine that the probability of getting a B is p = .0005 . Interpret this p-value in context. • There is a probability of .0005 of getting a score of B or better on the quiz. If someone were to get a B on the quiz, it may suggest that they actually studied (not just guessed!)

  13. Steps to follow: Step 1: Identify population Parameter, state the null and alternative Hypotheses, determine what you are trying to do (and determine what the question is asking). Step 2: Verify the Assumptions by checking the conditions. Step 3: If conditions are met, Name the inference procedure and carry out the inference: Calculate theTest statistic and Obtain the p-value Step 4: Make a decision(reject or fail to reject H0). State your conclusionabout HA in context of the problem using p-value. Tests for a Population Mean

  14. Ex 1: In a discussion of the education level of the American workforce, an anchor on CNN stated “The average young person can’t even balance a checkbook.” The NAEP survey says that a score of 275 or higher on its quantitative test reflects the skill needed to balance a checkbook. The NAEP took a random sample of 840 young Americans had a mean score of 272, and a standard deviation of 60. Using a 95% Confidence level, does this mean that young Americans are unintelligent?

  15. Ex 1: State the appropriate null and alternative hypothesis: We want to test a claim about the mean NAEP score of all young Americans. The mean NAEP score for all young Americans is at the checkbook balancing level. The mean NAEP score for all young Americans is below the checkbook balancing level.

  16. Ex 2: The manager of a fast-food restaurant wants to reduce the proportion of drive-through customers who have to wait more than 2 minutes to receive their food once their order is placed. Based on store records, the proportion of customers who had to wait at least 2 minutes was p = .63 . To reduce this proportion, the manager assigns an additional employee to assist with drive-through orders. During the next month the manager will collect random samples of drive through times.

  17. Ex 2: State the appropriate null and alternative hypothesis: We want to test a claim about the proportion of customers that wait at least 2 minutes at the drive-through window. The proportion of those who have to wait at least 2 minutes has not changed. The proportion of those who have to wait at least 2 minutes has been reduced.

  18. What are we looking for?? Ha: p > p0 is P(Z ≥ z) (one sided to the right) Ha: p < p0 is P(Z ≤ z) (one sided to the left) Ha: p ≠ p0 is 2P(Z ≥ |z|) (two sided)

More Related