1 / 34

Statistics Review

Statistics Review. One Semester in 50 minutes. Setting up a null-hypothesis and finding the p-value. QUESTION: Are babies more likely to pick the “good” toy then the “bad” toy after watching a video? (Investigation 1). How would we answer this question?.

mpropp
Download Presentation

Statistics Review

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Statistics Review One Semester in 50 minutes

  2. Setting up a null-hypothesis and finding the p-value.

  3. QUESTION: Are babies more likely to pick the “good” toy then the “bad” toy after watching a video? (Investigation 1)

  4. How would we answer this question? QUESTION: Are babies more likely to pick the “good” toy then the “bad” toy after watching a video? (Investigation 1)

  5. QUESTION: Are babies more likely to pick the “good” toy then the “bad” toy after watching a video? Philosophy: Lets gather a bunch of data and find out how surprising our results are.

  6. Process: We write a null hypothesis: the null hypothesis is the boring (“status-quo”) hypothesis that we want to show is unlikely to be true given our data.

  7. Process: We write a null hypothesis: the null hypothesis is the boring (“status-quo”) hypothesis that we want to show is unlikely to be true given our data. The alternative hypothesis is our research question. We are hoping to reject the null hypothesis in favor of the alternative hypothesis.

  8. Process: We write a null hypothesis: the null hypothesis is the boring (“status-quo”) hypothesis that we want to show is unlikely to be true given our data. The alternative hypothesis is our research question. We are hoping to reject the null hypothesis in favor of the alternative hypothesis. Gather data

  9. Process: We write a null hypothesis: the null hypothesis is the boring (“status-quo”) hypothesis that we want to show is unlikely to be true given our data. The alternative hypothesis is our research question. We are hoping to reject the null hypothesis in favor of the alternative hypothesis. Gather data Do stats (to determine whether we can reject the null hypothesis)

  10. Do Stats Can we reject the null hypothesis? Method: Assume the null hypothesis is true. Calculate the probability of getting results as (or more) extreme than the data given that null hypothesis is true. If we are unlikely (usually use less than 5% probability) to get these extreme results, then we reject the null-hypothesis. Otherwise we do not reject it.

  11. Note: This method only works one-way. • If we reject the null hypothesis than we claim that the alternative hypothesis is true (this is what researchers want the result to be). • If we do NOT reject the null hypothesis we CANNOT claim anything.

  12. 2. Calculating the probability of getting results as (or more) extreme than the data given that null hypothesis is true. -this probability is the p-value! (which you all know since we memorized it for the last exam). In other words: are our results different enough from what we would expect (assuming null hypothesis is true) to conclude that the null hypothesis is not true?

  13. Example: (Investigation 1) – Are babies more likely to pick the “good” toy then the “bad” toy after watching a video? Null Hypothesis State the null hypothesis in english (no numbers or symbols): (class) State the null hypothesis mathematically: (class)

  14. Example: (Investigation 1) – Are babies more likely to pick the “good” toy then the “bad” toy after watching a video? Null Hypothesis State the null hypothesis in english (no numbers or symbols): The video doesn’t affect the baby’s choice. Baby is equally likely to pick either toy. State the null hypothesis mathematically: H_0: p=0.5 (p is the probability of choosing the good toy)

  15. Example: (Investigation 1) – Are babies more likely to pick the “good” toy then the “bad” toy after watching a video? Alternative Hypothesis State the alternative hypothesis in english (no numbers or symbols): class State the alternative hypothesis mathematically: class

  16. Example: (Investigation 1) – Are babies more likely to pick the “good” toy then the “bad” toy after watching a video? Alternative Hypothesis State the alternative hypothesis in english (no numbers or symbols): The video does affect the baby’s choice. Baby is more likely to pick the good toy. State the alternative hypothesis mathematically: H_a: p>0.5 (when would we use not equal??)

  17. Data: We collected data on 16 babies and found that 14 of them picked the “good” toy. When calculating the p-value what is our parameter? A mean The difference between two means A proportion The difference between two proportions Your mom

  18. Data: We collected data on 16 babies and found that 14 of them picked the “good” toy. When calculating the p-value what is our parameter? A mean The difference between two means A proportion The difference between two proportions Your mom *we could think of it as a mean – the number of babies picking the good toy, or the proportion of babies who pick the good toy.

  19. Data: We collected data on 16 babies and found that 14 of them picked the “good” toy. How do we find the p-value?? Use a z-statistic?? Use a t-statistic?? Use the applet (or another simulation)?? Calculate the exact probability?? Why??

  20. Data: We collected data on 16 babies and found that 14 of them picked the “good” toy. How do we find the p-value?? Use a z-statistic?? Use a t-statistic?? Use the applet (or another simulation)?? –OK but only gives us an approximation Calculate the exact probability?? This gives us the exact p-value for this problem. But how do we do this?? Why??

  21. 3. Use the applet (or another simulation) to approximate the p-value • Flip a fair coin 16 times. • Did we get 14, 15, 16 heads? (this is “as or more extreme than the data”) • Repeat this process MANY times. What proportion of times did we get these extreme results (14,15,16 heads)

  22. Simulation results with 1000 trials of 16 flips histogram

  23. ( number of times we flipped 14 Heads PLUS number of times we flipped 15 Heads PLUS number of times we flipped 16 Heads) divided by total number of trials P-value is approximately: (3+2+0)/1000=5/1000=.005

  24. ( number of times we flipped 14 Heads PLUS number of times we flipped 15 Heads PLUS number of times we flipped 16 Heads) divided by total number of flips P-value is approximately: (3+2+0)/1000=5/1000=.005 The area of the right 3 rectangles -> There is (approximately) a 0.005 chance (0.5%) that we would get 14 or more Heads if the null hypothesis is true. VERY UNLIKELY. So we reject the null hypothesis and conclude that the alternative hypothesis is true.

  25. 4. Calculate the exact probability. How??

  26. Calculate the exact probability. P-value=P(14 Heads)+P(15 Heads)+P(16 Heads) How do we calculate this?

  27. Calculate the exact probability. P-value=P(14 Heads)+P(15 Heads)+P(16 Heads) How do we calculate this? Binomial Theorem!!

  28. Now: What if we collected data on 100 babies (instead of 16)?? What would change??

  29. Data: We collected data on 100 babies and found that 61 of them picked the “good” toy. How do we find the p-value?? Use a z-statistic?? Use a t-statistic?? Use the applet (or another simulation)?? Calculate the exact probability?? Why??

  30. Data: We collected data on 100 babies and found that 61 of them picked the “good” toy. How do we find the p-value?? Use a z-statistic?? Use a t-statistic?? Use the applet (or another simulation)?? Calculate the exact probability?? (3) and (4): this gets tougher as our sample size (n) gets larger.

  31. “Exact” Probabilities • For n large (np>15 and nq>15 ish) this looks like normal curve. • So if we transform it into a “Standard Normal”, we can use the Normal Table to approximate the area (p-value)

  32. How do we transform a Normal Distribution with mean μ and standard deviation into a standard normal?? What is the mean and standard deviation of a standard normal?

  33. How do we transform a Normal Distribution with mean μ and standard deviation into a standard normal?? Find the z-score (number of standard deviations larger than the mean). What is the mean and standard deviation of a standard normal? Standard Normal has mean 0 and standard deviation 1.

  34. Z-score on board

More Related