670 likes | 688 Views
Learn how to compute confidence intervals for population means, dealing with unknown standard deviation, sample size calculation, significance levels, errors, and data transformations in statistics.
E N D
If we want to compute a confidence interval (CI) for a population mean, which formula would we use?
If we want to compute a confidence interval (CI) for a population mean, which formula would we use? It depends! Think!!
If we know the population standard deviation, σ, we would use
Suppose σ is unknown What changes would decrease the width of a confidence interval (CI) for a population mean?
If we want to compute the sample size needed for a given confidence level and given margin of error, how would we proceed?
If we want to compute the sample size needed for a given confidence level and given margin of error, how would we proceed? Think!!! 2 possible cases
If we want to compute the sample size needed for a given confidence level and given margin of error, how would we proceed? Is population standard deviation, σ: • known • or unknown?
P-value is the probability of getting a result as extreme or more extreme than the result (test statistic) we got from our sample, given the null hypothesis is true.
α is the level of significance. How does α relate to P-value?
α is the level of significance. α is the maximum P-value for which the null hypothesis will be rejected.
α is the level of significance. α is the maximum P-value for which the null hypothesis will be rejected. Reject null hypothesis because the P-value of 0.## is less than the significance level of α = 0.05
How does α compare for a two-sided test versus a one-sided test?
How does α compare for a two-sided test versus a one-sided test? It’s the same for both. For example, for a 95% confidence level, α is 0.05 for a two-sided test and a one-sided test.
Remember Errors? What is: • a Type I error? (b) a Type II error?
Errors Type I error is rejecting a true null hypothesis. (b) a Type II error?
Errors Type I error is rejecting a true null hypothesis. Type II error is failing to reject a false null hypothesis.
Errors Type I error is rejecting a true null hypothesis. P(Type I error) = ?
Errors Type I error is rejecting a true null hypothesis. P(Type I error) = α, the level of significance
Why do we transform data? To change skewed data into more normal data.
15/40 Guideline? 15/40 guideline is a set of rules that helps us know when it is appropriate to use a t-interval or t-test for the population mean.
15/40 Guideline Page 608
If our sample size is 40 or more, do we need to plot the sample data?
If our sample size is 40 or more, do we need to plot the sample data? Yes!! Why?
If our sample size is 40 or more, do we need to plot the sample data? Yes!! Why? Need to check for outliers.
Page 610, E41 Pretend that each data set described is a random sample and that you want to do a significance test or construct a confidence interval for the unknown mean. Use the sample size and the shape of the distribution to decide which of these descriptions (I–IV) best fits each data set
Page 610, E41 • There are no outliers, and there is no evidence of skewness. Methods based on the normal distribution are suitable. • The distribution is not symmetric, but the sample is large enough that it is reasonable to rely on the robustness of the t-procedure and construct a confidence interval, without transforming the data to a new scale
Page 610, E41 III. The shape suggests transforming. With a larger sample, this might not be necessary, but for a skewed sample of this size transforming is worth trying. IV. It would be a good idea to analyze this data set twice, once with the outliers and once without.
Page 610, E41 • weights, in ounces, of bags of potato chips
Page 610, E41 • weights, in ounces, of bags of potato chips n = 15; fairly symmetric with outlier
Page 610, E41 • weights, in ounces, of bags of potato chips n = 15; fairly symmetric with outlier IV. It would be a good idea to analyze this data set twice, once with the outliers and once without.
Page 610, E41 B. Per capita gross domestic product (GNP) for various countries
Page 610, E41 B. Per capita gross domestic product (GNP) for various countries n = 34, strongly skewed right
Page 610, E41 B. Per capita gross domestic product (GNP) for various countries n = 34, strongly skewed right III. The shape suggests transforming. With a larger sample, this might not be necessary, but for a skewed sample of this size transforming is worth trying.
Page 610, E41 • Batting averages of American League players
Page 610, E41 • Batting averages of American League players n > 40, fairly symmetric, no outliers
Page 610, E41 • Batting averages of American League players n > 40, fairly symmetric, no outliers I. There are no outliers, and there is no evidence of skewness. Methods based on the normal distribution are suitable.
Page 610, E41 D. self-reported grade-point averages of 67 students