1 / 22

# Analysis of Variance (ANOVA) - PowerPoint PPT Presentation

Analysis of Variance (ANOVA). ANOVA can be used to test for the equality of three or more population means We want to use the sample results to test the following hypotheses. H 0 :  1  =   2  =   3  =  . . . =  k  H a : Not all population means are equal

I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.

## PowerPoint Slideshow about ' Analysis of Variance (ANOVA)' - moshe

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.

- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript

• ANOVA can be used to test for the equality of three or more population means

• We want to use the sample results to test the following hypotheses.

H0: 1=2=3=. . . = k

Ha: Not all population means are equal

• Rejecting H0 means that at least two population means have different values.

• The values from each population are normally distributed.

• The variance, 2, is the same for all of the populations.

• The observations must be independent.

• Given these assumptions, what is the source of variation among the observed variables?

Two values could come from different populations, or distributions, which could have different means.

Two values could come from the same distribution.

ANOVA:Testing for the Equality of k Population Means

• The ANOVA table

• Between-treatments estimate of 2

• Within-treatments estimate of 2

• Comparing these two estimates: The F Test

• Compute the jth sample mean and variance, j = 1,…,k

• Compute the overall sample mean

Note: the book calls this nT

• Sum of squares due to treatments

• A between-treatments estimate of 2, MSTR, can be computed by

• The within-treatments estimate of 2

Source of Sum of Degrees of Mean

Variation Squares Freedom Squares F

Between SSTR k - 1 MSTR MSTR/MSE

Within SSE nT - k MSE

Total SST nT - 1

• Kelli Home Products, Inc. is considering marketing a long lasting car wax. Three different waxes (Type 1, Type 2, and Type 3) have been developed. In order to test the durability of these waxes, 5 new cars were waxed with Type 1, 5 with Type 2, and 5 with Type 3. Each car was then repeatedly run through an automatic carwash until the wax coating showed signs of deterioration. The number of times each car went through the carwash is shown on the next slide. Kelli Home Products, Inc. must decide which wax to market. Are the three waxes equally effective?

ObservationType 1Type 2Type 3

1 48 73 51

2 54 63 63

3 57 66 61

4 54 64 54

5 62 74 56

Sample Mean 55 68 57

Sample Variance 26.0 26.5 24.5

The F-Test

• If the null hypothesis is true and the ANOVA assumptions are valid, the sampling distribution of MSTR/MSE is an F distribution with k – 1 numerator and nT – k denominator df

• If the means of the k populations are not equal, the value of MSTR/MSE will be inflated because MSTR overestimates 2.

Reject H0

MSTR/MSE

F

Critical Value

Sampling Distribution of MSTR/MSE

• The figure below shows the rejection region associated with a level of significance equal to  where F denotes the critical value.

Do Not Reject H0

The F-Test

• We will reject H0 if the resulting value of MSTR/MSE appears to be too large to have been selected at random from the appropriate F distribution.

• Our decision rule becomes

• Assuming  = .05, F.05 = 3.89 (2 d.f. numerator, 12 d.f. denominator). Therefore, reject H0 if F > 3.89

• Test Statistic:

F = MSTR/MSE = 245/25.667 = 9.55

• Decision: Reject H0

• There appears to be a difference in the types of wax at a 5% level of significance.

• If ANOVA provides statistical evidence to reject the null hypothesis of equal population means, then Fisher’s least significance difference (LSD) procedure can be used to determine where the differences occur.

• Pairwise hypothesis tests of the form

• Test Statistic

• Rejection rule:

• Hypotheses

• Rejection Rule

where

• Assuming  = .05, t.025,12 = 2.179.

• Test Statistic

• Conclusion

The mean number of hours worked at Plant 1 is not equal to the mean number worked at Plant 2.

• Note that

• Since 13 > 6.98, and 11 > 6.98, we conclude that types 1 and 2 have different means, and types 2 and 3 have different means, but there is no significant difference between the means of types 1 and 3.

• Use Tools/Data Analysis/ANOVA: Single Factor

• Data should be in columns which represent each treatment

• Statistical studies can be classified as being either experimental or observational.

• In an experimental study, one or more factors are controlled so that data can be obtained about how the factors influence the variables of interest.

• In an observational study, no attempt is made to control the factors.

• Cause-and-effect relationships are easier to establish in experimental studies than in observational studies.