1 / 35

Analysis of Variance: Randomized Blocks

Learn about the analysis of variance for randomized block designs and multivariate models, and how to use Excel for analysis. Understand the assumptions, interpretations, and test statistics involved.

arwood
Download Presentation

Analysis of Variance: Randomized Blocks

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Analysis of Variance: Randomized Blocks Farrokh Alemi Ph.D. Kashif Haqqi M.D.

  2. Additional Reading • For additional reading see Chapter 13 in Michael R. Middleton’s Data Analysis Using Excel, Duxbury Thompson Publishers, 2000. • Example described in this lecture is based in part on Chapter 14, Sections 3 through 5 of Keller and Warrack’s Statistics for Management and Economics. Fifth Edition, Duxbury Thompson Learning Publisher, 2000. • Read any introductory statistics book about Analysis of Variance

  3. Which Approach Is Appropriate When? • Analysis of Variance described here expands single factor ANOVA to multiple factors and analysis of more than 2 matched groups of populations. • Choosing the right method for the data is the key statistical expertise that you need to have. • You might want to review a decision tool that we have organized for you to help you in choosing the right statistical method.

  4. Do I Need to Know the Formulas? • You do not need to know exact formulas. • You do need to know where they are in your reference book. • You do need to understand the concept behind them and the general statistical concepts imbedded in the use of the formulas. • You do not need to be able to do Analysis of Variance by hand. You must be able to do it on a computer using Excel or other software.

  5. Objectives Randomized Block Design Repeated Measure Design Sources of Variance Test Statistic An Example Assumptions Results of ANOVA Understanding Blocking ANOVA with replication Factorial Experimental Design An Example How to Analyze Data From Factorial Designs? Take Home Lesson Table of Content

  6. Objectives • To learn the assumptions and the interpretation of Analysis of Variance for randomized block design. • To learn assumptions and the interpretation of Analysis of Variance for multifactor models. • To use Excel to do Analysis of Variance.

  7. Single and Multiple Factors • The ANOVA we discussed so far applies to one single factor (one quantitative response variable). • We have seen in paired matched studies how making sure that the same or similar subjects receive the treatments reduces variations and allows more informative tests. • We now extend the ANOVA model described earlier to situations where more than 2 populations are matched or in our new terminology to situations were there is “randomized block designs”.

  8. Randomized Block Design • If the subjects who receive a particular treatment are the same, or essentially the same, then we have a randomized block design. • For example, if different treatment is provides to patients in low, medium and high severity then severity is used to create a block design. • A block design removes differences among the experimental subjects within a particular treatment and therefore reduces the variations in response variable.

  9. Repeated Measure Design • Is a special form of randomized block design when the same subjects receive different treatments. • For example, surveying same patients at monthly intervals is a repeated measure design. • The same patients receive different treatment. • Repeated measures reduces variation due to differences of subjects across treatment programs.

  10. Sources of Variance In randomized block design we partition the total variation in the data (i.e. the difference between each observation and the grand mean) into three sources: • Sum of square treatment, SST • Sum of square of errors, SSE • Sum of square of blocks, SSB SS(Total) = SST + SSB + SSE

  11. Calculation of Sources of Variance

  12. Calculation of Mean Sources of Variance

  13. Test Statistic • Test statistic for treatment is MST/MSE distributed as an F distribution with k-1 and n-k-b-1 degrees of freedom. • Test statistics for effect of blocks is MSB/MSE distributed as an F distribution with b-1 and n-k-b-1 degrees of freedom.

  14. An Example in Health Care • 200 Patients at a Nursing home were followed for seven months. • Each month we recorded their daily living activity score (measured on an interval scale). • Sample of data are shown or download full data. • Did patients’ daily living activity change over time?

  15. Displaying the data We need to see if the apparent changes in some months are real or due to random chance

  16. Components of ANOVA • Response variable is daily living activity score. • Treatment are the months. • The experimental plan is randomized block design. • We use two factor ANOVA without replication. (“With replication” is used when measures are repeated for different levels of the same factor).

  17. What Are the Null Hypotheses? Means for each patients are the same and means for all 7 months are equal: 1=2=3=4=5= 6=7

  18. What Are the Alternative Hypotheses? At least two months have different means. At least two patients have different means.

  19. Assumptions • The variable of interest is quantitative. • The problem is to compare 2 or more means. • The experimental plan is a blocked randomized design. • Treatment observations are distributed according to a Normal distribution. • The variance of the samples are equal.

  20. Verifying Assumptions • Response variable is quantitative. • The Problem is comparison of seven means. • Assumption of blocked sample design is appropriate as repeated measures are used. • Same subjects are rated across the seven months.

  21. Verifying Assumptions (Continued) • Samples have Normal distribution. Month one data is shown. Other months were also Normal but not displayed.

  22. Verifying Assumptions (Continued) Equality of variances will be examined after the ANOVA is done.

  23. Excel Setup For ANOVA • Prepare data so that columns correspond to treatment and rows to blocks. • Select tools, data analysis, ANOVA without replication. • Include as input the column corresponding to blocks and all treatment columns.

  24. Results of ANOVA • First part shows averages and variances for each block (in this case patients). • First 10 patients are shown in this slide. • There are 7 observations per patient over the 7 months. • Means differ but are differences significant.

  25. Results of ANOVA (Continued) • Next, treatment data are described. • Assumption of equal variances are met as variances are in the same range. • Means differ but are differences significant.

  26. Result of ANOVA (Continued) • Next, sum of square table is shown. • Rows correspond to patients, columns to months. • Note total variation = SST+SSB+SSE. • Note mean sum of square is calculated by dividing sum of squares by degrees of freedom.

  27. Result of ANOVA (Continued) • Test statistic for rows is 2.6 and larger than the critical value. Probability of observing this high an F value is 0. • Reject the hypothesis that patients had same means. • Similarly, reject the hypothesis of same means across the months.

  28. Understanding Blocking • Blocking is the extension of matched pair design to more than 2 populations • Blocking reduces variation and improves our ability to detect differences in treatment. You can see this in the formula for total sum of square = SST+SSB+SSE • In the absence of blocking SSB will be added to SSE

  29. ANOVA with replication • It is possible to have multiple blocks. • For each possible block and treatment combination there may be multiple observations (replicated measures). • How would we use ANOVA for these circumstances?

  30. Factorial Experimental Design • In designing data collection it is important to create as much efficiency as possible. • The most optimal design is a factorial experimental design (typically analyzed using ANOVA with replication or multiple regression).

  31. How to Create Factorial Designs? • For each factor (or block), take two levels the maximum and the minimum. • Examine all possible combinations of the factors. For a 3 factor model, this will lead to two to the power of 3, or 8, possible combinations. For a four factor model this leads to 2 to power of 4 possible combinations or 16 combinations. • Measure the response variable for all possible combinations with replication.

  32. A Factorial Design for 3 Factors • Note there are eight unique cases. No case has the same level of the three factors. • The combination was created by repeating every 4 cases for factor one, every 2 cases for factor two and every case for factor three.

  33. An Example In Health Care • Three factors are assumed to affect consumer satisfaction: waiting time, travel time and bed side manner. Design an experiment to understand the relative influence of the three factors.

  34. How to Analyze Data From Factorial Designs? • Data can be analyzed using ANOVA with replications, if for each combination of factors there are repeated measures. • Excel provides a method for analyzing 2 factors with all levels of the factors specified. This is a limited method of analysis. An easier, more generalized approach, is to analyze the data using Multiple regression. A concept we introduce later.

  35. Take Home Lesson • Experimental design affects the method of the analysis. • An effective approach is block randomized design (an extension of matched pair t-test). In these circumstances we use two factor ANOVA without replication. • An optimal design is factorial experimental design. In these circumstances an ANOVA with replication is appropriate.

More Related