1 / 57

Designing Experiments

Designing Experiments. Introduction 2 k factorial designs 2 k r factorial designs 2 k-p fractional factorial designs One-factor experiments Two-factor full factorial design without replications Two-factor full factorial design with replications

price-bates
Download Presentation

Designing Experiments

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Designing Experiments • Introduction • 2k factorial designs • 2kr factorial designs • 2k-p fractional factorial designs • One-factor experiments • Two-factor full factorial design without replications • Two-factor full factorial design with replications • General full factorial designs with k factors

  2. Introduction To Experiment Design • You know your metrics • You know your factors • System parameters that are going to be varied in the study – affect the response variable • You know your levels • The alternative values that the factors will take on • You’ve got your method selected -- instrumentation and test workloads • Now what?

  3. Goals in Experiment Design • Obtain maximum information • With minimum work • Typically meaning minimum number of experiments • More experiments aren’t better if you’re the one who has to perform them • Well-designed experiments are also easier to analyze

  4. Experimental Replications • The system under study will be run with varying levels of different factors, potentially with differing workloads • A run with a particular set of levels and other inputs is a replication • Often, you need to do multiple replications with a single set of levels and other inputs • For statistical validation

  5. Interacting Factors • Some factors have effects completely independent of each other • Doubling the factor’s level may halve the response, regardless of other factors • But the effects of some factors depends on the values of other factors • Interacting factors • Presence of interacting factors complicates experimental design

  6. Basic Problem in Designing Experiments • You have chosen some number of factors • They may or may not interact • How can you design an experiment that captures the full range of the levels? • With minimum amount of work • Which combination or combinations of the levels of the factors do you measure?

  7. Common Mistakes in Experimentation • Ignoring experimental error • Uncontrolled parameters • Not isolating effects of different factors • One-factor-at-a-time experiment designs • Interactions ignored • Designs require too many experiments

  8. Types of Experimental Designs • Simple designs • Full factorial design • Fractional factorial design

  9. Experimental Design (l1,0, l1,1, … , l1,n1-1) x (l2,0, l2,1, … , l2,n2-1) x … x (lk,0, lk,1, … , lk,nk-1) k different factors, each factor with nilevels Factor 1 Factor 2 Factor k

  10. Simple Designs • Vary one factor at a time • For k factors with ith factor having ni levels - • Assumes factors don’t interact • Usually more effort than required • Don’t use it, usually

  11. fix Simple Designs (l1,0, l1,1, … , l1,n-1) x (l2,0, l2,1, … , l2,n-1) x … x (lk,0, lk,1, … , lk,n-1) Factor 1 Factor 2 Factor k vary

  12. Simple Designs (l1,0, l1,1, … , l1,n-1) x (l2,0, l2,1, … , l2,n-1) x … x (lk,0, lk,1, … , lk,n-1) Factor 1 Factor 2 Factor k

  13. Full Factorial Designs • For k factors with ith factor having ni levels - • Test every possible combination of factors’ levels • Captures full information about interaction • A heck of a lot of work, though

  14. Full Factorial Designs (l1,0, l1,1, … , l1,n-1) x (l2,0, l2,1, … , l2,n-1) x … x (lk,0, lk,1, … , lk,n-1) Factor 1 Factor 2 Factor k

  15. Reducing the Work in Full Factorial Designs • Reduce number of levels per factor • Generally a good choice • Especially if you know which factors are most important - use more levels for them • Reduce the number of factors • But don’t drop important ones • Use fractional factorial designs

  16. Fractional Factorial Designs • Only measure some combination of the levels of the factors • Must design carefully to best capture any possible interactions • Less work, but more chance of inaccuracy • Especially useful if some factors are known not to interact

  17. Fractional Factorial Designs (l1,0, l1,1, … , l1,n-1) x (l2,0, l2,1, … , l2,n-1) x … x (lk,0, lk,1, … , lk,n-1) Factor 1 Factor 2 Factor k

  18. 2k Factorial Designs • Used to determine the effect of k factors • Each with two alternatives or levels • Often used as a preliminary to a larger performance study • Each factor measured at its maximum and minimum level • Perhaps offering insight on importance and interaction of various factors

  19. Unidirectional Effects • Effects that only increase as the level of a factor increases • Or visa versa • If this characteristic is known to apply, a 2k factorial design at minimum and maximum levels is useful • Shows whether the factor has a significant effect

  20. 22 Factorial Designs • Two factors with two levels each • Simplest kind of factorial experiment design • Concepts developed here generalize • A form of regression can be easily used here • Simplest to show with an example

  21. 22 Factorial Design Example • The Time Warp Operating System • Designed to run discrete event simulations in parallel • Using an optimistic method • Goal is fastest possible completion of a given simulation • Usually quality is expressed in terms of speedup • Here, the simpler metric of runtime is used

  22. Factors and Levels for Time Warp Example • First factor - number of nodes used to run the simulation • Vary between 8 and 64 • Second factor - whether or not dynamic load management is used • To migrate work from node to node as load in the simulation changes • Other factors exists, but ignore them for now

  23. Defining Variables for the 22 Factorial TW Example if 8 nodes if 64 nodes if no dynamic load management if dynamic load management used

  24. Sample Data For Example • Single runs of one benchmark simulation 8 Nodes 64 Nodes NO DLM 820 217 DLM 776 197

  25. Regression Model for Example • y = q0 + qAxA + qBxB + qABxAxB • Note this is a nonlinear model 820 = q0 -qA - qB + qAB 217 = q0 +qA - qB - qAB 776 = q0 -qA + qB - qAB 197 = q0 +qA + qB + qAB

  26. Regression Model, Con’t • 4 equations in 4 unknowns Another way to look at it shown in this table - Experiment A B y 1 -1 -1 y1 2 1 -1 y2 3 -1 1 y3 4 1 1 y4

  27. Solving the Equations q0 = 1/4(820 + 217 + 776 + 197) = 502.5 qA = 1/4(-820 + 217 - 776 + 197) = -295.5 qB = 1/4(-820 - 217 + 776 + 197) = -16 qAB = 1/4(820 - 217 - 776 + 197) = 6 So, y = 502.5 - 295.5xA - 16xB + 6xAxB

  28. The Sign Table Method • Another way of looking at the problem in a tabular form I A B AB y 1 -1 -1 1 820 1 1 -1 -1 217 1 -1 1 -1 776 1 1 1 1 197 2010 -1182 -64 24 Total 502.5 -295.5 -16 6 Total/4

  29. Allocation of Variation for 22 Model • Calculate the sample variance of y Numerator is the SST - total variation SST = 22qA2 + 22qB2 + 22qAB2 • We can use this to explain what causes the variation in y

  30. Terms in the SST • 22qA2 is part of variation explained by the effect of A - SSA • 22qB2 is part of variation explained by the effect of B - SSB • 22qAB2 is part of variation explained by the effect of the interaction of A and B - SSAB SST = SSA + SSB + SSAB

  31. Variations in Our Example • SST = 350449 • SSA = 349281 • SSB = 1024 • SSAB = 144 • We can now calculate the fraction of the total variation caused by each effect (e.g. SSA/SST)

  32. Fractions of Variation in Our Example • Fraction explained by A is 99.67% • Fraction explained by B is 0.29% • Fraction explained by the interaction of A and B is 0.04% • So almost all the variation comes from the number of nodes • So if you want to run faster, apply more nodes, don’t turn on dynamic load management

  33. General 2k Factorial Designs • Used to explain the effects of k factors, each with two alternatives or levels • 22 factorial designs are a special case • Methods developed there extend to the more general case • But many more possible interactions between pairs (and trios, etc.) of factors

  34. 2k Factorial Designs With Replications • 2k factorial designs do not allow for estimation of experimental error • No experiment is ever repeated • But usually experimental error is present • And often it’s important • Handle the issue by replicating experiments • But which to replicate, and how often?

  35. 2kr Factorial Designs • Replicate each experiment r times • Allows quantification of experimental error • Again, easiest to first look at the case of only 2 factors

  36. 22r Factorial Designs • 2 factors, 2 levels each, with r replications at each of the four combinations • y = q0 + qAxA + qBxB + qABxAxB + e • Now we need to compute effects, estimate the errors, and allocate variation • We can also produce confidence intervals for effects and predicted responses

  37. Computing Effects for 22r Factorial Experiments • We can use the sign table, as before • But instead of single observations, regress off the mean of the r observations • Compute errors for each replication using similar tabular method • Similar methods used for allocation of variance and calculating confidence intervals

  38. Example of 22r Factorial Design With Replications • Same Time Warp system as before, but with 4 replications at each point (r=4) • No DLM, 8 nodes - 820, 822, 813, 809 • DLM, 8 nodes - 776, 798, 750, 755 • No DLM, 64 nodes - 217, 228, 215, 221 • DLM, 64 nodes - 197, 180, 220, 185

  39. 22r Factorial Example Analysis Matrix I A B AB y Mean 1 -1 -1 1 (820,822,813,809) 816 1 1 -1 -1 (217,228,215,221) 220.25 1 -1 1 -1 (776,798,750,755) 769.75 1 1 1 1 (197,180,220,185) 195.5 2001.5 -1170 -71 21.5 Total 500.4 -292.5 -17.75 5.4 Total/4 q0= 500.4 qA= -292.5 qB= -17.75 qAB= 5.4

  40. N yi Estimation of Errors for 22r Factorial Example • Figure differences between predicted and observed values for each replication • Now calculate SSE

  41. Allocating Variation • We can determine the percentage of variation due to each factor’s impact • Just like 2k designs without replication • But we can also isolate the variation due to experimental errors • Methods are similar to other regression techniques for allocating variation

  42. Variation Allocation in Example • We’ve already figured SSE • We also need SST, SSA, SSB, and SSAB • Also, SST = SSA + SSB + SSAB + SSE • Use same formulae as before for SSA, SSB, and SSAB

  43. Sums of Squares for Example • SST = SSY - SS0 = 1,377,009.75 • SSA = 1,368,900 • SSB = 5041 • SSAB = 462.25 • Percentage of variation for A is 99.4% • Percentage of variation for B is 0.4% • Percentage of variation for A/B interaction is 0.03% • And 0.2% (apx.) is due to experimental errors

  44. Confidence Intervals For Effects • Computed effects are random variables • Thus, we would like to specify how confident we are that they are correct • Using the usual confidence interval methods • First, must figure Mean Square of Errors

  45. Calculating Variances of Effects • Variance of all effects is the same - • So standard deviation is also the same • In calculations, use t- or z-value for 22(r-1) degrees of freedom

  46. Calculating Confidence Intervals of Effects for Example • At 90% level, using the t-value for 12 degrees of freedom, 1.782 • And standard deviation of effects is 3.68 • Confidence intervals are qi-+(1.782)(3.68) • q0 - (493.8,506.9) • qA - (-299.1,-285.9) • qB - (-24.3,-11.2) • qAB - (-1.2,11.9)

  47. Predicted Responses • We already have predicted all the means we can predict from this kind of model • We measured four, we can “predict” four • However, we can predict how close we would get to the sample mean if we ran m more experiments

  48. N N N y ym ym Formula for Predicted Means • For m future experiments, the predicted mean is Where

  49. N ym Example of Predicted Means • What would we predict as a confidence interval of the response for no dynamic load management at 8 nodes for 7 more tests? • 90% confidence interval is (811.6,820.4) • We’re 90% confident that the mean would be in this range

  50. Visual Tests for Verifying Assumptions • What assumptions have we been making? • Model errors are statistically independent • Model errors are additive • Errors are normally distributed • Errors have constant standard deviation • Effects of errors are additive • Which boils down to independent, normally distributed observations with constant variance

More Related