Advanced Statistics for Interventional Cardiologists

Advanced Statistics for Interventional Cardiologists

What you will learn 1st day 2ndday Introduction Basics of multivariable statistical modeling Advanced linear regression methods Hands-on session: linear regression Bayesian methods Logistic regression and generalized linear model Resampling methods Meta-analysis Hands-on session: logistic regression and meta-analysis Multifactor analysis of variance Cox proportional hazards analysis Hands-on session: Cox proportional hazard analysis Propensity analysis Most popular statistical packages Conclusions and take home messages

What you will learn • Multifactor Analysis of Variance • ANOVA Basics • Regression versus ANOVA • Model assumptions • Test principle – F-test • GLM approach • Contrasts • Multiple comparisons • Power and Sample size • Diagnostics • Non-parametric alternative • Two-Factor ANOVA • Interaction effect • Analysis of Covariance • Repeated Measures • MANOVA

Use of Analysis of Variance • ANOVA models basically are used to analyze the effect of qualitative explanatory variables (independent, factors) on a quantitative response variable (dependent). • In multifactor studies, ANOVA models are employed to determine key factors and whether the different factors interact.

Regression versus Analysis of Variance Simple ANOVA model : Comparing means of groups made by a qualitative variable. No specification of the nature of the statistical relationship with the response variable. Simple Regression model: Fitting a mean that changes as a function of a quantitative variable. Regression allows predictions (extrapolations) of the response variable. Source: Statistics in Practice, Moore en McCabe, 2006

Single-Factor ANOVAExample The graph below contains the results of a study that measured the response of 30 subjects to treatments and placebo. Let’s evaluate if there are significant differences in mean response.

Single-Factor ANOVABasic Ideas and Assumptions • Used to simultaneously compare two or more group means based on independent samples from each group. • We assume that the samples are from normally distributed populations, all with the same variance. • The larger the variation among sample group means relative to the variation of individual measurements within the groups, the greater the evidence that the hypothesis of equal group means is untrue.

N=17 -1,392 -0,174 Kurtosis (Vorm) Skewness (Symmet -0,395 -0,182 N=56 Single-Factor ANOVANormality check Kolmogorov-Smirnov or Shapiro Wilk Test Graphically, with a Q-Q Plot (with Kurtosis en Skewness)

Single-Factor ANOVAAssumptions Observations i . i . d Identically Distributed Comparing is not possible ! Comparing is ... Independent We cannot predict an observation from another observation. And as we have tests for normality (Kolmogorov-Smirnov, Shapiro Wilk), there exist … Tests for equal variances (eg. Levene) (execute before we start with Anova tests)

Gr1 Gr2 Gr3 X11 X12 X13 ... ... ... Xn3,3 Xn1,1 Xn2,2 X,1 X,2 X,3 X, X Single-Factor ANOVA Test Principle Anova Model H0: m1 = m2 = … = mk Ha: not all µi are equal Principle of the global test: If the variability between the groups is significantly greater than the variability within the groups: reject H0

Single-Factor ANOVAF-Test • Null hypothesis: μ1 = μ2 = …μk • Alternative hypothesis: not all μi are equal • Test statistic: F = MSG/MSE • MSG: estimate for the variability among groups (per df) • MSE: estimate for the variability within groups (per df) • Decision rule: reject H0 if • Demonstration: http://bcs.whfreeman.com/ips5e/

Variability between groups Effect of Indep Var Mean square = Sum of Squares / df MS = SS / df Fob s= MStreat/MSerror Reject H0 when Fobs > Fcrit Variability within groups Effect of unknown Indep Var or measurement errors. Reject H0 if p < .05 Residual Variance Single-Factor ANOVAAnova table Interpretation of rejection of H0 : At least one of the group means is different from another group mean.

Single-Factor ANOVAExample So what’s the verdict for the drug effect ? F value of 3,98 is significant with a p-value of 0,03, which confirms that there is a significant difference in the means. The F test does not give any specifics about which means are different, only that there is at least one pair of means that is statistically different. The R-square is the proportion of variation explained by the model.

Regression Approach (GLM)Example From linear regression to the general linear model. Coding scheme for the categorical variable defines the interpretation of the parameter estimates.

Regression Approach (GLM)Example - Regressor construction • Terms are named according to how the regressor variables were constructed. • Drug[a-placebo] means that the regressor variable is coded as 1 when the level is “a”, - 1 when the level is “placebo”, and 0 otherwise. • Drug[d-placebo] means that the regressor variable is coded as 1 when the level is “d”, - 1 when the level is “placebo”, and 0 otherwise. • You can write the notation for Drug[a-placebo] as ([Drug=a]-[Drug=Placebo]), where [Drug=a] is a one-or-zero indicator of whether the drug is “a” or not. • The regression equation then looks like: Y = b0 + b1*((Drug=a)-(Drug=placebo)) + b2*(Drug=d)-(Drug=placebo)) + error

Regression Approach (GLM) Example – Parameters and Means • With this regression equation, the predicted values for the levels “a”, “d” and “placebo” are the means for these groups. • For the “a” level: Pred y = 7.9 + -2.6*(1-0) + -1.8*(0-0) = 5.3 For the “d” level: Pred y = 7.9 + -2.6*(0-0) + -1.8(1-0) = 6.1 For the “placebo” level: Pred y = 7.9 + -2.6(0-1) + -1.8*(0-1) = 12.3 • The advantage of this coding system is that the regression parameter tells you how different the mean for that group is from the means of the means for each level (the average response across all levels). • Other coding schemes result in different interpretations of the parameters.

Example

Example CAMELOT Study, JAMA 2004

What you will learn Multifactor Analysis of Variance ANOVA Basics Regression versus ANOVA Model assumptions Test principle – F-test GLM approach Contrasts Multiple comparisons Power and Sample size Diagnostics Non-parametric alternative Two-Factor ANOVA Interaction effect Analysis of Covariance Repeated Measures MANOVA

Single-Factor ANOVAContrasts • Contrasts are often used to analyze (a priori or post-hoc) which group (or factor level) means are different. • A contrast L is a comparison involving two or more factor level means and is defined as a linear combination of the factor level means µi where the coefficients ci sum to zero. L = c1µ1+c2µ2+…+ckµk with c1 + c2 + …+ ck = 0 • Examples L = µ1 - µ2 or L = µ1 - 1/3 µ2 - 1/3 µ3- 1/3 µ4

c²j MSerr ∑ nj Lobs Lobs tobs = = SELob √ Single-Factor ANOVAContrasts t-Test for a linear contrast Hypothesis : H0: L = c1m1+c2m2+…+ckmk = 0 versus H1: L≠0 Estimation of the contrast : We decide to reject H0 when ׀tobs׀> t N-k, 1-a/2 and accept that L is not equal to zero.

Single-Factor ANOVAMultiple comparisons • In a study we often want to make several comparisons, such as comparing many pairs of means. • Making multiple comparisons increases the possibility of committing a Type 1 error (declaring something significant that is not in fact significant). • The more tests you do, the more likely you are to find a significant difference that occurred by chance alone. • If you are comparing all possible pairs of means in a large ANOVA lay-out, there are many possible tests, and a Type 1 error becomes very likely.

Single-Factor ANOVAAdjusting forMultiple comparisons • There are many methods that modify tests tocontrol for an overall error rate when doing simultaneous comparisons. • With the method of Bonferroni the overall error rate is divided by the total number of comparisons you want to make. So we test differences between means at a significance level α* = α / c. • Other multiple comparison methods such as Tukey-Kramer, Sidak or Gabriel are less conservative than Bonferroni. This means that they are more powerful and able to detect smaller differences.

Single-Factor ANOVAAdjusting forMultiple comparisons What can we conclude about the differences between the groups using the comparison circles and the tables on the next slide ?

Single-Factor ANOVAAdjusting forMultiple comparisons Both “a” and “d” appear significantly different than “placebo” with unadjusted tests. Only Drug “a” is significantly different than “placebo” with the Tukey-Kramer adjusted t-tests The difference in significance occurs because the quantile that is multiplied with the SE to create a Least Significant Difference has grown from 2.05 to 2.47 between Student’s test and the Tukey-Kramer test

SPSS: ANOVA T test with P-value no longer 0.05 but 0.05/n of tests performed

Single-Factor ANOVAPower and Sample Size • Power is the probability of achieving a certain significance when the true means and variances are specified. • You can use the power concept to help choose a sample size that is likely to give significance for certain effect sizes and variances. • Power has the following ingredients • The effect size – that is the seperation of the means • The standard deviation of the error or the variance • Alpha, the significance level • The number of observations, the sample size

Single-Factor ANOVAPower and Sample size • Increase the effect size. Larger differences are easier to detect. For example, when designing an experiment to test a drug, administer as large a difference in doses as possible. Also, use balanced designs. • Decrease Residual Variance. If you have less noise it is easier to find differences. Sometimes this can be done by blocking or testing within subjects of by selecting a more homogeneous sample.

Single-Factor ANOVAPower and Sample size • Increase the sample size. With larger samples the standard error of the estimate of effect size is smaller. The effect is estimated with more precision. Roughly, the precision increases in proportion to the square root of the sample size. • Accept less protection. Increase alpha. There is nothing magic about alpha=0.05. A larger alpha lowers the cut-off value. A statistical test with alpha=0.20 declares significant differences more often (and also leads to false conclusions more often).

Single-Factor ANOVAPower and Sample size If you want 90% probability (power) of achieving a significance of 0.01, then the sample size needs to be slightly above 70. For the same power at 0.05 significance, the sample size only needs to be 50

ANOVA DiagnosticsResiduals As in regression, residuals, studentized residuals and studentized deleted residuals are used for diagnosing ANOVA model departures. Plots of residuals against fitted values, residual dot plots and normal probability plots are helpful in diagnosing following departures from the ANOVA model: Nonnormality of error terms Nonconstancy of error variance Outliers and Influential observations Nonindependence of error terms

ANOVA DiagnosticsUnequal Variances ANOVA assumes the variance is the same for all groups. Various F-based methods test for equality of the variances. If unequal variances are of concern, you can consider Welch Anova (test in which the observations are weighted by the reciprocals of the esimated variances) or a nonparametric approach or a transformation of the response variable such as the square root or the log.

Single-Factor ANOVANonparametric Alternative Nonparametric procedures do not depend on the distribution of the error term, often the only requirement is that the distribution is continuous. They are based on the ranks of the data, thus ignoring the spacing information between the data. Kruskal-Wallis test statistic (h) has an approximate chi-square distribution with k-1 degrees of freedom. Decision rule: reject H0 if

Kruskal-Wallis testExample What is your conclusion from the Kruskal-Wallis test ? Compare with the Anova results.

Analysis of VarianceDemonstration How to do an analysis of variance with the EXCEL data analysis option ?

What you will learn Multifactor Analysis of Variance ANOVA Basics Regression versus ANOVA Model assumptions Test principle – F-test GLM approach Contrasts Multiple comparisons Power and Sample size Diagnostics Non-parametric alternative Two-Factor ANOVA Interaction effect Analysis of Covariance Repeated Measures MANOVA

Two-Factor ANOVAIntroduction • A method for simultaneously analyzing two factors affecting a response. • Group effect: treatment group or dose level • Blocking factor whose variation can be separated from the error variation to give more precise group comparisons: study center, gender, disease severity, diagnostic group, … • One of the most common ANOVA methods used in clinical trial analysis. • Similar assumptions as for single-factor anova. • Non-parametric alternative : Friedman test

Two-Factor ANOVAExample Do different treatments cause differences in mean response ? Is there a difference in mean response for males and females ? Is there an interaction between treatment and gender ?

Two-Factor ANOVAInteraction Effect Two-way Anova allows to evaluate the effect of the individual factors on the response (main effects) and to evaluate interaction effects. Interaction: treatment affects the response differently depending on the level of the other factor (block). Source: Common Statistical Methods for Clinical Research, 1997, Glenn A. Walker

Response score of subject k in column i androw j Effect of treatment factor (a levels or i columns ) Interaction effect Overall Mean Error or Effect of not measured variables Effect of blocking factor (b levels or j rows) Two-Factor ANOVAThe Model

Effect of treatment Effect of Blocking factor Error or residual variance Interaction Two-Factor ANOVAAnova Table

Two-Factor ANOVAExample (GLM approach) Questions How much of the variation of the response is explained by the model ? What do you conclude from the Lack of Fit test ? Which of the factors have a significant effect on the response ? What is the mean response for the males ? What is the mean response for subjects treated with D ? What can you do to improve the fit ?

Two-Factor ANOVAExample: Leverage Plots

Two-Factor ANOVAExample with Interaction Questions How much of the variation of the response is explained by the model ? What can you conclude from the effect test table ? What is the mean response for males treated with A ? An interesting phenomenon, which is true only for balanced designs, is that the estimates and SS for the main effects is the same as in the fit without interaction. The F tests are different. Why ? The interaction effect test is identical to the lack-of-fit test in the previous model.

Two-Factor ANOVAExample with Interaction

Two-Factor ANOVAExample Profile plot

Two-Factor ANOVAExample Interaction Plot The plot visualizes that treatment D has a different effect on mean response of males compared to females.

Two-Factor ANOVA Example with Excel ANOVA can easily be done with the data analysis module from Excel ANOVA table from Excel What can you conclude from this Anova table ?

What you will learn Multifactor Analysis of Variance ANOVA Basics Two-Factor ANOVA Analysis of Covariance Repeated Measures MANOVA

Advanced Statistics for Interventional Cardiologists