1 / 30

HSRP 734: Advanced Statistical Methods June 19, 2008

HSRP 734: Advanced Statistical Methods June 19, 2008. Extensions of Logistic Regression. Outcomes with more than 2 categories Categories have order Unordered Conditional logistic regression Analysis of matched data. Extensions of Logistic Regression. Exact methods for small samples

takara
Download Presentation

HSRP 734: Advanced Statistical Methods June 19, 2008

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. HSRP 734: Advanced Statistical MethodsJune 19, 2008

  2. Extensions of Logistic Regression • Outcomes with more than 2 categories • Categories have order • Unordered • Conditional logistic regression • Analysis of matched data

  3. Extensions of Logistic Regression • Exact methods for small samples • Fisher’s exact • Exact logistic regression • Correlated/Clustered data • GEE method • Mixed models

  4. Extensions of Logistic Regression • Outcomes with more than 2 categories (polytomous or polychotomous) • Cumulative logit model – Proportional odds model for ordinal outcomes (ordered categories) • Generalized logit model for nominal outcomes or non-proportional odds models (unordered categories)

  5. Extensions of Logistic Regression • Cumulative logit model • Fits a logistic regression model with g-1 intercepts for a g category outcome and one model coefficient for each predictor • Models cumulative probability of being in a “lower” category

  6. Ordinal Logistic Regression • Odds ratios take on interpretation “% increase/decrease in the odds of being in a lower/higher category” • Subject to the “Proportional Odds” assumption

  7. Extensions of Logistic Regression • Generalized logit model • Fits a logistic regression model with g-1 intercepts and g-1 model coefficients for a g category outcome • Model captures the multinomial probability of being in a particular category using generalized logits

  8. Nominal Logistic Regression • Odds ratios have regular interpretation, just have to be careful with which comparisons are being made (reference category) • Does not assume “Proportional Odds”

  9. SAS

  10. Conditional logistic regression • Can use for matched data (e.g., case-control studies) • Provides unbiased estimates of odds ratios and CI’s

  11. SAS

  12. Extensions to Logistic Regression • Exact Logistic Regression • Small Sample Size • Adequate sample size but rare event (sparse data)

  13. Fisher’s exact test • Exact test for RxC table where Chi-square test assumptions are doubtful • Why not always use Fisher’s exact test and Exact logistic regression?

  14. SAS

  15. Extensions of Logistic Regression • Longitudinal data / repeated measures data / Clustered data with binary outcomes • Multilevel models (nested data structures) • GEE (Generalized Estimating Equations) • GLMM (Generalized Linear Mixed Models)

  16. Two methods for handling clustered outcomes • Mixed models • Likelihood based • Use random effects to model clustered observations • continuous outcome (but now extended for categorical) • Generalized Estimating Equation (GEE) • Non-likelihood based • Can handle large number of clusters • categorical outcome

  17. GEE • GEE can be used in • Longitudinal studies • repeated measures of the same individual form a cluster • Community studies • subjects clustered by neighborhood • Familial studies • subjects clustered by family • Epidemiological studies • Different forms of clusters – e.g., pedigree

  18. GEE • In general GEE has 3 sets of parameters to estimate: • Regression parameter (population-averaged effects) • Correlation parameter (cluster parameter) • Scale factor (not uncommon to assume =1)

  19. Comparing SLR and GEE

  20. GEE • In its simplest form, GEE can be considered an extension of logistic regression for clustered data • Clustered data are common • Time: Longitudinal analysis with repeated measurements on individual (e.g., BL, 1m, 2m, 6m follow-up) • Individual: Cross-sectional analysis with multiple outcomes (e.g., left eye, right eye) • Background: Subjects clustered because of common geographical or social background (e.g., clinic)

  21. Correlation structure • Correlation structure • Often called the working correlation structure in GEE • Specifies how the observations within a cluster are related • Often assumes correlation structure uniform throughout clusters

  22. Unstructured • All correlation coefficients free to take any value • E.g.,

  23. Exchangeable • Any responses within the same cluster has the same correlation • Simple (1 parameter to estimate)

  24. Autogressive AR(1) • Correlation between responses depends on the interval of time between responses • Farther apart responses => weaker correlation • Only 1 parameter to estimate!

  25. Correlation matrix • Selection of a “working correlation structure” is at the discretion of the researcher! • How does the correlation structure affects the results?

  26. Properties of GEE estimators • How about estimate of correlation if “working” correlation matrix is not correctly specified? • Model-based estimate => not consistent • Empirical (robust) estimate => still consistent

  27. Properties of GEE estimators • Even if correlation structure misspecified, estimate for logistic regression is still consistent • if correlation misspecified, estimate not as efficient (SE is larger) • This property contributes to the popularity of GEE • GEE works well with larger #’s of clusters

  28. SAS

  29. Review

More Related