1 / 12

Labs 6 & 7 Case-Control Analysis ----Logistic Regression Henian Chen, M.D., Ph.D.

Labs 6 & 7 Case-Control Analysis ----Logistic Regression Henian Chen, M.D., Ph.D. Logistic Regression for Intercept only SAS Program proc logistic data =case_control978 descending ; model status=; run ; * Descending: to get the probability and OR for dependent variable=1 SAS Output

nicki
Download Presentation

Labs 6 & 7 Case-Control Analysis ----Logistic Regression Henian Chen, M.D., Ph.D.

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Labs 6 & 7 Case-Control Analysis ----Logistic Regression Henian Chen, M.D., Ph.D. Applied Epidemiologic Analysis - P8400 Fall 2002

  2. Logistic Regression for Intercept only SAS Program proclogisticdata=case_control978 descending; model status=; run; * Descending: to get the probability and OR for dependent variable=1 SAS Output The LOGISTIC Procedure Model Information Data Set WORK.CASE_CONTROL978 Response Variable status Number of Response Levels 2 Number of Observations 978 Model binary logit Optimization Technique Fisher's scoring Applied Epidemiologic Analysis - P8400 Fall 2002

  3. Logistic Regression for Intercept only SAS Output Response Profile Ordered Total Value status Frequency 1 1 200 2 0 778 Probability modeled is status=1. Model Convergence Status Convergence criterion (GCONV=1E-8) satisfied. -2 Log L = 990.8635 Analysis of Maximum Likelihood Estimates Standard Wald Parameter DF Estimate Error Chi-Square Pr > ChiSq Intercept 1 -1.3584 0.0793 293.5837 <.0001 Applied Epidemiologic Analysis - P8400 Fall 2002

  4. Logistic Regression for Intercept only Log [Y/(1-Y)] = α Y = eα / (1+ eα) = exp(α) / [1 + exp(α)] In our model, α = -1.3584, -1.3584 is the log odds of cancer for total sample. The odds (eα) is 0.2571. Y = exp(-1.3584) / [1 + exp(-1.3584)] =0.2045 =200/(200+778) Y is related to α in Logistic Model Applied Epidemiologic Analysis - P8400 Fall 2002

  5. Logistic Regression for Dichotomous Predictor Alcohol Consumption (alcgrp): 0=0-39 gm/day; 1=40+ gm/day SAS Program proclogisticdata=case_control978 descending; model status=alcgrp; run; SAS Output Model Fit Statistics Criterion Intercept Only Intercept and Covariates -2 Log L 990.863 901.036 Likelihood Ratio Test G = 990.863 – 901.036 = 89.827 df = 1 The model with variable ‘alcgrp’ is significantly. Applied Epidemiologic Analysis - P8400 Fall 2002

  6. Logistic Regression for Dichotomous Predictor SAS Output Analysis of Maximum Likelihood Estimates Standard Wald Parameter DF Estimate Error Chi-Square Pr > ChiSq Intercept 1 -2.5911 0.1925 181.1314 <.0001 alcgrp 1 1.7641 0.2132 68.4372 <.0001 Odds Ratio Estimates Point 95% Wald Effect Estimate Confidence Limits alcgrp 5.836 3.843 8.864 -2.5911 is the log odds of cancer for light drinkers (alcgrp=0). Log odds of cancer for heavy drinkers (alcgrp=1) is –0.827 (-2.5911 + 1.7641). Y = 0.0697 for light drinkers, and 0.3043 for heavy drinkers. OR = exp(β) = exp(1.7641) = 5.836 Heavy drinkers (alcgrp=1) are about 6 times more likely to get cancer than light drinkers (alcgrp=0). OR is not related to α in Logistic Model Applied Epidemiologic Analysis - P8400 Fall 2002

  7. Logistic Regression for Ordinal Predictor Alcohol Consumption (alcgrp4): 0=0-39 gm/day; 1=40-79 gm/day 2=80-119 gm/day; 3=120+ gm/day SAS Program proclogisticdata=case_control978 descending; model status=alcgrp4; run; SAS Output Model Fit Statistics Criterion Intercept Only Intercept and Covariates -2 Log L 990.863 846.467 Likelihood Ratio Test G = 990.863 – 846.467 = 144.396 df = 1 The model with variable ‘alcgrp4’ is significantly. Applied Epidemiologic Analysis - P8400 Fall 2002

  8. Logistic Regression for Ordinal Predictor SAS Output Analysis of Maximum Likelihood Estimates Standard Wald Parameter DF Estimate Error Chi-Square Pr > ChiSq Intercept 1 -2.4866 0.1459 290.4172 <.0001 alcgrp4 1 1.0453 0.0934 125.2007 <.0001 Odds Ratio Estimates Point 95% Wald Effect Estimate Confidence Limits alcgrp4 2.844 2.368 3.416 OR = exp(1.0453) = 2.844. Men with alcgrp4=1 are about 3 times more likely to get cancer than men with alcgrp4=0. This OR is also for alcgrp4= 1 vs. alcgrp4=2; or alcgrp4=2 vs. alcgrp4=3. OR = exp[(3-1)*1.0453] = exp(2.0906) = 8.090 for alcgrp4=1 vs. alcgrp4=3 OR = exp[(3-0)*1.0453] = exp(3.1359) = 23.009 for alcgrp4=0 vs. alcgrp4=3 Applied Epidemiologic Analysis - P8400 Fall 2002

  9. Logistic Regression for Continuous Predictor Alcohol Consumption (alcohol): daily consumption in grams SAS Program proclogisticdata=case_control978 descending; model status=alcohol; run; SAS Output Analysis of Maximum Likelihood Estimates Standard Wald Parameter DF Estimate Error Chi-Square Pr > ChiSq Intercept 1 -2.9741 0.1807 270.9266 <.0001 alcohol 1 0.0261 0.00232 126.4179 <.0001 Odds Ratio Estimates Point 95% Wald Effect Estimate Confidence Limits alcohol 1.026 1.022 1.031 Applied Epidemiologic Analysis - P8400 Fall 2002

  10. Logistic Regression for Continuous Predictor OR = exp(0.0261) = 1.026. The odds of cancer increase by a factor of 1.026 for each unit in alcohol consumption OR = exp[40*(0.0261)] = exp(1.044) = 2.8406 for a 40-grams increase in alcohol consumption per day OR = exp[120*(0.0261)] = 22.825 for a man who drinks 160 grams per day compare with a man who is similar in other respects but drinks 40 grams per day. Applied Epidemiologic Analysis - P8400 Fall 2002

  11. Interaction in Logistic Regression model status = α + β1 alcgrp + β2 tobgrp β1 : the effect of alcohol on cancer, controlling for tobacco (i.e., the same OR across levels of tobacco) β2 :the effect of tobacco on cancer, controlling for alcohol (i.e., the same OR across levels of alcohol) model status = α + β1 alcgrp + β2 tobgrp + β3 alcgrp*tobgrp β1 : the effect of alcohol on cancer among non-smokers (tobgrp=0) β2 :the effect of tobacco on cancer among non-drinkers (alcgrp=0) β3 : interaction between smokers and drinkers Applied Epidemiologic Analysis - P8400 Fall 2002

  12. Interaction in Logistic Regression model status = -3.33 + 2.28 (alcgrp) + 1.38 (tobgrp) –0.98 (alcgrp*tobgrp) Log odds odds A: alcgrp=0 & tobgrp=0 2.28*0 + 1.38*0 – 0.98*0*0 = 0.00 1.00 B: alcgrp=1 & tobgrp=0 2.28*1 + 1.38*0 – 0.98*1*0 = 2.28 9.78 C: alcgrp=0 & tobgrp=1 2.28*0 + 1.38*1 – 0.98*0*1 = 1.38 3.97 D: alcgrp=1 & tobgrp=1 2.28*1 + 1.38*1 – 0.98*1*1 = 2.68 14.59 Odds Ratio A vs. B 9.78 = 9.78/1.00 A vs. C 3.97 = 3.97/1.00 A vs. D 14.59 = 14.59/1.00 B vs. D 1.49 = 14.59/9.78 C vs. D 3.68 = 14.59/3.97 Applied Epidemiologic Analysis - P8400 Fall 2002

More Related