1 / 31

Detecting Spatial Clustering in Matched Case-Control Studies

Detecting Spatial Clustering in Matched Case-Control Studies. Andrea Cook, MS Collaboration with: Dr. Yi Li November 4, 2004. Outline. Motivation Petrochemical exposure in relation to childhood brain and leukemia cancers Cumulative Geographic Residuals Unconditional Conditional

Download Presentation

Detecting Spatial Clustering in Matched Case-Control Studies

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Detecting Spatial Clustering in Matched Case-Control Studies Andrea Cook, MS Collaboration with: Dr. Yi Li November 4, 2004

  2. Outline • Motivation • Petrochemical exposure in relation to childhood brain and leukemia cancers • Cumulative Geographic Residuals • Unconditional • Conditional • Simulation Results • Type I error • Power Calculations • Application • Childhood Leukemia • Childhood Brain Cancer • Software • Discussion • Limitations • Future Research

  3. Taiwan Petrochemical Study Matched Case-Control Study • 3 controls per case • Matched on Age and Gender • Resided in one of 26 of the overall 38 administrative districts of Kaohsiung County, Taiwan • Controls selected using national identity numbers (not dependent on location).

  4. Study Population Due to dropout approximately 50% 3 to 1 matching, 40% 2 to 1 matching, and 10% 1 to 1 matching.

  5. Map of Kaohsiung

  6. Cumulative Residuals • Unconditional (Independence) • Model definition using logistic regression • Extension to Cluster Detection • Conditional (Matched Design) • Model definition using conditional logistic regression • Extension to Cluster Detection

  7. Logistic Model Assume the logistic model where, and the link function, Therefore the likelihood score function for is with information matrix

  8. Residual Formulation Then define a residual as, where is the solution to . Assuming the model is correctly specified would imply there is no pattern in residuals. => Use Residuals to test for misspecification. Cumulative Residuals for Model Checking; Lin, Wei, Ying 2002

  9. Hypothesis Test Hypothesis of interest, Geographic Location, (ri, ti ) Independent of Outcome, Yi|Xi  Cumulative Geographic Residual Moving Block Process is Patternless

  10. Unconditional Cluster Detection Define the Cumulative Geographic Residual Moving Block Process as,

  11. Asymptotic Distribution However, the asymptotic distribution of is difficult to simulate, but it has been shown to be equivalent to the following, conditional on the observed data, distribution, where

  12. Significance Test Testing the NULL • Simulate N realizations of by repeatedly simulating , while fixing the data at their observed values. • Calculate P-value

  13. Conditional Logistic Model Type of Matching: 1 case to Ms controls Data Structure: Assume that conditional on , an unobserved stratum-specific intercept, and given the logit link, implies, The conditional likelihood, conditioning on is,

  14. Score and Information Denote the conditional likelihood score as, with information matrix,

  15. Conditional Residual Then define a residual as, where is the solution to . => Use these correlated Residuals to test for patterns based on location.

  16. Conditional Cumulative Residual Define the Conditional Cumulative Residual Moving Block Process as, Which has been shown to be asymptotically equivalent to, where and that are independent of observed data.

  17. Testing the NULL Simulate N realizations of by repeatedly simulating , while fixing the data at their observed values. Calculate P-value Significance Test

  18. Simulation • Choice of Gi or Gis • Unconditional Normal Discrete • Conditional Normal Discrete 1 to 1 2 to 1 3 to 1 • Type I error • Power Calculations

  19. Type I error • Unconditional • Generate N xi and yi from Unif(0,10) • Type I error is the percentage of found significant clusters. • Conditional • Generate N xis and yis from Unif(0,10) • Type I error is the percentage of found significant clusters.

  20. Type I error Unconditional Conditional

  21. Power Calculations • Two Power Calculations

  22. Power Calculations • Single Hotspot

  23. Power Calculations • Multiple Hotspots

  24. Power Calculations • Unconditional • Conditional

  25. Application • Study: Kaohsiung, Taiwan Matched Case-Control Study • Method: Conditional Cumulative Geographic Residual Test (Normal and Mixed Discrete)

  26. Results Odds Ratio (p-values) Marginally Significant Clustering for both outcomes without adjusting for smoking history.

  27. Childhood Leukemia

  28. Childhood Brain Cancer

  29. Software • R macro to handle both unconditional and conditional data • Dataset: • X and Y coordinates of each participant • Case/control variable • Covariate matrix • Stratum Variable for conditional data • Takes just a few minutes to run!

  30. Discussion Cumulative Geographic Residuals • Unconditional and Conditional Methods for Binary Outcomes • Can find multiple significant hotspots holding type I error at appropriate levels. • Not computer intensive compared to other cluster detection methods Taiwan Study • Found a possible relationship between Childhood Leukemia and Petrochemical Exposure, but not with the outcome Childhood Brain Cancer.

  31. Discussion Future Research • Failure Time Data • Recurrent Events • Relocation of Study Participants • Surveillance

More Related