slide1 n.
Download
Skip this Video
Loading SlideShow in 5 Seconds..
Tutorial: Causal Model Search PowerPoint Presentation
Download Presentation
Tutorial: Causal Model Search

Loading in 2 Seconds...

play fullscreen
1 / 93

Tutorial: Causal Model Search - PowerPoint PPT Presentation


  • 800 Views
  • Uploaded on

Tutorial: Causal Model Search. Richard Scheines Carnegie Mellon University. Peter Spirtes, Clark Glymour , Joe Ramsey, others. Goals. Convey rudiments of graphical causal models Basic working knowledge of Tetrad IV. Tetrad: Complete Causal Modeling Tool. Tetrad.

loader
I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
capcha
Download Presentation

Tutorial: Causal Model Search


An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
    Presentation Transcript
    1. Tutorial: Causal Model Search Richard Scheines Carnegie Mellon University Peter Spirtes, Clark Glymour, Joe Ramsey, others

    2. Goals Convey rudiments of graphical causal models Basic working knowledge of Tetrad IV

    3. Tetrad: Complete Causal Modeling Tool

    4. Tetrad • Main website: http://www.phil.cmu.edu/projects/tetrad/ • Download: http://www.phil.cmu.edu/projects/tetrad/current.html • Data files: www.phil.cmu.edu/projects/tetrad_download/download/workshop/Data/ • Download from Data directory: • tw.txt • Charity.txt • Optional: • estimation1.tet, estimation2.tet • search1.tet, search2.tet, search3.tet

    5. Outline Motivation Representing/Modeling Causal Systems Estimation and Model fit Causal Model Search

    6. StatisticalCausal Models: Goals • Policy, Law, and Science: How can we use data to answer • subjunctive questions (effects of future policy interventions), or • counterfactual questions (what would have happened had things been done differently (law)? • scientific questions (what mechanisms run the world) • Rumsfeld Problem: Do we know what we do and don’t know: Can we tell when there is or is not enough information in the data to answer causal questions?

    7. Causal Inference Requires More than Probability Prediction from Observation ≠ Prediction from Intervention P(Lung Cancer 1960 = y | Tar-stained fingers 1950 = no) ≠ P(Lung Cancer 1960 = y | Tar-stained fingers 1950set = no) In general: P(Y=y | X=x, Z=z) ≠ P(Y=y | Xset=x, Z=z) Causal Prediction vs. Statistical Prediction: Non-experimental data (observational study) P(Y=y | X=x, Z=z) P(Y,X,Z) P(Y=y | Xset=x, Z=z) Causal Structure Background Knowledge

    8. Causal Search Causal Search: Find/compute all the causal models that are indistinguishable given background knowledge and data Represent features common to all such models Multiple Regressionis often the wrong tool for Causal Search: Example: Foreign Investment & Democracy

    9. Does Foreign Investment in 3rd World Countries inhibit Democracy? Foreign Investment Timberlake, M. and Williams, K. (1984). Dependence, political exclusion, and government repression: Some cross-national evidence. American Sociological Review 49, 141-146. N = 72 PO degree of political exclusivity CV lack of civil liberties EN energy consumption per capita (economic development) FI level of foreign investment

    10. Foreign Investment Correlations po fi en cv po 1.0 fi -.175 1.0 en -.480 0.330 1.0 cv 0.868 -.391 -.430 1.0

    11. Case Study: Foreign Investment Regression Results po = .227*fi - .176*en + .880*cv SE (.058) (.059) (.060) t 3.941 -2.99 14.6 Interpretation: foreign investment increases political repression

    12. Case Study: Foreign Investment Alternatives • There is no model with testable constraints (df > 0) that is not rejected by the data, in which FI has a positive effect on PO.

    13. Tetrad Demo • Load tw.txt data • Estimate regression • Search for alternatives • Estimate alternative

    14. Tetrad Hands-On • Load tw.txt data • Estimate regression

    15. Outline • Motivation • Representing/Modeling Causal Systems • Causal Graphs • Standard Parametric Models • Bayes Nets • Structural Equation Models • Other Parametric Models • Generalized SEMs • Time Lag models

    16. Causal Graphs Causal Graph G = {V,E} Each edge X  Y represents a direct causal claim: X is a direct cause of Y relative to V Years of Education Income Years of Education Skills and Knowledge Income

    17. Causal Graphs Omitteed Causes Omitteed Common Causes Not Cause Complete Common Cause Complete Years of Education Years of Education Skills and Knowledge Skills and Knowledge Income Income

    18. Modeling Ideal Interventions Interventions on the Effect Post Pre-experimental System Room Temperature Sweaters On

    19. Modeling Ideal Interventions Interventions on the Cause Post Pre-experimental System Room Temperature Sweaters On

    20. Pre-intervention graph “Soft” Intervention “Hard” Intervention Interventions & Causal Graphs Model an ideal intervention by adding an “intervention” variable outside the original system as a direct cause of its target. Intervene on Income

    21. Tetrad Demo & Hands-On Build and Save an acyclic causal graph: with 3 measured variables, no latents with 5 variables, and at least 1 latent

    22. Parametric Models

    23. Instantiated Models

    24. Causal Bayes Networks The Joint Distribution Factors According to the Causal Graph, P(S,YF, L) = P(S) P(YF | S) P(LC | S)

    25. Causal Bayes Networks The Joint Distribution Factors According to the Causal Graph, P(S = 0) = 1 P(S = 1) = 1 - 1 P(YF = 0 | S = 0) = 2 P(LC = 0 | S = 0) = 4 P(YF = 1 | S = 0) = 1- 2 P(LC = 1 | S = 0) = 1- 4 P(YF = 0 | S = 1) = 3 P(LC = 0 | S = 1) = 5 P(YF = 1 | S = 1) = 1- 3 P(LC = 1 | S = 1) = 1- 5 P(S) P(YF | S) P(LC | S) = f() All variables binary [0,1]:  = {1, 2,3,4,5,}

    26. Tetrad Demo & Hands-On Attach a Bayes PM to your 3-variable graph Define the Bayes PM (# and values of categories for each variable) Attach an IM to the Bayes PM Fill in the Conditional Probability Tables.

    27. Structural Equation Models Causal Graph • Structural Equations For each variable X  V, an assignment equation: X := fX(immediate-causes(X), eX) • Exogenous Distribution: Joint distribution over the exogenous vars : P(e)

    28. Linear Structural Equation Models Causal Graph Path diagram Equations: Education := Education Income :=Educationincome Longevity :=EducationLongevity Exogenous Distribution: P(ed, Income,Income) - i≠jei ej(pairwise independence) - no variance is zero E.g. (ed, Income,Income) ~N(0,2) 2 diagonal, - no variance is zero Structural Equation Model: V = BV + E

    29. Simulated Data

    30. Tetrad Demo & Hands-On Attach a SEM PM to your 3-variable graph Attach a SEM IM to the SEM PM Change the coefficient values. Simulate Data from both your SEM IM and your Bayes IM

    31. Outline Motivation Representing/Modeling Causal Systems Estimation and Model fit Model Search

    32. Estimation

    33. Estimation

    34. Tetrad Demo and Hands-on Select Template: “Estimate from Simulated Data” Build the SEM shown below – all error standard deviations = 1.0 (go into the Tabular Editor) Generate simulated data N=1000 Estimate model. Save session as “Estimate1”

    35. Estimation

    36. Coefficient inference vs. Model Fit Coefficient Inference: Null: coefficient = 0 p-value = p(Estimated value bX1 X3 ≥ .4788 | bX1 X3 = 0 & rest of model correct) Reject null (coefficient is “significant”) when p-value < a, a usually = .05

    37. Coefficient inference vs. Model Fit Coefficient Inference: Null: coefficient = 0 p-value = p(Estimated value bX1 X3 ≥ .4788 | bX1 X3 = 0& rest of model correct) Reject null (coefficient is “significant”) when p-value < < a, a usually = .05, Model fit: Null: Model is correctly specified (constraints true in population) p-value = p(f(Deviation(Sml,S))≥ 5.7137 | Model correctly specified)

    38. Tetrad Demo and Hands-on • Create two DAGs with the same variables – each with one edge flipped, and attach a SEM PM to each new graph (copy and paste by selecting nodes, Ctl-C to copy, and then Ctl-V to paste) • Estimate each new model on the data produced by original graph • Check p-values of: • Edge coefficients • Model fit • Save session as: “session2”

    39. What influences giving? Sympathy? Impact? Charitable Giving "The Donor is in the Details", Organizational Behavior and Human Decision Processes, Issue 1, 15-23, with G. Loewenstein, R. Scheines. N = 94 TangibilityCondition [1,0] Randomly assigned experimental condition Imaginability [1..7]How concrete scenario I Sympathy [1..7] How much sympathy for target Impact [1..7] How much impact will my donation have AmountDonated [0..5] How much actually donated

    40. Theoretical Hypothesis Hypothesis 2

    41. Tetrad Demo and Hands-on Load charity.txt (tabular – not covariance data) Build graph of theoretical hypothesis Build SEM PM from graph Estimate PM, check results

    42. 10 MinuteBreak

    43. Outline • Motivation • Representing/Modeling Causal Systems • Estimation and Model fit • Model Search • Bridge Principles (Causal GraphsProbability Constraints): • Markov assumption • Faithfulness assumption • D-separation • Equivalence classes • Search

    44. Statistical Inference Background Knowledge e.g., X2prior in time to X3 Constraint Based Search X1_||_ X2means:P(X1, X2) =P(X1)P(X2) X1_||_X2| X3means:P(X1, X2 | X3) =P(X1| X3)P(X2| X3)

    45. Score Based Search Model Score Background Knowledge e.g., X2prior in time to X3

    46. Independence Equivalence Classes:Patterns & PAGs • Patterns(Verma and Pearl, 1990): graphical representation of d-separation equivalence among models with no latent common causes • PAGs: (Richardson 1994) graphical representation of a d-separation equivalence class that includes models with latent common causes and sample selection bias that are d-separation equivalent over a set of measured variables X

    47. Patterns

    48. Patterns: What the Edges Mean

    49. Patterns

    50. Tetrad Demo and Hands On