1 / 29

Methodological Workshop 3: Fixed Effects Models and Multi-Level Models

Methodological Workshop 3: Fixed Effects Models and Multi-Level Models. Yu Xie University of Michigan. What’s Common?. Both the fixed effects model and the multi-level model utilize clustered data.

sana
Download Presentation

Methodological Workshop 3: Fixed Effects Models and Multi-Level Models

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Methodological Workshop 3:Fixed Effects Models and Multi-Level Models Yu XieUniversity of Michigan

  2. What’s Common? • Both the fixed effects model and the multi-level model utilize clustered data. • Both the fixed effects model and the multi-level model are designed to handle cross-context heterogeneity.

  3. Different Objectives • Fixed effects model and multi-level model are very different research designs: • Fixed effects model controls for (or absorbs) pre-treatment heterogeneity (type I heterogeneity) • Multi-level model models both forms of heterogeneity across contexts.

  4. Application of Different Principles • The fixed effects model is essentially an application of the social grouping principle (with a group being a cluster) • The multi-level model is essentially an application of the social context principle.

  5. Using Different Assumptions • The fixed effects model assumes no type II heterogeneity bias (often constant effects model), or additive effects of heterogeneity across contexts (i.e., clusters). • The multi-level model relaxes homogeneity assumption at the individual level but assumes that both forms of heterogeneity are at the context level and can be modeled adequately with contextual covariates.

  6. A General Lesson: Tradeoff between Data and Assumption • “When observed data are thin, it takes strong assumptions to yield sharp results. There is no free information in statistics. Either you collect it, or you assume it.” (Xie 1996, AJS).

  7. Fixed effects model • Sibling model as an example • Family SES, environment are shared • Yi1 =b0 + b1Xi1 + ai + ei1 • Yi2 =b0 + b1Xi2 + ai + ei2 • a andXmay be correlated. • Take difference between the two eq. • Yi2 -Yi1=b1 (Xi2 -Xi1)+ (ei2- ei1) • Resulting in a more robust equation • Properties of the fixed effects approach: • All fixed-characteristics are controlled • It consumes a lot of information • Unobserved heterogeneity (Type I) is controlled for at the group level (fixed effects)

  8. Example: Critique of Zhou and Hou (1999): Positive Benefits of Send-Down? • “More interestingly, our findings also reveal some positive consequences of the send-down experience. For instance, when compared with urban youth, a noticeably higher proportion of the send-down youth attained a college education after 1977. Partly as a result of their educational attainment, these sent-down youth, especially those with shorter rural durations, were equally likely to enter favorable employment (type of occupation and work organizations) in the urban labor force, despite their relatively short urban labor force experience.” (Zhou and Hou 1999: 32)

  9. Speculated Reason for the Beneficial Effects • The unusual hardship faced by sent-down youth forced them to be more adaptive and thus acquire skills to survive.

  10. In Our Recent Study (Xie, Yang, and Greenman 2008) • We analyze data from the survey of Family Life in Urban China that we conducted in three large cities (Shanghai, Wuhan, and Xi’an) in 1999. • We use some items designed for this study.

  11. Statistical Analyses • (1) We present the differences in six socioeconomic indicators between respondents who experienced send-down with those who did not experience send-down. • (2) We present results from a fixed-effects model capitalizing on the sibling structure in our data. • (3) We examine educational attainment closely as a time-varying covariate and its endogenous role in affecting early returns of sent-down youth.

  12. Table 1: Descriptive Differences between Respondents with Send-Down Experience and Respondents without Send-Down Experience Notes: *p<.1, **p<.05, ***p<.01

  13. After We Control for Covariates (Table 2) • There are no differences in salary or income. • Short-term sent-down youth still have higher levels of education than the other two groups (non-sent-down and long-term sent-down).

  14. Potential Sources of Bias • Some sent-down youth did not return to cities or did not return to the same cities. • There can be unobserved family-level characteristics associated with both send-down and outcomes. • We use a fixed effects model based on sibling pairs to address both problems.

  15. Table 3 : Unadjusted Differences by Send-Down Experience Using Sibling Pairs Notes: *p<.1, **p<.05, ***p<.01

  16. What’s Going On? • If there are no effects of send-down (from the fixed effects model), why do we observe differences in education between short-term sent-down youth and long-term sent-down youth? • The answer largely lies in “pre-treatment” differences.

  17. Table 4: Unadjusted Differences by Duration Duration <6 Duration > 6 Notes: *p<.1, **p<.05, ***p<.01

  18. Conclusion • Did send-down experience benefit youth? -- No. • Our analyses of the new data show that the send-down experience did not benefit the youth who were affected. • Differences in social outcomes between those who experienced send-down and those who did not are either non-existent or spurious due to other social processes.

  19. Accounting for Heterogeneous Responses with Social Context Principle • Possible with nested data, assuming that patterns of relationships are homogeneous (or following a distribution) within social contexts (by time or space). • dk is allowed to vary across k (k=1,…K), social context, but is homogeneous within k, conditional on X.

  20. Multi-level Model (MLM) • Yik = ak + dkDik + b’Xik + eikak = l+fzk+mk dk = g+szk+nk • Other names: hierarchical linear models, random-coefficient models, growth-curve models, and mixed models. • Units of analysis at a lower level are nested within higher-level units of analysis • Examples: • Students within schools • Observations over time within persons (growth curve)

  21. Problems without MLM • If we ignore higher-level units of analysis => we cannot account for context (individualistic approach) • If we ignore individual-level observation and rely on higher-level units of analysis, we may commit ecological fallacy (aggregated data approach) • Without explicit modeling, sampling errors at second level may be large =>unreliable slopes • Homoscedasticity and no serial correlation assumptions of OLS are violated (an efficiency problem). • No distinction between parameter variability and sampling variability.

  22. Advantages of MLM • Cross-level comparisons • Controls for differences across higher levels

  23. Example: Xie and Hannum (1996) • (1) Where • Y = earnings, • X1 = years of schooling, • X2 = years of work experience, • X4 = a dummy variable denoting membership in the Communist Party of China (1 = party member), • X5 a dummy variable denoting gender (1 = female). • Note two interactions.

  24. Consider regional heterogeneity • For the ith person in kth city: • Instead of using fixed effects for the intercept b0k, and full interactions for slope parameters, Xie and Hannum modeled these parameters in a multilevel model. • Let z be a city-level covariate that measures the degree of economic reform. Let us assume that individual-level parameters depend on z in the following linear regressions:

  25. Cross-City Model (“meta analysis”)

  26. Combining the two levels => We can see that the city-level covariate z interacts with most of the individual-level predictors.

  27. Special Cases • Special case 1: If all the coefficients of the city-level covariate (z) are zero, we have what is called “random coefficient model” • Special case 2: If all the coefficients of the city-level covariate (z) are zero and there are no random coefficients in all slope coefficients (except the intercept), we have what is called “variance component model”. [See Table 3.]

  28. Summary: Four ways to conceptualize variability in parameters where Pk is the number of predictors at the 2nd level, and K is the number of units at the second level.

  29. References • Xie, Yu. 1996. “Review of Identification Problems in the Social Sciences by Charles Manski.” American Journal of Sociology 101:1131-1133. • Xie, Yu and Emily Hannum. 1996. “Regional Variation in Earnings Inequality in Reform-Era Urban China.” American Journal of Sociology 101:950-992. • Xie, Yu, Yang Jiang, and Emily, Greenman. 2008. “Did Send-Down Experience Benefit Youth? A Reevaluation of the Social Consequences of Forced Urban-Rural Migration during China’s Cultural Revolution.” Social Science Research 37: 686-700.

More Related