220 likes | 259 Views
Using Multilevel Modeling in Institutional Research. Ling Ning & Mayte Frias Senior Research Associate Neil Huefner Associate Director Timo Rico Executive Director. Multilevel Data Structure.
E N D
Using Multilevel Modeling in Institutional Research Ling Ning & Mayte Frias Senior Research Associate Neil Huefner Associate Director Timo Rico Executive Director
Multilevel Data Structure • In institutional research, the data structure in the population is usually hierarchical, with students nested within instructors, majors, departments, colleges and campus divisions. • Sampling is conducted using multi-stage cluster sampling, rather than simple random sampling. • The above two factors give rise to multilevel data, in which lower level units are nested within higher level units
Overview • Motivations for using Multilevel Modeling (MLM) in Institutional Research • Consequences of not using MLM with clustered data • What is Multilevel Modeling (MLM)? • Random intercept, random slope and cross-level model • Computation software
How can the use of Multilevel Modeling (MLM) positively contribute to Institutional Research? • Allows one to calculate standard error more accurately • Offers insight into how effects vary at higher levels within the structure • Permits the inclusion of predictors at different levels within the model
Motivations for MLM|Standard Error Adjustment • Conventional methods assume independent observations • Knowing the information of one student tells nothing about another student’s information • Often not realistic in institutional research • Students in the same majors/student support programs same context more similar • Characteristics of one student information for the major information of another student in the same major
Motivations for MLM|Standard Error Adjustment Intraclass Correlation (ICC; ρ) • One way to quantify the degree of clustering (i.e., common effect associated with majors ) • = variance associated with majors • = student variance • Roughly speaking, correlation between students in the same major
Motivations for MLM|Standard Error Adjustment Consequences of Clustering • Clustering Overlapping information • Clustering Information SE • Ignore clustering Inflated Type I error (α) Spurious results Inflated information SE too small
Motivations for MLM|Standard Error Adjustment Example: Effect of a Tutoring Program on retention • 1000 students nested within 63 majors • 315 Participants , 685 Non-participants • Model: Participation Retention
Motivations for MLM|Standard Error Adjustment The Inflation of the alpha level of 0.05 in the presence of intra-class correlation (ICC) Source: Based on Barcikowski, R.S.(1981). Statistical power with group mean as the unit of analysis. Journal of Educational Statistics, 6, 267-285.
What is MLM|Unconditional Random Intercept Model (RIM) Level1: log(Pij/(1-Pij))= β0j (j = 1, 2, … , 63) Level 2: 0j=γ00 +U0j
What is MLM|Two-level Random Intercept Model (RIM) Level1: log(Pij/(1-Pij))= β0j + β1Tutoringij +β2HS GPAij (j = 1, 2, … , 63) Level 2: 0j=γ00 + γ01 STEM0j+U0j
What is MLM|Random Intercept Model Level1: log(Pij/(1-Pij))= β0j + β1Tutoringij +β2HS GPAij (j = 1, 2, … , 63) Level 2: 0j=γ00 + γ01 STEM0j+U0j logit (Retention) Major 1 … Major 5 Average Tutoring Effect … Major 63 0j : γ00 : 0 Tutoring
What is MLM|Random Slope Model Level1: log(Pij/(1-Pij))= β0j + β1jTutoringij +β2HS GPAij (j = 1, 2, … , 63) Level 2: 0j=γ00 + γ01 STEM0j+U0j 1j=γ10 + U1j
What is MLM|Random Slope Model Level1: log(Pij/(1-Pij))= β0j + β1jTutoringij +β2HS GPAij (j = 1, 2, … , 63) Level 2: 0j=γ00 + γ01 STEM0j+U0j 1j=γ10 + U1j Major 1 logit (Retention) … Major 5 Average Tutoring Effect …… Major 63 0j : 1j γ00 : Tutoring 0
What is MLM|Cross-level Model Level1: log(Pij/(1-Pij))= β0j + β1jTutoringij +β2HS GPAij (j = 1, 2, … , 63) Level 2: 0j=γ00 + γ01 STEM0j+U0j 1j=γ10 + γ11 STEM0j+U1j
What is MLM|Retention in each major 63Majors RandomSlope Random Intercept
MLM|Computing Software • Specialized programs for fitting multilevel models • HLM • MLWin • General-purpose statistical software • SAS • R • Stata • Mplus
MLM|Motivations Recap. • Learning about effects that vary by higher level cluster e.g., A student support program that is more effective in some majors than others • Using all the data to perform inferences for groups with small sample size e.g., A program director wants to know how effective her program is for majors with very small number of students. • Prediction e.g., To predict a new student’s outcome
MLM|Motivations Recap. 4. Analysis of data from cluster sampling Many national surveys [e.g., PISA(the Program for International Student Assessment)] used multi-stage probability sample design. 5. Including predictors at different levels e.g., A student support program that is more effective in some majors than others due to some characteristics associated with the major such as STEM or Non-STEM. 6. Getting the right standard error The estimated standard errors of regression coefficients might be wrong when we use multiple regression to analyze multilevel data.
MLM|Good References. • Hoox, Joop (2010).Multilevel Analysis, Techniques and Applications, Routledge • http://joophox.net/mlbook2/MLbook.htm • Gelman, A., and Hill, J. (2006) Data Analysis Using Regression and Multilevel/Hierarchical Models. Cambridge. http://www.stat.columbia.edu/~gelman/arm/ • Singer, J. (2003) Applied Longitudinal Data Analysis: Modeling Change and Event Occurance. Oxford • http://gseacademic.harvard.edu/alda/ • Snijders, T., and Bosker, R. (2012) Multilevel Analysis: An Introduction to Basic and Advanced Multilevel Modeling, 2nd Ed. Sage. • http://www.stats.ox.ac.uk/~snijders/
MLM|Questions • http://csaa.ucdavis.edu/contact.html Contact us