Multiple imputation
1 / 17

Multiple Imputation - PowerPoint PPT Presentation

  • Uploaded on

Multiple Imputation. Multiple Regression. Input From SPSS. *** ***; PROC IMPORT OUT= WORK.IntroQuest DATAFILE = " C:\Users\Vati\Documents\StatData\IntroQ\IntroQ .sav " DBMS=SPSS REPLACE ; Run; Use the Import Wizard to bring the data into SAS.

I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
Download Presentation

PowerPoint Slideshow about 'Multiple Imputation' - flynn

An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.

- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript
Multiple imputation

Multiple Imputation

Multiple Regression

Input from spss
Input From SPSS

*** ***;


DATAFILE= "C:\Users\Vati\Documents\StatData\IntroQ\IntroQ.sav"


  • Use the Import Wizard to bring the data into SAS.

Check for missing data
Check For Missing Data

procmeans n nmiss; run;

Check correlates of missingness with satm
Check Correlates of Missingness With SATM

* p < .05

Oh crap
Oh Crap !

  • We have a lot of missing data on SATM

  • Missingness on SATM is associated with statophobia and year.

  • It is not missing completely at random.

  • Need to employ multiple imputation.

Create five imputations
Create Five Imputations

  • ProcMI seed=69301 out=MIdata; var gender ideal nucoph SATM year; run;

Patterns of missingness
Patterns of Missingness

  • Most frequent pattern of missing data is missing on SATM only


Means by pattern of missingness
Means By Pattern of Missingness


Analyze the imputed data
Analyze the Imputed Data

ProcRegoutest = MRbyImputcovout;

Model Statoph = gender ideal nucoph SATM year / stb; By _Imputation_; run;

ProcMIAnalyze; modeleffects intercept gender ideal nucoph SATM year; run;

  • See the complete output here: XYZZY

  • In every imputation, Gender, SATM, and Year have significant effects.

Proc mianalyze output
ProcMIAnalyze Output

  • Pools the results from the five imputations.

  • The variance in the scores is partitioned between that among imputations and that within imputations.

  • Ideally, little of the variance is due to differences among imputations.

Multiple imputation

  • Relative Increase in Variance” is the increase in variance due to having missing data imputed (relative to the condition where no data are missing). Low is good.

  • “Fraction of Missing Information,” is an index of how much more precise the parameter estimate would have been if there had been no missing data. Low is good.

Multiple imputation

  • Relative efficiency” tells you how much poweryou have for the number of imputations you have employed relative to what you would have if you used an uncountably large number of imputations.

  • High is good.


  • Women report greater fear of the stats course than do men.

  • Reported Math Aptitude is inversely correlated with fear of stats.