1 / 12

Influential Observations in Regression

Influential Observations in Regression. Measurements on Heat Production as a Function of Body Mass and Work Effort. M. Greenwood (1918). “On the Efficiency of Muscular Work,” Proc. Roy. Soc. Of London, Series B , Vol. 90, #627, pp. 199-214. Data Description.

Download Presentation

Influential Observations in Regression

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Influential Observations in Regression Measurements on Heat Production as a Function of Body Mass and Work Effort. M. Greenwood (1918). “On the Efficiency of Muscular Work,” Proc. Roy. Soc. Of London, Series B, Vol. 90, #627, pp. 199-214

  2. Data Description • Study involved Algerians accustomed to heavy labor. Experiment consisted of several hours on stationary bicycle. • Dependent (Response) Variable: • Heat Production (Calories) • Independent (Explanatory/Predictor) Variables: • Work Effort (Calories) • Body Mass (kg) • Model: • H = b0 + b1W + b2M + e

  3. Raw Data (Table III, p.203)

  4. Estimated Regression Coefficients • Note that that we can conclude, controlling for the other factor: • Work Effort increase  Heat Production increases (p = .0136) • Body Mass increase does not Heat Production increases (p = .1957)

  5. Plot of Residuals versus Fitted Values Huge, Positive, Residual

  6. Influential Measures (I) Note: n=37, p*=3 Parameters

  7. Standardized / Studentized Residuals

  8. Influential Measures (II)

  9. Influential Measures (III)

  10. Diagnosing Influential Observations • Clearly, Observation #19 exerts a huge influence (although it has a small hat or leverage value, so it must be near center of Mass/Work observations • Upon further review to author’s original calculations provided in paper, the mean and S.D. are much to high for H (but exactly the same for M and W). • Could observation been a “typo”? • Try replacing H19=3936 with H19=2936 • Note: Do not do this arbitrarily, check your data sources in practice

  11. Analysis with Corrected Data Point Note that both factors are significant, and that the intercept and body mass coefficients have changed drastically

  12. Plot of Residuals versus Predicted Values

More Related