1 / 13

Residuals, Influential Points, and Outliers

Residuals, Influential Points, and Outliers. Objective. To develop an understanding of the impact of unusual features in the relationship between two quantitative variables. Residual =. Observed y – Predicted y for a given value of x.

rollison
Download Presentation

Residuals, Influential Points, and Outliers

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Residuals, Influential Points, and Outliers

  2. Objective • To develop an understanding of the impact of unusual features in the relationship between two quantitative variables.

  3. Residual = Observed y – Predictedy for a given value of x Residuals are used in order to find the best LSRL (line of fit)

  4. Residual Plot We use this to decide whether or not the original data actually follows a linear pattern random scatter = true linear relationship

  5. Bad Residual Plots Curved Patterns Increasing or Decreasing spread in scatter

  6. Properties of Residual Plots • Always make your y-axis the set of residuals • You may use either the x-value or the y-value for you x-axis (though minitab will use x-values as a default). In either case your graph should look the same • On your graphing calculator RESID appears in the LIST menu after you have run LinReg(a + bx). • Be sure to update LinReg(a + bx) for each new set of data.

  7. Additional Items that can Influence LSRL • Outliers • Influential Points • Leverage

  8. Outliers will create large residuals outlier Large residual changes LSRL Notice that the regression line does not change drastically by an outlier in the y-direction

  9. Leverage: x-value far from the mean

  10. Influential Point An observed value is said to be influential if when it is removed for the data set it would significantly change the value of the LSRL. Most texts will only use outliers with leverage in the x-direction as influential points (in the y-direction they are simply called outliers).

  11. Note: Though it is tempting, we cannot just simply remove outliers or influential point from our data set. The best thing to do is create a LSRL for the data with this point and then without this point. Once you compare these two lines of fit, you will often learn a great deal about the data that your are trying to model.

  12. 2000 Presidential Election

  13. Resource: http://arts.bev.net/roperldavid/politics/fl2000.htm

More Related