1 / 10

Lecture 18

Lecture 18. Simple Linear Regression (Chapters 18.1-18.5). Interaction plots in ANOVA.

gailk
Download Presentation

Lecture 18

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Lecture 18 • Simple Linear Regression (Chapters 18.1-18.5)

  2. Interaction plots in ANOVA • It is a good idea to always look at the interaction plots when doing a two-way ANOVA, regardless of whether or not the test for interactions is significant. Interaction plots display the basic results of study. • If there really are no interactions, then the interaction plots will consist of parallel lines.

  3. Regression Analysis • The goal: Estimate E(Y|X) = conditional mean of Y given X based on a sample. • Simple Linear Regression: Assumes E(Y|X) is a straight line in X.

  4. Uses of Regression Analysis • Descriptive. Describe the association between y and x in the population observed. • Passive prediction. Predict y based on x where you do not plan to manipulate x, e.g., predict today’s stock price based on yesterday’s stock price. • Control. Predict what y will be if you change x, e.g., predict what your earnings will be if you obtain different levels of education.

  5. Lurking Variables • A lurking variable is a variable that has an important effect on the relationship among the variables in a study but is not included among the variables studied. • Examples: • Y=Salaries of Presbyterian Ministers over time, X=Price of rum in Havana over time, Lurking Variable = Inflation rate over time. • Y=Pellagra rate in village, X=Amount of flies in village, Lurking Variable = Amount of corn in diet.

  6. Pitfalls in Regression Analysis • (1) Descriptive: If using simple linear regression, need to make sure E(Y|X) is actually approximately a straight line. • (2) Passive Prediction: Need to beware of pitfall for (1) plus extrapolation and lurking variables • Control: Need to beware of pitfalls for (1) and (2) plus extra caution about lurking variables. Requires a cause-and-effect relationship. Best found through a controlled experiment.

  7. Example of Pitfall • A researcher measures the number of television sets per person X and the average life expectancy Y for the world’s nations. The regression line has a positive slope – nations with many TV sets have higher life expectancies. Could we lengthen the lives of people in Rwanda by shipping them TV sets?

  8. Residual Plots Against Time • Many lurking variables change systematically over time. • Useful method for detecting lurking variables: Plot residuals against time order of observation is available. If a systematic pattern is found, an understanding of the background of the data might allow you to guess what the lurking variables are. • Another useful residual plot: Plot residuals vs. location of observations.

  9. Residual Plot vs. Time Example • Goal: Predict elementary mathematics enrollment (X) at college based on number of freshman students (Y). Linear Fit Math enrollment = -283.8905 + 0.6511345 Freshman students

  10. Residual Plot vs. Time • Residual plot suggests that a change took place between 1994 and 1995 that caused higher proportion of students to take math courses. • In fact, one of schools in university changed its program in 1995 to require entering students to take another math course. • Conclusion: The math dept. shouldn’t use data from before 1995 for predicting future enrollment.

More Related