1 / 20

Correlation and Covariance

Correlation and Covariance. James H. Steiger. Goals for Today. Introduce the statistical concepts of Covariance Correlation Investigate invariance properties Develop computational formulas. Covariance.

MikeCarlo
Download Presentation

Correlation and Covariance

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Correlation and Covariance James H. Steiger

  2. Goals for Today • Introduce the statistical concepts of • Covariance • Correlation • Investigate invariance properties • Develop computational formulas

  3. Covariance • So far, we have been analyzing summary statistics that describe aspects of a single list of numbers • Frequently, however, we are interested in how variables behave together

  4. Smoking and Lung Capacity • Suppose, for example, we wanted to investigate the relationship between cigarette smoking and lung capacity • We might ask a group of people about their smoking habits, and measure their lung capacities

  5. Smoking and Lung Capacity

  6. Smoking and Lung Capacity • With SPSS, we can easily enter these data and produce a scatterplot.

  7. Smoking and Lung Capacity • We can see easily from the graph that as smoking goes up, lung capacity tends to go down. • The two variables covary in opposite directions. • We now examine two statistics, covariance and correlation, for quantifying how variables covary.

  8. Covariance • When two variables covary in opposite directions, as smoking and lung capacity do, values tend to be on opposite sides of the group mean. That is, when smoking is above its group mean, lung capacity tends to be below its group mean. • Consequently, by averaging the product of deviation scores, we can obtain a measure of how the variables vary together.

  9. The Sample Covariance • Instead of averaging by dividing by N, we divide by . The resulting formula is

  10. Calculating Covariance

  11. Calculating Covariance • So we obtain

  12. Invariance Properties of Covariance • The covariance is invariant under listwise addition, but not under listwise multiplication. Hence, it is vulnerable to changes in standard deviation of the variables, and is not scale-invariant.

  13. Invariance Properties of Covariance

  14. Invariance Properties of Covariance • Multiplicative constants come straight through in the covariance, so covariance is difficult to interpret – it incorporates information about the scale of the variables.

  15. The (Pearson) Correlation Coefficient • Like covariance, but uses Z-scores instead of deviations scores. Hence, it is invariant under linear transformation of the raw scores.

  16. Alternative Formula for the Correlation Coefficient

  17. Computational Formulas -- Covariance • There is a computational formula for covariance similar to the one for variance. Indeed, the latter is a special case of the former, since variance of a variable is “its covariance with itself.”

  18. Computational Formula for Correlation • By substituting and rearranging, you obtain a substantial (and not very transparent) formula for

  19. Computing a correlation

  20. Computing a Correlation

More Related