70 likes | 196 Views
This section covers essential principles of correlation and regression, emphasizing the importance of plotting data before analysis to identify linear relationships. It highlights the potential for distortion by extreme values and the risks associated with extrapolation beyond the data range. Lurking variables can misrepresent relationships, with correlations often overstated when based on averaged data. Distinctions between correlation and causation are clarified, underlining the necessity of controlled experiments to establish true causal relationships.
E N D
Interpreting Correlation and Regression Section 4.2
Correlation and Regression • Describe only linear relationship. • Strongly influenced by extremes in data. • Always plot data first. • Extrapolation – Use of regression line or curve outside the values of the domain of explanatory variable.
Averaged Data • Correlations based on averages, not actual data, are usually too high. • Smooths out data. • Does not allow for scatter among individuals.
Lurking Variables • Variables that influence the two studied variables but are not in the study. • Can falsely suggest a strong relationship. • Can hide a relationship. • Ex: Herbal tea in nursing homes/Ice cream and drowning.
Association does not imply Causation • Strong associations ≠ cause/effect relation. • Causation – Change in x causes change in y. • Common Response – Both x and y respond to changes in some unobserved variable. • Confounding – The effect on y of x is mixed up with effects on y of other variables.
Experiments • Best way to get good evidence that x causes y. • Only x is changed, while lurking variables are controlled.