1 / 53

Chapter 3 Examining Relationships

Chapter 3 Examining Relationships. Section 3.1 Scatterplots. Terms to Know. A response variable measures an outcome of a study. An explanatory variable attempts to explain the observed outcomes. Example of an Explanatory and Response Variable.

abel
Download Presentation

Chapter 3 Examining Relationships

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Chapter 3Examining Relationships Section 3.1 Scatterplots

  2. Terms to Know A response variable measures an outcome of a study. An explanatory variable attempts to explain the observed outcomes.

  3. Example of an Explanatory and Response Variable One degree day is accumulated for each degree a day’s average temp falls below or rises above 65 degrees.

  4. Key Concept The statistical techniques used to study relations among variables are more complex than one-variable methods. Fortunately we build on the tools used for examining individual variables. The principles that guide examination are the same. • Start with a graph • Look for an overall pattern and deviations from the pattern • Add numerical descriptions of specific aspects of the data • Sometimes there is a way to describe that

  5. Term to Know The most effective way to display the relation between two quantitative variables is a scatterplot. Plot the explanatory variable, if there is one, on the x-axis, and the response variable on the y-axis. Each individual in the data appears as a point.

  6. ScatterPlot

  7. Interpreting Scatterplots To interpret a scatterplot, look first for a pattern. The pattern should reveal direction, form and strength of the relationship between two variables. Refer to Figure 3.1 on page 175. Form: two clusters Direction: Negatively associated Strength: moderate

  8. Strong, Positive Association with Linear Form

  9. Some Relationships Have No Direction or Pattern

  10. Not All Relationships are Linear

  11. Yes, you can have outliers on ScatterPlots An outlier in any graph of data is an individual observation that falls outside the overall pattern of the graph (ie: WV)

  12. Add a Third Variable (Categorical) of Southern and non-Southern by Using Different Symbols

  13. Scatter Plot Heads Up When several individuals have exactly the same data, they occupy the same point on the scatter plot. Some software packages address the issue by using different symbols for multiple individuals with the same data. You can do the same by hand. However, your calculator does not. So be careful. Use trace to identify such cases.

  14. Scatterplots display direction, form, strength and relationship between two variables. However, our eyes are not a good judge of the strength of the relationship.

  15. Key Concept Correlation measure the direction and strength of the linear relationship between two quantitative variables. Correlation is usually written as r.

  16. Facts About Correlation • No distinction between explanatory and response variable • Requires two quantitative variables • Unit change of observation does not change correlation • Positive r indicates positive association, negative r indicates negative association • Range: -1 < r < 1 • Measures strength of linear relationships of two variables only • Is not resistant to outliers

  17. Correlation Exercise • Technology Toolbox, page 186 • Yes, The process is long and convoluted, but there is a shortcut using LinReg Command

  18. Key Concept 1) A key thing to remember when working with correlations is never to assume a correlation means that a change in one variable causes a change in another. Sales of personal computers and athletic shoes have both risen strongly in the last several years and there is a high correlation between them, but you cannot assume that buying computers causes people to buy athletic shoes (or vice versa).

  19. Key Concept 2) Correlation only describes linear relationships only, now matter how strong how strong the curved relationship may be. • Like mean and standard deviation, correlation, r, is not resistant to outliers • Correlation is not a complete summary of a two variable relationship. You should give the means of x and y.

  20. Homework • Read 3.2 • Complete problems 1, 2, 6, 7 ,8, 13, 15, 19, 21, 23

  21. Chapter 3Examining Relationships Section 3.2 Least-Squares Regression

  22. Key Term Least Squares Regression is a method for finding a line the summarizes the relationship between two variables that show a linear trend. We often use a regression line to predict the value of y for a given value of x. Regression, unlike correlation requires that we have an explanatory variable and a response variable.

  23. Regression Line for Predicting Gas Consumption from Degree Days

  24. Why is it Called A Least-Squares Regression Line (“LSRL”)?

  25. Why is it Called A Least-Squares Regression Line (“LSRL”)?

  26. Correlation and Regression Appletwww.whfreeman.com

  27. LSRL – Using TI84 Enter NEA data in L1 and Fat data in L2

  28. NEA/Fat Least-Squares Regression Line Exercise Complete Technology Toolbox on page 210

  29. Interpret you regression equation in terms of your variables(ie: fat gain = a + b(NEA change)

  30. Use your Model to predict weight gain given an NEA of 400 (interpolation) Use your Model to predict weight gain given an NEA of 1000 (extrapolation)

  31. Equation of the Least-Squares Regression Line • You can manually calculate the equation of the Least-Squares Regression Line With slope And Intercept

  32. Homework • Exercises 3.29 – 32, 35, 36 • Read Section 3.3

  33. Chapter 3Examining Relationships Section 3.2 Least-Squares Regression (Continued) Section 3.3 Correlation and Regression Wisdom

  34. Key Concept • A residual is the difference between and observed value of the response variable and the value predicted by the regression line. That is, residual = observed y – predicted y

  35. Residual is the distance between actual and predicted y

  36. Example residual plot

  37. Interpreting a Residual Plot • The uniform scatter of points indicates the regression line fits the data well, so the line is a good model.

  38. Interpreting a Residual Plot • The residual have a curved pattern, so a straight line is an inappropriate model

  39. Interpreting a Residual Plot • The response variable y has more spread for larger values of the explanatory variable x, so prediction will be less accurate when x is large.

  40. Create a Residual Plot with Hand-Span Data Follow procedures detailed in Technology toolbox on page 219

  41. Key Concept The coefficient of determination, r2, is the fraction of the variation in the values of y that is explained by least-squares regression of y on x. Eg: ____% of the variation in Height is accounted for by the linear relationship between hand size and Height.

  42. Facts about Least-Squares Regression • Fact 1 – The distinction between explanatory and response variables is essential in regression

  43. Facts about Least-Squares Regression • Fact 2 – There is a close connection between correlation and the slope of the least squares line. The slope is:

  44. Facts about Least-Squares Regression • Fact 3 – The least squares line always passes through the point

  45. Constructing the Least-Squares Example Suppose we have explanatory and response variables and we know that the mean of x=17, mean of y=161.111, sx=19.696, sy=33.479 and the correlation r = .997. Even though we don’t know the actual data, we can still construct the equation for the least-squares line and use it to make predictions.

  46. Constructing the Least-Squares Example So the Least-squares Line has an equation

  47. Facts about Least-Squares Regression • Fact 4 – The correlation r describes the strength of a straight-line relationship. In the regression setting, this description takes a specific form: The square of the correlation, r2 , is the fraction of the variation in the values of y that is explained by the least-squares regression of y on x.

  48. Key Concept • Correlation and regression describe only linear relationships • Extrapolation (using a model outside of the range of the data) often produces unreliable predictions

  49. Outliers and Influential Observations in Regression • An outlier is an observation that lies outside the overall pattern of the other observations • An observation is influential for a statistical calculation if removing it would markedly change the result of the calculation. Points that are outliers in the x direction of a scatter plot are often influential for the least-squares regression line. (Example: Revisit correlation applet)

  50. Child 19 and Child 18 are both outliers. Child 18 is more influential.

More Related