1 / 14

3.1: Scatterplots

3.1: Scatterplots. So far: 1-variable statistics. Side-by-side boxplots, stemplots, & histograms made comparisons visible. Now: statistical studies w/more than 1 variable (=bivariate!). We will be concentrating on relationships among several variables for the same group of individuals.

justis
Download Presentation

3.1: Scatterplots

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. 3.1: Scatterplots • So far: 1-variable statistics. Side-by-side boxplots, stemplots, & histograms made comparisons visible. • Now: statistical studies w/more than 1 variable (=bivariate!). We will be concentrating on relationships among several variables for the same group of individuals. • Example: How is an SAT math score related to an SAT verbal score?

  2. Preliminary Questions • What individuals does the data describe? • What are the variables? (How are they measured?) • Are the variables quantitative or categorical? • Do you want to explore the nature of the relationship, or do you think that some of the variables explain – or even cause – changes in the others?

  3. Correlation Info… • r>0 = positive association; r<0 = negative association. r always falls between –1 and 1. Values near 0 = weak linear relationship. As values near 1 or –1, the strength of the relationship increases and points lie closer to a straight line. • r does not change when we change the units of measurement for x, y, or both. • r has no unit of measurement. • r measures the strength of linear relationships only (not curved). • r is strongly affected by outlying observations. • Correlation is not a complete description of 2-variable data; always give the mean and standard deviation along with r.

  4. Differing Correlations

  5. More on the variables… • There may or may not be explanatory or response variables; depends on how you want to use the data. • Calling one variable explanatory and the other response doesn’t necessarily mean that changes in one cause the changes in the other.

  6. Example • Jim wants to know how the median SAT math and verbal scores are related to each other. He doesn’t think either score explains or causes the other. Which one is the explanatory variable? • Julie has the same data. She wants to know if she can predict a state’s SAT math score if she knows its SAT verbal score. Which one is the explanatory and which one is the response?

  7. Principles guiding examination of data • Start with a graph • Look for an overall pattern/deviations from the pattern • Add numerical descriptions of specific aspects of the data • Look for a way to describe the overall pattern very briefly.

  8. Scatterplot • Shows the relationship between two quantitative variables • Always plot the explanatory variable (if there is one) on the horizontal axis (x-axis), and the response variable on the vertical axis (y-axis) of a scatterplot. • If there is no explanatory-response distinction, either variable can go on either axis.

  9. Interpreting Scatterplots: Overall pattern • Form: Linear Direction: Positively Associated Strength: Put in L1 to find r!

  10. Correlation

  11. Is there a correlation between the number of beers a student consumes and their BAC? • Find the vital stats • Describe the form, direction, and strength • Draw the graph

  12. Example • Competitive divers are scored from 1-10. We have the scores of two judges, Ivan and George, on a large number of dives. After calculating, we find that r = .9. However, the mean of Ivan’s scores are 3 points lower than George’s mean. Do these facts contradict each other?

More Related