
Bi-Variate Data

Bi-Variate Data (PPDAC). We are looking for a variable (a set of data) that is affected by the other data sets in our spreadsheet. This variable is called dependent because its values are affected by the other data sets. Sometimes it is called the response variable (because it responds).

maren

Presentation Transcript


  1. Bi-Variate Data PPDAC

  2. Types of data: We are looking for a variable (a set of data) that is affected by the other data sets in our spreadsheet.
     • This variable is called the dependent variable because its values are affected by the other data sets.
     • It is sometimes called the response variable (because it responds).
     • This variable must be “variable 2” on iNZight, i.e. on the y-axis.

  3. Types of data: The variable on the x-axis is called the explanatory variable or the independent variable. (Diagram labels: y-axis, dependent or response variable; x-axis, explanatory or independent variable.)

  4. Choosing your variables: We can only plot scatter diagrams of numerical data. Non-numerical data such as colours or countries cannot be plotted.

  5. Choosing your variables: You may like to look at ‘advanced’ – ‘Scatter Plot Matrix’ to get an overview of all the combinations of graphs. Look for areas where the fit isn’t good:
     • Clusters
     • Fanning out or in (data points are further away from the trend line)
     • Gaps in data

  6. Drawing Graphs: Import the correct CSV file into iNZight, then click and drag variables into the variable 1 and variable 2 positions.
     • For each scatter diagram you must add a linear trend line and note the equation and ‘r’ value.
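
As a rough illustration of what the trend line equation and ‘r’ value represent, here is a minimal Python sketch of a least-squares line and correlation coefficient. It is not how iNZight is implemented; the function name `linear_trend` and the data values are hypothetical, made up purely for illustration.

```python
from math import sqrt

def linear_trend(xs, ys):
    """Return (slope, intercept, r) for the least-squares line y = slope*x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of squared deviations and cross-deviations from the means
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r = sxy / sqrt(sxx * syy)
    return slope, intercept, r

# Hypothetical values, NOT real marathon data
stride_cm = [90, 100, 110, 120, 130]
marathon_min = [300, 280, 265, 240, 225]

slope, intercept, r = linear_trend(stride_cm, marathon_min)
print(f"time = {slope:.2f} * stride + {intercept:.1f}, r = {r:.3f}")
```

The slope and intercept together give the equation to note down, and r summarises how closely the points follow that line.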

  7. For Achieved
     1. Problem: Write a question that clearly investigates the relationship between two variables.
     2. Plan: I will use iNZight to produce a scatter plot and equation. I will observe the graph to decide if the equation is valid.
     3. Data: Describe the data including the correct units and show some understanding of the context.
     4. Analyse: Use iNZight to draw a scatter graph and produce the trend curve.

  8. For Achieved
     5. Analyse: Describe what you see in the scatter graph (use T.A.R.S.O.G. for this).
     6. Analyse: Describe the relationship between the two variables in terms of “as xxx increases, yyy ...”
     7. Prediction: Make a prediction (interpolation) using the iNZight equation. Is it valid? Is it reliable?
     8. Conclusion: Answer your problem question. Is there a relationship?

  9. Purpose statement (Basic)
     • Problem: This report considers the relationship between stride length and the time to complete a marathon in minutes, for the purpose of predicting the time to run a marathon.
     • Plan: The independent variable is the stride length, which is measured in centimetres. The dependent variable is the marathon time, measured in minutes.
     • Data: The data is a sample taken from marathons in NZ.

  10. 4. Analysis

  11. 5. Analysis
      • T is for trend: is it linear or not?
      • A is for association: is it positive or negative?
      • R is for relationship: is it strong or weak?
      • S is for scatter: is it constant or not? Does it fan?
      • O is for outliers: can you spot any?
      • G is for groups: are there any?

  12. 6. Analysis: As the carat of the diamond increases, the price of the diamond increases. For every one-carat increase, the price increases by approximately $7800.
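
A short sketch of why the gradient of the trend line is read as the change in price per one carat, assuming the trend line is linear. The intercept a and the carat value c are placeholder symbols that the slide does not give; only the gradient of about 7800 comes from the slide.

```latex
\begin{align*}
\widehat{\text{price}}(c) &= a + b\,c, \qquad b \approx 7800 \\
\widehat{\text{price}}(c+1) - \widehat{\text{price}}(c) &= \bigl(a + b(c+1)\bigr) - \bigl(a + b\,c\bigr) = b
\end{align*}
```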

  13. 7. Predictions: Must include
      • An interpolation and an extrapolation
      • A comment about the strength of the prediction (critique)
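
The difference between interpolation and extrapolation can be sketched in Python as follows. This assumes a linear trend line; the function name `predict`, the equation coefficients and the observed x-range are all hypothetical values chosen for illustration, not results from the presentation.

```python
def predict(slope, intercept, x, x_min, x_max):
    """Predict y from the trend line and say which kind of prediction it is."""
    y = slope * x + intercept
    # Inside the observed x-range is interpolation; outside it is extrapolation
    kind = "interpolation" if x_min <= x <= x_max else "extrapolation"
    return y, kind

# Hypothetical trend line and observed range of the explanatory variable
slope, intercept = -1.9, 471.0
x_min, x_max = 90, 130

inside = predict(slope, intercept, 115, x_min, x_max)
outside = predict(slope, intercept, 160, x_min, x_max)
print(inside)
print(outside)
```

An extrapolation is usually less reliable, because nothing in the data shows that the trend continues beyond the observed range.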

  14. 8. Conclusion: Answer your purpose statement by highlighting the key points of the analysis.

  15. Correlation Coefficient: The correlation coefficient is a number between -1 and 1.
      • The sign shows whether the correlation is positive or negative.
      • These are only a guide (different books give different values).
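
For reference, one standard definition of the correlation coefficient is Pearson's r, shown below for n data pairs. The slides do not state which formula iNZight uses, so treating its ‘r’ value as Pearson's r is an assumption.

```latex
r = \frac{\sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})}
         {\sqrt{\sum_{i=1}^{n} (x_i - \bar{x})^2}\,\sqrt{\sum_{i=1}^{n} (y_i - \bar{y})^2}},
\qquad -1 \le r \le 1
```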

  16. Correlation Coefficient: It is only designed to measure linear relationships! (It is not appropriate for curved relationships.) (Diagram: example scatter plots labelled r = 1, r = -1 and r = 0.)
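
A minimal Python sketch of this point, assuming r is Pearson's correlation coefficient. The helper `pearson_r` and the data are hypothetical: two perfect straight lines and one perfect curve (y = x squared) that r fails to detect.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

xs = [-3, -2, -1, 0, 1, 2, 3]
r_up = pearson_r(xs, [2 * x + 1 for x in xs])     # perfect increasing line
r_down = pearson_r(xs, [-2 * x + 1 for x in xs])  # perfect decreasing line
r_curve = pearson_r(xs, [x ** 2 for x in xs])     # perfect curve, symmetric about x = 0
print(r_up, r_down, r_curve)
```

The curved data set is perfectly predictable from x, yet its r is essentially zero, which is why the scatter graph should always be checked before relying on r.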
