Regression and correlation
1 / 23

Regression and Correlation - PowerPoint PPT Presentation

  • Uploaded on

Regression and Correlation. BUSA 2100, Sect. 12.0 - 12.2, 3.5. Introduction to Regression. Forecasts (predictions) are often based on the relationship between 2 or more variables. Ex. 1 : Advertising expenditures and sales. Example 2 : Daily high temperature and demand for electricity.

I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
Download Presentation

PowerPoint Slideshow about ' Regression and Correlation' - reed-mitchell

An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.

- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript
Regression and correlation

Regression and Correlation

BUSA 2100, Sect. 12.0 - 12.2, 3.5

Introduction to regression
Introduction to Regression

  • Forecasts (predictions) are often based on the relationship between 2 or more variables.

  • Ex. 1: Advertising expenditures and sales.

  • Example 2: Daily high temperature and demand for electricity.

  • X = independent variable, the variable being used to make a forecast; Y = dependent variable, the variable being forecasted.

  • Identify X and Y in Examples 1 and 2.

  • Y depends on X.

Straight lines
Straight Lines

  • A regression line can be used to show mathematically how variables are related.

Regression example
Regression Example

  • To determine the equation of a line, all we need are the slope and Y-intercept.

  • Example: Pizza House builds restaur-ants near college campuses.

  • Before building another one, it plans to use X = student enrollment (1000s) to estimate Y = quarterly sales ($1000s).

  • A sample of 6 existing restaurants is chosen.

Pizza restaurant problem
Pizza Restaurant Problem

  • Resulting data pairs are shown below.

  • X Y 4 95 6 155 9 140 11 210 12 250 15 260

Scatter diagram line of best fit
Scatter Diagram & Line of Best Fit

  • Draw a scatter diagram on the board.

  • Use a hiatus so that the X, Y axes don’t have to begin at zero. All units must be the same size within axes.

  • By trial and error, draw some lines through the data. The regression line is the one line that fits the data best. (Also called the line of best fit.)

Line of best fit continued
Line of Best Fit (Continued)

  • As indicated earlier, YF is a forecasted value (on the regression line). Y is an actual value (one of the dots).

Regression formulas
Regression Formulas

  • Based on calculus, the equation of a regression line (line of best fit) can be found using these formulas.

Regression formulas page 2
Regression Formulas, Page 2

  • Carry out the numerical coefficients (b1 and b0) 3 or 4 decimal places; then round to 2 or fewer places at the end.

  • Substitute the numbers into the regression equation: YF = b0 + b1X.

  • We will complete the restaurant prob-lem, using a table to organize the data.

Restaurant problem page 2
Restaurant Problem, Page 2

  • X Y XY X2 Y2 4 95 380 16 9025 6 155 930 36 24025 9 140 1260 81 19600 11 210 2310 121 44100 12 250 3000 144 62500 15 260 3900 225 67600 SUM 57 1110 11780 623 226850

Meaning and uses of the regression equation
Meaning and Uses of the Regression Equation

  • Example: Vidalia State University has an enrollment of 9,800. Forecast pizza sales for a restaurant near the campus.

Accuracy of forecasts using regression
Accuracy of Forecasts Using Regression

  • The accuracy of forecasts depends on how closely the points in a scatter diagram fit the regression line.

  • If the linear relationship is too weak (the deviations are too large), there are large forecast errors and there may be no need to pursue use of a regression line.

Evaluating accuracy of regression forecasts
Evaluating Accuracy of Regression Forecasts

  • It is best to have an estimate of forecast accuracy before using a regression line.

  • 3 ways to estimate forecast accuracy:

Introduction to correlation
Introduction to Correlation

  • Def.: The coefficient of correlation (r) is a numerical measure of the strength of the linear relationship between 2 variables.

  • Values of r are always between -1 & 1; i.e., between 0 and 1 in absolute value.

  • r = 0 means no correlation; r = +-1 means perfect correlation; both rare.

Positive and negative correlation
Positive and Negative Correlation

  • Definition:Two variables X, Y have a positive correlation if large values of X tend to be associated with large values of Y; similarly, for small values.

  • X, Y must be measurable quantitatively.

  • Example of positive correlation:

Positive and negative correlation page 2
Positive and Negative Correlation, Page 2

  • Definition:Two variables X, Y have a negative correlation if large values of X tend to be associated with small values of Y, and vice-versa.

  • Example of negative correlation:

  • Graph positive and negative correlation.

High and low correlation
High and Low Correlation

  • General guidelines: Degree of Forecast Correlation Accuracy

    • very high very good high good moderate medium low fair very low poor

Formula for correlation
Formula for Correlation

  • Use regression for forecasts only if r is .70 or larger, in absolute value.

Regression analysis summary
Regression Analysis Summary

  • Steps in regression analysis:

  • (1) Collect data pairs, using 2 related variables.

  • (2) Calculate the correlation, r.

  • (3) (a) If r >= .70, in absolute value, find the regression equation and use it for forecasting.

  • (b) If r < .70, don’t use regression.

Multiple regression
Multiple Regression

  • Regression analysis with one independ-ent variable (X) is called simple regres-sion.

  • Regression analysis with 2 or moreindependent variables (X1, X2, etc.) is called multiple regression.

Multiple regression and line of average relationship
Multiple Regression and Line of Average Relationship

  • State the multiple regression equation.

  • A regression equation is also called the line of average relationship. Explain in terms of GPA example.

  • Correlation does not necessarily imply cause and effect. Illustrate with example.