1 / 24

SIMPLE REGRESSION AND CORRELATION

SIMPLE REGRESSION AND CORRELATION.

Download Presentation

SIMPLE REGRESSION AND CORRELATION

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. SIMPLE REGRESSION AND CORRELATION Both regression and correlation between two sets of variables measure strength of relationship. In the case of linear regression, we will examine the amount of variability in one variable (Y, the dependent variable) that is explained by changes in another variable (X, the independent variable). Specifically, we will look for straight line or linear changes in Y as X changes. Regression analysis is usually done in situation in which we have control of the X variable and can measure it essentially without error. For simplicity, we will avoid discussing curvilinear relationship between variables.

  2. Regression and correlation.., Correlation analysis is used when both the variables are experimental and measured with error. It is more preliminary than regression analysis and generally measure the correlationship between two variables of interest. Let us consider two examples to further highlight the differences between regression and correlation analysis.

  3. Example 10.1 A biology student wishes to determine the relationship between temperature and heart rate (heart beat/minute) in the common leopard frog. He manipulates the temperature in 20 C increments ranging from 2 to 180C and records the heart rate at each interval. His data are presented in table form below

  4. Example 10.1 … How should he proceed to describe the relationship between these variables (temp. and heart rate) ? Clearly the two variables have functional dependence – as the temperature increases the heart rate increases. Here the temperature is controlled by the student and can take exactly the same values in another experiment with a different frog. Temperature is the INDEPENDENT or “predictor” variable (X). Heart rate is determined by temperature and is, therefore, the DEPENDENT variable or “response” variable (Y).

  5. Example 10.2 A biologist interested in the morphology of west Indian Chitons and he measured the length and width of each of 10 chitons as

  6. Example 10.2…. This data set is fundamentally different from the data in Example 10.1 because neither variable is under biologist’s control. To try to predict length from width is as logical as to try to predict width from length. Both variables are free to vary (Fig 10.2). A correlational study is more appropriate here than a regression analysis. Because some of the calculations are similar, regression and correlation are often confused.

  7. Fig 10.2

  8. SIMPLE LINEAR REGRESSION We assume

  9. Linear Model Assumptions • X’s are fixed and measured without error • The expected or mean value for the variable Y for a given value of X is described by a linear function where and are constant real numbers and . and represent the intercept and slope, respectively, of the linear relationship between X and Y.

  10. Linear Model Assumptions 3. For any fixed value of X, there may be several corresponding values of the dependent variable Y. For example, for fixed temperature several frogs may show several results. However, we assume that for any such given below Xi , the Yi ‘s are independent of each other and normally distributed. We can represent each Yi value as Y is described as the expected value ( ) plus a deviation (ei) from that expectation. We assume eis are normally distributed error terms with a mean of zero.

  11. Linear Model Assumptions… 4. The variances of the distributions of Y for different values of X are assumed to be equal. To describe the experimental regression relationship between Y and X we need to do the following • Graph the data to ascertain that an apparent linear relationship exists • Find the best fitting straight line for the data set. • Test whether or not the fitted line explains a significant portion of the variability in Y i.e. test whether the linear relationship is real or not.

  12. Regression coefficient The regression coefficient or slope (b) Y changes for every unit change in X. Therefore, b has unit as the original data set have. If we have the value of ‘b’ we can calculate the value of ‘a’ from

  13. Calculation of b Referring to the example of temperature and heart rate relationship in frog we have n=9 b= 1.78. THIS MEANS, FOR EVERY 1 DEGREE CHANGE IN TEMP., THERE IS 1.78 BIT/MIN HEART RATE INCREASES OR DECREASES.

  14. Simple Linear Correlation Analysis Correlation analysis is used to measure the intensity of association observed between any pair of variables. We are largely concerned with whether two variables are interdependent or co-vary. Here we do not express one variable as a function of the other and do not imply that Y is dependent on X as we did with regression analysis. Both X and Y are measured with error and we wish to estimate the degree to which these variables vary together.

  15. …Correlation A widely used index of the association of two quantitative variables is Pearson Product-Moment Correlation Coefficient, usually called correlation coefficient (r).

  16. ….Correlation -1≤r≤1 , r2 = r2 = Coefficient of determination. Correlation coefficients and the corresponding coefficients of determination

  17. Correlation… The standard error of the coefficient is sr = Using this standard error we can develop a test of hypothesis for Ho: = 0 Ha: ≠ 0 with the test statistic With v = n-2

  18. Example 10.4 : Analysis of example 10.2 as a correlation problem Let X be the chiton length (cm) and Y be the chiton width (cm). The data for the problem and the preliminary calculations

  19. Example 10.4

  20. Correlation…. Test whether there is a significant correlation with α = 0.05 and v = n-2 = 10-2 = 8 So the test statistic is

  21. Correlation

  22. Solve the problem • Followings are the records of amount of feed ingested (kg) and live weight (Kg) of broilers. Test whether there is any significant correlation between amount of feed intake and body weight. How much weight gains a broiler out of 1 kg feed. r=0.726, sig (2-tailed) p<0.017

More Related