
More on data transformations


mahina

Presentation Transcript


  1. More on data transformations: no recipes, but some advice.

  2. If the primary problem is non-linearity, look at a scatter plot of the data to suggest plausible transformations. It is possible to use transformations other than ln(x) and ln(y).

  3. Try fitting [equation missing] if the trend in your data follows either of these patterns [figure missing].

  4. Try fitting [equation missing] if the trend in your data follows either of these patterns [figure missing].

  5. Try fitting [equation missing] if the trend in your data follows either of these patterns [figure missing].

  6. Try fitting [equation missing] if the trend in your data follows either of these patterns [figure missing].

  7. Try fitting [equation missing] if the trend in your data follows any of these patterns [figure missing].

  8. If the variances are unequal and/or error terms are not normal, try a “power transformation” on y.

  9. Family of power transformations. A power transformation on y involves transforming the response by taking it to some power λ. That is, y is replaced by y^λ. Most commonly, for interpretation reasons, λ is a number between -1 and 2, such as -1, -0.5, 0, 0.5, (1), 1.5, and 2. When λ = 0, the transformation is taken to be the natural log transformation. That is, y is replaced by ln(y).

  10. If the variances are unequal, try “stabilizing the variance” by transforming y.

  11. If the response y is a Poisson count… A common (now archaic?) recommendation is to transform the response using the square root transformation, sqrt(y), and stay within the linear regression framework. Perhaps, now, the advice should be to use Poisson regression.

  12. If the response y is a binomial proportion... A common (now archaic?) recommendation is to transform the response using the arcsine transformation, arcsin(sqrt(y)), and stay within the linear regression framework. Perhaps, now, the advice should be to use a form of logistic regression.

  13. If the response y isn’t anything special… A common recommendation is to try the natural log transformation, ln(y), or the reciprocal transformation, 1/y.

  14. It’s okay to remove some data points to make the transformation work better. Just make sure you report the scope of the model.

  15. It’s better to give up some model fit than to lose clear interpretations. Just make sure you report that that’s what you did.
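
The transformations named in slides 9 and 11 to 13 can be sketched in a few lines of Python. This is only an illustration, not code from the presentation: the function names `power_transform` and `arcsine_transform` and the list `LAMBDAS` are hypothetical, and the sketch assumes a non-negative response so that fractional powers stay real-valued.

```python
import math

# Candidate powers listed on slide 9 (lambda = 1 leaves y unchanged).
LAMBDAS = [-1, -0.5, 0, 0.5, 1, 1.5, 2]


def power_transform(y, lam):
    """Return y**lam, using ln(y) when lam == 0 (hypothetical helper)."""
    # Assumption of this sketch: the response is non-negative.
    if y < 0:
        raise ValueError("this sketch only handles y >= 0")
    # ln(y) and negative powers are undefined at y == 0.
    if y == 0 and lam <= 0:
        raise ValueError("y must be > 0 when lam <= 0")
    if lam == 0:
        return math.log(y)
    return y ** lam


def arcsine_transform(p):
    """Return arcsin(sqrt(p)) for a proportion p in [0, 1] (hypothetical helper)."""
    if not 0 <= p <= 1:
        raise ValueError("p must be a proportion between 0 and 1")
    return math.asin(math.sqrt(p))


if __name__ == "__main__":
    y = 4.0
    for lam in LAMBDAS:
        print(f"lambda = {lam:>4}: {power_transform(y, lam):.4f}")
```

With these helpers, `power_transform(y, 0.5)` is the square root transformation from slide 11, `power_transform(y, 0)` is the natural log, and `power_transform(y, -1)` is the reciprocal from slide 13. After transforming, the response would be refit with ordinary linear regression and the residual plots checked again.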
