1 / 24

Data Transformations

Data Transformations. Data Transformations. For some data sets, it may be necessary to transform variables e.g. change units (lb to kg, ˚ C to ˚ F, etc.) This is simply a change in the scale, and such transformations are called ‘Linear’.

Download Presentation

Data Transformations

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Data Transformations

  2. Data Transformations • For some data sets, it may be necessary to transform variables • e.g. change units (lb to kg, ˚C to ˚F, etc.) • This is simply a change in the scale, and such transformations are called ‘Linear’. • Linear transformations consist of (1) multiplying all the observations by a constant, (2) adding a constant to all observations, or (3) both.

  3. Data Transformations • Multiplicative transformation example • Y = weight in kg • Y’ = weight in lb • Y’ = 2.2Y • Additive transformation example • Measurements of nitrate (mg/l) → Y • Y = 0.3, 0.35, 0.5, 0.42, 0.38, 0.56… • Add 1 to each number → Y’ • Y’ = 1.3, 1.35, 1.5, 1.42, 1.38, 1.56…

  4. Data Transformations • Additive and Multiplicative example • Body temperature measurements in ˚C (Y) were taken for 47 women; if we convert to ˚F (Y’): • Y’ = 1.8Y + 32 • Multiplicative transformations affect S in the same way that they affect the mean: • e.g., if mean Y = 22, and mean Y’ = 2.2Y • then SY’ = 2.2SY

  5. Data Transformations • Additive transformations, however, don’t affect S

  6. Data Transformations • Additive transformations thus effectively move probability distributions to the left or the right – but the shape of the histogram is unchanged. • Multiplicative transformations shrink or stretch the probability distribution

  7. Nonlinear Transformations • These sorts of transformations affect data in more complex ways. • Examples:

  8. Nonlinear Transformations • These transformations do change the essential shape of frequency distributions • They are thus used to try and make distributions more symmetric – i.e., are tools to achieve normality.

  9. Transformations to achieve normality • If the distribution is skewed to the right (the most common problem) then each of the following transformations will help produce a more symmetric distribution. • The transformations are listed in order of how much they will pull in a right-skewed distribution.

  10. Transformations to achieve normality • Percentage or proportion data is a special case – it often appears binomially distributed • e.g., 0-100%, 0-1 • Here the appropriate transformation is:

  11. Results • Tables and figures - must have a purpose

  12. Results: Tables • When to use: • Present numerical values • Large amounts of information • Rules • Numbered consecutively • Must be able to stand alone • Vertical arrangement • Title goes above the table • Definitions/’explanations’ go below the table

  13. “Bad Table” Table 6. Growth rate of cell cultures and activity of ornithine decarboxylase (ODC) and succinate dehydrogenase (SDH) in Pseudomonas aeruginosa in response to various carbon sources

  14. “Good Table” Table 7. Growth rate of cell cultures and activity of ornithine decarboxylase (ODC) and succinate dehydrogenase (SDH) in Pseudomonas aeruginosa in response to various carbon sources

  15. Table 4. Response of male fighting fish (Betta splendens) to their image in a mirrora aPrior to the experiment, fish had been visually isolated from one another for 2 wk. Observation period for each fish was 30 s.

  16. Results: Figures • Use to illustrate important points • summarize your data • Number graphs consecutively • separately from tables • Must be able to stand alone • Titles go below figure or on separate “Figure Legends” page • Know when to use specific types of graphs • Bar graph vs histogram • Scatter plot vs line graph

  17. Bar graph (refer to page 57) Problems?

  18. Bar graph (refer to page 57) Cleared quadrat Control quadrat

  19. Results: Graphs • Do not forget to include error bars • Is your data significant? • Are there differences • Complete figure legend

  20. Cleared quadrat Control quadrat Figure 2. Production of flowers by three species of plants in the absence of interspecific competition and under natural conditions

  21. Cleared quadrat Control quadrat Figure 2. Production of flowers by three species of plants in the absence of interspecific competition (cleared quadrats) and under natural conditions (control quadrats). The plants were Campanula rapunculoides, Epilobium angustifolium, and Hieracium aurantiacum. Plotted are means for eight randomly chosen quadrats. Each 1 x 1 m2.

  22. Text • Data summary • Do not discuss or draw conclusions • Statistics • Incorporate statistics into the verbal text • Be careful when using the word “significant” • Refer to appropriate tables and figures • When do you use “Figure” and when do you use “Fig.”?

  23. As shown in Figure 1, the shoreline of Hicks Pond was generally predominated by grasses and sedges. Observed frequencies of turtles obtaining food differed significantly from expected frequencies (x2=58.19, df=8, P<0.001; Fig. 2).

More Related