1 / 20

1.2 Data Classification

1.2 Data Classification. Definitions. Qualitative data- -consists of attributes, labels, or non-numerical entries. Quantitative data-- consists of numerical measurements or counts. The base prices of several vehicles are shown in the table. Which data are Qualitative data and

nantai
Download Presentation

1.2 Data Classification

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. 1.2 Data Classification

  2. Definitions • Qualitative data--consists of attributes, labels, or non-numerical entries. • Quantitative data-- consists of numerical measurements or counts.

  3. The base prices of several vehicles are shown in the table. Which data are Qualitative data and which are quantitative data? Explain your reasoning. Example 1

  4. Try it Yourself • The populations of several U.S. cities are shown in the table. Which data are qualitative data and which are quantitative data? (Source: U.S. Census Bureau)

  5. Levels of Measurement • Another data characteristic is the data’s level of measurement. The level of measurement determines which statistical calculations are meaningful. • The four levels of measurement, in order from lowest to highest, are:

  6. nominal level of measurement – • qualitative; Data at this level are • categorized using names, labels, • or qualities. No mathematical • computations can be made at this level. 2) Ordinal level of measurement – qualitative or quantitative; Data at this level can be arranged in order, but differences between data entries are not meaningful.

  7. Example 2 Classifying Data by level Two data seta are shown. Which data set consists of data at the nominal level? Which data set consists of data at the ordinal level? Explain your reasoning. Top 5 TV Programs (from 5/14/01 to 5/20/01) 1. E.R. 2. Friends 3. Law and Order 4. West Wing 5. Will & Grace Network Affiliates in Portland, Oregon KATU (ABC) KGW (NBC) KOIN (CBS) KPDX (FOX)

  8. Solution: The first data set lists the rank of five TV programs. The data consists of the ranks 1,2,3,4, and 5. Because the rankings can be listed in order, these data are at the ordinal level. Note that the difference between a rank of 1 and 5 has no mathematical meaning, The second data set consists of the call letters of each network affiliate in Portland. The call letters are simply the names of network affiliates, so these data are at the nominal level.

  9. Try It yourself 2: • Consider the following data sets. For each data set, decide whether the data are at the nominal level or at the ordinal level. • The final standings for the Northeast Division if the National Hockey League • A collection of phone numbers • Identify what each data set represents. • Specify the level of measurement.

  10. Try It Yourself Anwers • (1a) The final standings represent a ranking of hockey teams. (1b) Ordinal (2a) The collection of phone numbers represents labels. No mathematical computations can be made. (2b) Nominal

  11. 3) interval level of measurement— quantitative; The data can be ordered and you can calculate meaningful differences between data entries. At the interval level, a zero entry simply represents a position on a scale (temperature—0° ); the entry is not an inherent zero. 4) ratio level of measurement-- similar to data at the interval level; with the added property that a zero entry is an inherent (none--$0) zero . A ratio of two data values can be formed so one data value can be expressed as a multiple of another.

  12. To distinguish between data at the interval level and at the ratio level, determine whether the expression “ twice as much” has any meaning in the context of the data. For instance, $100 is twice as much as $50, so these data are at the ratio level. On the other hand, 100˚C is not twice as warm as 50˚C, so these data are at the interval level.

  13. Example 3 Classifying Data by Level Two data sets are shown at the left. Which data set consists of data at the interval level? Explain your reasoning.(Source: major league baseball)

  14. Solution Both of these data sets contain quantitative data. Consider the dates of the Yankees’ World series victories, It makes sense to find differences between specific dates. For instance, the time between the Yankees’ first and last World series victories is 200 –1923 = 77 But it does not make sense to write a ratio using these dates. So, these data are at the interval level. Using the home run totals, you can find differences and write ratios. From the data, you can see that New York hit fifty-five more home runs than Kansas city hit, and that Anaheim hit about twice as many home runs as Minnesota hit. So these data are at the ratio level.

  15. Try It Yourself 3 • Decide whether the data are at the interval level or at the ratio level. • 1) The body temperatures (F) of an athlete who is exercising • 2) The heart rates, in beats per minute, of an athlete who is exercising • Identify what each data set represents • Specify the level of measurement

  16. Try It Yourself Answers • (1a) The collection of body temperatures represents data that can be ordered, but makes no sense written as a ratio. • (1a) Interval • (2a) The collection of heart rates represents data that can be ordered and written as a ratio that makes sense. • (2a) Ratio

  17. Temperatures are always “interval.”

  18. Inches (measurements of height and length) are always “ ratio.”

More Related