1 / 26

Descriptive Statistics Measures of Center

Descriptive Statistics Measures of Center. Essentials Notation Measures of Center Mean Median Mode Mid-range Example Which to Use: Mean vs. Median vs. Mode Additional Topics (not addressed). E ssentials: Measures of Center (The great mean vs. median conundrum.).

kwinkler
Download Presentation

Descriptive Statistics Measures of Center

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Descriptive StatisticsMeasures of Center Essentials Notation Measures of Center Mean Median Mode Mid-range Example Which to Use: Mean vs. Median vs. Mode Additional Topics (not addressed)

  2. Essentials: Measures of Center(The great mean vs. median conundrum.) • Be able to identify the characteristics of the median, mean and mode, and to which types of data each applies. • Be able to calculate the median, mean and mode, as appropriate, for a set of data. • Affected by vs. resistant to extreme values. What are the implications for the mean and median?.

  3. Some Notation •  denotes the addition of a set of values • X (capital)is the variable usually used to represent the individual data values • xi(small letter) represents a single value of a variable from the first value, x1, to the last value xn • n represents the number of data values in a sample • N represents the number of data values in a population

  4. Measures of Center • Measures of Central Tendency • Indicate where the center or most typical value of a data set lies • Are often thought of as averages • Include the Mean, Median, Mode, and Midrange

  5. The Mean (Arithmetic) The Formula: • The “average” of a set of data. • Is the sum of the observations divided by the number of observations. • Is used only with quantitative data.

  6. Population Mean vs. Sample Mean • A Sample Mean is represented by the lower case letter x with a bar above it (called x-bar) • A Population Mean is represented by the lower case Greek letter m (mu)

  7. Median • The middle observation in a set of data. • Divides the data such that 50% of the observations lie below the median and 50% lie above it. • Is used only with quantitative data. • To obtain the median, the data must be placed in increasing order.

  8. MEDIAN: The Formula First: Arrange the scores in increasing order. Second: Apply the formula (n+1)/2. (Where n is the number of data values.) • If there is an ODD number of scores, the middle score is the value of the Median. • e.g: 1, 3, 6 => Median is (n+1)/2 = (3+1)/2 = 2 (position). So, the Median is value in the second position of the list of values. Here the second value is the number 3. • If there is an EVEN number of scores, the Median lies between the two middle scores. • e.g: 1, 2, 8, 15 => Median is (n+1)/2 = (4+1)/2 = 2.5 (position). So, the Median is the data value that lies 1/2 way between the second and third data values. Here that value would be 5. Remember, the formula computes a position, not a data value.

  9. Calculating a Median: • Determine the median for the following backpack weights: • Backpack weights (lb): 10, 14, 12, 18, 32, 15, 22, 19, 23, 61.

  10. MODE: The Formula • The most frequently occurring score in a data set. • Obtain the frequency of each value. • A Frequency Table based upon Single-Value Grouping or a Dot Plot would display this information. • Used with both qualitative and quantitative data. • It is the only measure of center for qualitative data. • There may be more than one Mode • If there are two modes, the data set is bimodal. • If there are more than two modes, the data set is multimodal. • If there is the same number of each value, then there is no mode

  11. Midrange • The Midrange is a measure of center of a distribution. It indicates the value midway between the highest and lowest values in a data set. To find the midrange. Highest Value + Lowest Value 2

  12. Example: Comparing the Mean, Median, and Mode Find the mean, median, and mode of the sample ages of a class shown. Which measure of central tendency best describes a typical entry of this data set? Are there any outliers? Source: Larson/Farber 4th ed.

  13. Solution: Comparing the Mean, Median, and Mode Mean: Median: Mode: 20 years (the entry occurring with thegreatest frequency) Source: Larson/Farber 4th ed.

  14. Solution: Comparing the Mean, Median, and Mode Mean ≈ 23.8 yrs. Median = 21.5 yrs. Mode = 20 yrs. • The mean takes every entry into account, but is influenced by the outlier of 65. • The median here was determined by taking the middle two entries into account, and it is not affected by the outlier. • In this case the mode exists, but it doesn't appear to represent a typical entry. Source: Larson/Farber 4th ed.

  15. Solution: Comparing the Mean, Median, and Mode Sometimes a graphical comparison can help you decide which measure of central tendency best represents a data set. In this case, it appears that the median best describes the data set. Source: Larson/Farber 4th ed.

  16. Mean vs. Median vs. Mode • Which is the best Measure of Center???? • MEAN: • Is sensitive to the influence of extreme scores (outliers), which will “pull” the mean away from the center. • Involves ALL data values in the calculation • MODE: • May not be anywhere near the center of the data. • Not really aimed at finding the middle of the data. • Is the ONLY “Measure of Center” for Qualitative Data. • MEDIAN: • Is resistant to the influence of extreme values. • Only uses One or Two points in its calculation.

  17. Additional Topics

  18. Weighted Means • Weighted Mean – a mean computed with different scores assigned different weights. To find the weighted mean

  19. Weighted Example: Finding a Weighted Mean You are taking a class in which your grade is determined from five sources: 50% from your test mean, 15% from your midterm, 20% from your final exam, 10% from your computer lab work, and 5% from your homework. Your scores are 86 (test mean), 96 (midterm), 82 (final exam), 98 (computer lab), and 100 (homework). What is the weighted mean of your scores? If the minimum average for an A is 90, did you get an A? Source: Larson/Farber 4th ed.

  20. Solution: Finding a Weighted Mean Your weighted mean for the course is 88.6. You did not get an A. Source: Larson/Farber 4th ed.

  21. Weighted Means Example • Calculating a GPA. • Given the following four grades, calculate the semester GPA. • Statistics A (of course; 3 CrHrs; numeric value for an A = 4) • History B (3 CrHr; B = 3) • Physics C (3 CrHr; C = 2) • Physical Education C (1 CrHr) • The grade numeric equivalents are the x values. The credit hour values are the weights. • Calculate the student’s GPA.

  22. Finding a Mean From a Frequency Table (Grouped Data) When we view data in a frequency table, it is impossible to know the exact values falling in a particular class. To find this value, obtain the product of each frequency and class midpoint (here “x”), add the products, and then divide by the sum of the frequencies.

  23. Finding the Mean of a Frequency Distribution • Find the midpoint of each class. • Find the sum of the products of the midpoints and the frequencies. • Find the sum of the frequencies. • Find the mean of the frequency distribution. In Words In Symbols Source: Larson/Farber 4th ed.

  24. Example: Find the Mean of a Frequency Distribution Use the frequency distribution to approximate the mean number of minutes that a sample of Internet subscribers spent online during their most recent session. Source: Larson/Farber 4th ed.

  25. Example: Find the Mean of a Frequency Distribution Use the frequency distribution to approximate the mean number of minutes that a sample of Internet subscribers spent online during their most recent session. Source: Larson/Farber 4th ed.

  26. End of Slides

More Related