1 / 20

Statistical Reasoning for everyday life

Statistical Reasoning for everyday life. Intro to Probability and Statistics Mr. Spering – Room 113. 4.2 Shapes of Distribution. CLASS WORK: Worksheet REVIEW ACTIONS are REMEMBERED , WORDS can be FORGOTTEN! MAKE an EFFORT, NOT an EXCUSE. 4.2 Shapes of Distribution. Variation:

Download Presentation

Statistical Reasoning for everyday life

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Statistical Reasoningfor everyday life Intro to Probability and Statistics Mr. Spering – Room 113

  2. 4.2 Shapes of Distribution • CLASS WORK: • Worksheet REVIEW • ACTIONS are REMEMBERED, WORDS can be FORGOTTEN! • MAKE an EFFORT, NOT an EXCUSE

  3. 4.2 Shapes of Distribution • Variation: • Describes how widely data are spread out about the center of a distribution. • ????How would you expect the variation to differ between the heights of NCAA Division 1A Men’s College Basketball Centers and the heights of all High School Boy Basketball Players???? • NCAA Division 1A Centers less variation • High School Boy Basketball Players more variation

  4. 4.3 Measures of Variation • How do we investigate variation? • Study all of the raw data… • Range… • Quartiles… • Five-number summary (BOXPLOT or BOX-and-WHISKER)… • Interquartile range… • Semi-quartile range… • Percentiles… • MAD… • Variance & Standard Deviation…

  5. 4.3 Measures of Variation • RANGE: • The range of a distribution is the difference between the highest and lowest data values.

  6. 4.3 Measures of Variation • Find the range of the data. • 4.1, 5.2, 5.6, 6.2, 7.2, 7.7, 7.7, 8.5, 9.3, 11.0 • Range = 11.0 – 4.1 = 6.9

  7. 4.3 Measures of Variation • Misleading range: • Which Quiz Set has greater variation? Quiz Set 1: 1, 10, 10, 10, 10, 10, 10, 10, 10, 10 Quiz Set 2: 2, 3, 4, 5, 6, 7, 8, 9, 10, 8, 9, 10, 6, 5 ** Even though Set 1 has a greater range than Set 2 (9 > 8). Set 2 has a greater variation because Set 1 contains an outlier. Therefore, we use quartiles.**

  8. 4.3 Measures of Variation • Quartiles: • Quartiles divide the data into four quarters. • Lower Quartile (1st Quartile): is the median of the data values in the lower half of a data set. Exclude the middle value in the data set if the number of data points is odd. • Middle Quartile (2nd Quartile): is the overall median • Upper Quartile (3rd Quartile): is the median of the data values in the upper half of a data set. Exclude the middle value in the data set if the number of data points is odd.

  9. 4.3 Measures of Variation • Find quartiles… Example 1 – Upper and lower quartiles Lower Quartile Q1 Median Q2 Upper Quartile Q3

  10. 4.3 Measures of Variation • Example 2 – Range and quartiles • A year ago, Angela began working at a computer store. Her supervisor asked her to keep a record of the number of sales she made each month. • The following data set is a list of her sales for the last 12 months: • 34, 47, 1, 15, 57, 24, 20, 11, 19, 50, 28, 37 • Use Angela's sales records to find: • the median • b) the range • c) the upper and lower quartiles • Find quartiles…

  11. 4.3 Measures of Variation • Answers • The values in ascending order are: • 1, 11, 15, 19, 20, 24, 28, 34, 37, 47, 50, 57. a) Median  = (6th + 7th observations) ÷ 2            = (24 + 28) ÷ 2            = 26 b) Range = difference between the highest and lowest values              = 57 - 1              = 56

  12. 4.3 Measures of Variation c) Lower quartile = value of middle of first half of data Q1                           = the median of 1, 11, 15, 19, 20, 24                           = (3rd + 4th observations) ÷ 2                           = (15 + 19) ÷ 2                           = 17 d) Upper quartile = value of middle of second half of data Q3                       = the median of 28, 34, 37, 47, 50, 57                       = (3rd + 4th observations) ÷ 2                       = (37 + 47) ÷ 2                       = 42 These results can be summarized as follows:

  13. 4.3 Measures of Variation • Five-number summary: • Consists of the following… • Low Value • Q1 (lower quartile) • Q2 (median) • Q3 (upper quartile) • High Value # Summary

  14. 4.3 Measures of Variation Vertical box plot showing “normal” distribution “FORESHADOWING” • BOXPLOT or BOX-and-WHISKER: • Box plots show variation along the number line. Steps for creating a box plot: • Draw a number line that spans the entire data set. • Above the number line, enclose the values from the lower to the upper quartile in a box. • Draw a line through the box at the value corresponding to the median. • Add “whiskers” extending to the low and high values.

  15. 4.3 Measures of Variation • Example of 5 number summary and box plot. So for the data set 1, 4, 9, 12, 12, 16, 23, 24 here is our box plot:

  16. 4.3 Measures of Variation Digest of BOXPLOTS and SKEWNESS Symmetric Right-Skewed Left-Skewed Q1 Q2 Q3 Q1 Q2 Q3 Q1 Q2 Q3

  17. 4.3 Measures of Variation • Below is a Box-and-Whisker plot for the following data: 0 2 2 2 3 3 4 5 5 10 27 • The data are right skewed, as the plot depicts Min Q1 Q2 Q3 Max 0 2 3 5 27

  18. 4.3 Measures of Variation • Interquartile range: i.e. If the five number summary is low: 3, high: 23, Q1: 4, Q2: 12, Q3: 19. Then the interquartile range is IQR: (Q3-Q1) = (19 – 4) =15. Interquartile range The interquartile range is another range used as a measure of the variation. The difference between upper and lower quartiles (Q3–Q1), which is called the interquartile range, also indicates the dispersion of a data set. The inter-quartile range spans 50% of a data set, and eliminates the influence of outliers because, in effect, the highest and lowest quarters are removed.

  19. 4.3 Measures of Variation • Next Time: • Semi-quartile range… • Percentiles… • MAD… • Variance & Standard Deviation… • According to the box-n-whisker above what are the values for the 5 number summary: • Low: 12 • Q1: 22 • Q2: 31 • Q3: 45 • High: 50

  20. 4.3 Measures of Variation • Classwork: • PRACTICE MAKES PERMANENT • Pg 174 # 2-6 even and # 25-27 (Letters a, b only)

More Related