1 / 27

Chapter 3 Descriptive Statistics: Numerical Methods

Chapter 3 Descriptive Statistics: Numerical Methods. Measures of Variability Measures of Relative Location and Detecting Outliers Exploratory Data Analysis Measures of Association Between Two Variables. . . %. x. Measures of Variability.

pchoi
Download Presentation

Chapter 3 Descriptive Statistics: Numerical Methods

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Chapter 3 Descriptive Statistics: Numerical Methods • Measures of Variability • Measures of Relative Location and Detecting Outliers • Exploratory Data Analysis • Measures of Association Between Two Variables   % x

  2. Measures of Variability • It is often desirable to consider measures of variability (dispersion), as well as measures of location. • For example, in choosing supplier A or supplier B we might consider not only the average delivery time for each, but also the variability in delivery time for each. • Range • Inter-quartile Range • Variance • Standard Deviation • Coefficient of Variation

  3. Measures of Variation Variation Range Variance Standard Deviation Coefficient of Variation Population Standard Deviation Interquartile Range Population Variance Sample Variance Sample Standard Deviation

  4. Measures of variation give information on the spread or variability of the data values. Variation Same center, different variation

  5. Range • Simplest measure of variation • Difference between the largest and the smallest observations: Range = xmaximum – xminimum Example: 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 Range = 14 - 1 = 13 Chap 3-5

  6. Example: Apartment Rents • Range Range = largest value - smallest value Range = 615 - 425 = 190

  7. Interquartile Range • The interquartile range of a data set is the difference between the third quartile and the first quartile. • It is the range for the middle 50% of the data.

  8. Example: Apartment Rents • Interquartile Range 3rd Quartile (Q3) = 525 1st Quartile (Q1) = 445 Interquartile Range = Q3 - Q1 = 525 - 445 = 80

  9. Variance • The variance is a measure of variability that utilizes all the data. • It is based on the difference between the value of each observation (xi) and the mean (x for a sample, m for a population).

  10. Variance • The variance is the average of the squared differences between each data value and the mean. • If the data set is a sample, the variance is denoted by s2. • If the data set is a population, the variance is denoted by  2.

  11. Variance for Grouped Data • Sample Data • Population Data

  12. Standard Deviation • Most commonly used measure of variation • Shows variation about the mean • The standard deviation of a data set is the positive square root of the variance. • If the data set is a sample, the standard deviation is denoted s. • If the data set is a population, the standard deviation is denoted  (sigma).

  13. Calculation Example:Sample Standard Deviation Sample Data (Xi) : 10 12 14 15 17 18 18 24 n = 8 Mean = x = 16

  14. Coefficient of Variation • Measures relative variation • Always in percentage (%) • Shows variation relative to mean • Is used to compare two or more sets of data measured in different units Population Sample

  15. Example: Apartment Rents • Variance • Standard Deviation • Coefficient of Variation

  16. Measures of Relative Locationand Detecting Outliers • z-Scores • Detecting Outliers

  17. z-Scores • The z-score is often called the standardized value. • It denotes the number of standard deviations a data value xi is from the mean. • A data value less than the sample mean will have a z-score less than zero. • A data value greater than the sample mean will have a z-score greater than zero. • A data value equal to the sample mean will have a z-score of zero.

  18. Example: Apartment Rents • z-Score of Smallest Value (425) Standardized Values for Apartment Rents

  19. Detecting Outliers • An outlier is an unusually small or unusually large value in a data set. • A data value with a z-score less than -3 or greater than +3 might be considered an outlier. • It might be an incorrectly recorded data value. • It might be a data value that was incorrectly included in the data set.

  20. Example: Apartment Rents • Detecting Outliers The most extreme z-scores are -1.20 and 2.27. Using |z| > 3 as the criterion for an outlier, there are no outliers in this data set. Standardized Values for Apartment Rents

  21. Exploratory Data Analysis • Five-Number Summary

  22. Five-Number Summary • Smallest Value • First Quartile • Median • Third Quartile • Largest Value

  23. Example: Apartment Rents • Five-Number Summary Lowest Value = 425 First Quartile = 450 Median = 475 Third Quartile = 525 Largest Value = 615

  24. Measures of Association between Two Variables • Covariance • Correlation Coefficient

  25. Covariance • The covariance is a measure of the linear association between two variables. • Positive values indicate a positive relationship. • Negative values indicate a negative relationship.

  26. Covariance • If the data sets are samples, the covariance is denoted by sxy. • If the data sets are populations, the covariance is denoted by .

  27. Correlation Coefficient • The coefficient can take on values between -1 and +1. • Values near -1 indicate a strong negative linear relationship. • Values near +1 indicate a strong positive linear relationship. • If the data sets are samples, the coefficient is rxy. • If the data sets are populations, the coefficient is .

More Related