1 / 31

Describing Data Using Numerical Measures

Describing Data Using Numerical Measures. Mean. The mean is a numerical measure of the center of a set of quantitative measures computed by dividing the sum of the values by the number of values in the data set. Population Mean. where:  = population mean (mu) N = number of data values

tlowry
Download Presentation

Describing Data Using Numerical Measures

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Describing Data Using Numerical Measures

  2. Mean The mean is a numerical measure of the center of a set of quantitative measures computed by dividing the sum of the values by the number of values in the data set.

  3. Population Mean where:  = population mean (mu) N = number of data values xi = ith individual value of variable x

  4. Population MeanExample 3-1 Table 3-1: Foster City Hotel Data

  5. Population MeanExample 3-1 The population mean for the number of rooms rented is computed as follows:

  6. Sample Mean where: = sample mean (pronounced “x-bar”)n = sample size xi = ith individual value of variable x

  7. Sample MeanHousing Prices Example {xi} = {house prices} = {$144,000; 98,000; 204,000; 177,000; 155,000; 316,000; 100,000}

  8. Median The median is the center value that divides data that have been arranged in numerical order (i.e. an ordered array) into two halves.

  9. MedianHousing Prices Example {xi} = {house prices} = {$144,000; 98,000; 204,000; 177,000; 155,000; 316,000; 100,000} Ordered array: $98,000; 100,000; 144,000; 155,000; 177,000; 204,000; 316,000 Middle Value Median = 155,000

  10. MedianAnother Housing Prices Example {xi} = {house prices} = {$144,000; 98,000; 204,000; 177,000; 155,000; 316,000; 100,000; 177,000; 177,000; 170,000} Ordered array: $98,000; 100,000; 144,000; 155,000; 170,000; 177,000; 177,000; 177,000; 204,000; 316,000 Middle Values Median = (170,000 + 177,000)/2 = 173,500

  11. Skewed Data Right-skewed data: Data are right skewed if the mean for the data is larger than the median. Left-skewed data: Data are left skewed if the mean for the data is smaller than the median.

  12. Skewed Data(Figure 3-3) Median Mean Mean Median Mean = Median a) Right-Skewed b) Left-Skewed c) Symmetric

  13. Percentiles The pth percentile in a data array is a value that divides the data into two parts. The lower segment contains at least p% and the upper segment contains at least (100 - p)% of the data. The median is the 50th percentile.

  14. Quartiles Quartiles in a data array are those values that divide the data set into four equal-sized groups. The median corresponds to the second quartile.

  15. Measures of Variation A set of data exhibits variation if all of the data are not the same value.

  16. Range The range is a measure of variation that is computed by finding the difference between the maximum and minimum values in the data set. R = Maximum Value - Minimum Value

  17. Interquartile Range The interquartile range is a measure of variation that is determined by computing the difference between the first and third quartiles. Interquartile Range = Third Quartile - First Quartile

  18. Variance & Standard Deviation The population variance is the average of the squared distances of the data values from the mean. The standard deviation is the positive square root of the variance.

  19. Population Variance where: = population mean N = population size 2 = population variance (sigma squared)

  20. Population Variance(Bryce Lumber Example)

  21. Population Standard Deviation(Bryce Lumber Example)

  22. Sample Variance where: = sample mean n = sample size s2 = sample variance

  23. Sample Standard Deviation where: = sample mean n = sample size s = sample standard deviation

  24. The Empirical Rule If the data distribution is bell-shaped, then the interval: contains approximately 68% of the values in the population or the sample contains approximately 95% of the values in the population or the sample contains virtually all of the data values in the population or the sample

  25. The Empirical Rule(Figure 3-11) 95% 68% X

  26. Tchebysheff’s Theorem Regardless of how the data are distributed, at least (1 - 1/k2) of the values will fall within k = 1 standard deviations of the mean. For example: At least (1 - 1/12) = 0% of the values will fall within k=1 standard deviation of the mean At least (1 - 1/22) = 3/4 = 75% of the values will fall within k=1 standard deviation of the mean At least (1 - 1/32) = 8/9 = 89% of the values will fall within k=1 standard deviation of the mean

  27. 6 Sigma Quality • Specification for a quality characteristic is six standard deviation away from the mean of the process distribution. • Translates into process output that does not meet specifications two out of one billion times.

  28. Sigma Quality Levels s Sigma ( ) Defects per Million Quality Level Opportunities for Defects s 317,400 1 s 45,400 2 s 2700 3 s 63 4 s 0.57 5 s 0.002 6

  29. Sigma Quality Level Concepts s Sigma ( ) Equated to Quality Level Relative Area s 1 Floor space of a typical factory s 2 Floor space of a typical supermarket s 3 Floor space of a small hardware store s 4 Floor space of a typical living room s 5 Area under a typical desk telephone s 6 Top surface of a typical diamond s 7 Point of a sewing needle

  30. Standardized Data Values A standardized data value refers to the number of standard deviations a value is from the mean. The standardized data values are sometimes referred to as z-scores.

  31. Standardized Data Values STANDARDIZED SAMPLE DATA where: x = original data value = sample mean s = sample standard deviation z= standard score

More Related