1 / 25

CHAPTER 5 INTRODUCTORY CHI-SQUARE TEST

CHAPTER 5 INTRODUCTORY CHI-SQUARE TEST This chapter introduces a new probability distribution called the chi-square distribution. This chi-square distribution will be used in carrying out hypothesis to analyze whether:

Download Presentation

CHAPTER 5 INTRODUCTORY CHI-SQUARE TEST

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. CHAPTER 5 INTRODUCTORY CHI-SQUARE TEST • This chapter introduces a new probability distribution called the chi-square distribution. • This chi-square distribution will be used in carrying out hypothesis to analyze whether: i. A sample could have come from a given type of POPULATION DISTRIBUTION. ii. Two nominal variable/categorical variable could be INDEPENDENT and HOMOGENEOUS of each other. • The chi-square test that will be discussed are: i. Goodness-of-fit Test ii. The Chi-square Test For Homogeneity iii. The Chi-square Test For Independence

  2. 1) Goodness-of-fit Test • In Goodness-of-fit test, chi-square analysis is applied for the purpose of examine whether sample data could have been drawn from a population having a specific probability distribution. • In Goodness-of-fit test, the test procedures are appropriate when the following conditions are met : i. The sampling method is simple random sampling. ii. The population is at least 10 times as large as the sample. iii. The variable under study is categorical (qualitative variable). iv. The expected value (ei) for each level of the variable is at least 5.

  3. The table frequency distribution layout: where • Test procedure to run the Goodness-of-fit test: 1. State the null hypothesis and alternative hypothesis 2. Determine: i. The level of significance, ii. The degree of freedom, Find the value of from the table of chi-square distribution

  4. 3. Calculate the value of where the • Determine the rejection region to reject : • or • . 5. Make decision/conclusion.

  5. Example: The authority claims that the proportions of road accidents occurring in this country according to the categories User attitude (A), Mechanical Fault (M), Insufficient Sign Board (I) and Fate (F) are 60%, 20%, 15% and 5% respectively. A study by an independent body shows the following data. Can we accept the claim at significance level ? Solution: 1.

  6. 2. 3. • Since . Thus we accept • We conclude that we have no evidence to reject the claim.

  7. Example: The number of students playing truancy in a school over 200 school days is showing below. If X is a random variable representing the number of students playing truancy per day, test the hypothesis that X follows the Poisson distribution with mean 3 per day at Solution:

  8. 3) 4) Since 4.472<15.086, so we accept Ho. 5) We conclude that there is not enough evidence to reject the claim.

  9. 2) The Chi-Square Test for Homogeneity • The homogeneity test is used to determine whether several populations are similar or equal or homogeneous in some characteristics. • This test is applied to a single categorical variable from two different population • The test procedure is appropriate when satisfy the below conditions : i. For each population, the sampling method is simple random sampling ii. Each population is at least 10 times as large as the sample iii. The variable under study is categorical iv. If sample data are displayed in contingency table (population x category levels), the expected value (ei) for each cell of the table is at least 5.

  10. Two dimensional Contingency Table layout: • The above is contingency table (r x c) wherer denotes as the number of categories of the row variable,c denotes as the number of categories of the column variable • is the observed frequency in cell i, j • be the total frequency for row category i • be the total frequency for column category j • be the grand total frequency for all cell (i, j)

  11. Test procedure to run Chi-square test for homogeneity: • State the null hypothesis and alternative hypothesis Eg: 2. Determine: i. The level of significance, ii. The degree of freedom, where Find the value of from the table of chi-square distribution 3. Calculate the value of using the formula below:

  12. 4. Determine the rejection region TO REJECT Ho: i. If ii. If p – value approach; 5. Make decision

  13. Example: Four machines manufacture cylindrical steel pins. The pins are subjected to a diameter specification. A pin may meet the specification or it may be too thin or too thick. Pins are sampled from each machine and the number of pins in each category is counted. Table below presents the results. Test

  14. Solution: 1. 2. From table of chi-square: • Construct a contingency table: Calculation of the expected frequency:

  15. Using the observed and expected frequency in the contingency table, we calculate using the formula given:

  16. 4. 5.

  17. Exercise: 200 female owners and 200 male owners of Proton cars selected at random and the colour of their cars are noted. The following data shows the results: Use a 1% significance level to test whether the proportions of colour preference are the same for female and male.

  18. 3) Chi-Square Test for Independence • This test is applied to a single population which has 2 categorical variables. • To determine whether there is a significant association between the 2 categorical variables. • Eg : In an election survey, voter might be classified by gender (female and male) and voting preferences (democrate ,republican or independent) . This test is used to determine whether gender is related to voting preferences. • The test is appropriated if the following are met : 1. The sampling method is simple random sampling ii. Each population is at least 10 times as large as the sample iii. The variable under study is categorical iv. If sample data are displayed in contingency table (population x category levels), the expected value for each cell of the table is at least 5.

  19. Note: The procedure for the Chi-square test for independence is the same as the Chi-square test for homogeneity. The only different between these two test is at the determination of the null and alternative hypothesis. The rest of the procedure are the same for both tests. This theorem is useful in testing the following hypothesis:

  20. Example: Insomnia is disease where a person finds it hard to sleep at night. A study is conducted to determine whether the two attributes, smoking habit and insomnia disease are dependent. The following data set was obtained. Use a 5% significance level to conduct the study.

  21. Solution: The contingency table 1. 2.

  22. 3. Using the observed and expected frequency in the contingency table, we calculate using the formula given:

  23. Since • We conclude that the smoking habit and insomnia disease are not independent.

  24. Exercise: A study is conducted to determine whether student’s academic performance are independent of their active in co-curricular activities. The following data set was obtained: Use a 5% significance level to conduct the study.

More Related