1 / 6

Relations in Categorical Data

Relations in Categorical Data. When a researcher is studying the relationship between two variables, if both variables are numerical then scatterplots, the correlation coefficient and regression analysis are useful tools.

kaden
Download Presentation

Relations in Categorical Data

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Relations in Categorical Data

  2. When a researcher is studying the relationship between two variables, if both variables are numerical then scatterplots, the correlation coefficient and regression analysis are useful tools. But, when the variables are categorical the study of a possible relationship between the variables begins with a two way table. Let’s consider an example where a retailer over a period of time will study the types of wine that are sold. The three types are French, Italian, and the rest of the wines labeled Other. It is also of interest to see if the type of music playing in the store has had an influence on the type of wine purchased. So, when each bottle of wine is sold the type of music playing in the store (None, French music, or Italian music) must be noted. Let’s add some numbers and the table on the next slide.

  3. Music Wine None French Italian French 30 39 30 Italian 11 1 19 Other 43 35 35 You will notice all these numbers add up to 243. This number represents the total number of bottles of wine sold during the study. But, the number also represents the total types of music that were played when the wines were sold. On the next slide I will add a column and a row to the table. The column will contain the actual sum of each type of wine and the total number of bottles and the row will contain the actual number of song types and the total number of songs.

  4. Music Wine None French Italian Total French 30 39 30 99 Italian 11 1 19 31 Other 43 35 35 113 Total 84 75 84 243 You will notice all these numbers add up to 243. This number represents the total number of bottles of wine sold during the study. But, the number also represents the total types of music that were played when the wines were sold. So, the total column is really the distribution on the types of wines sold and we could make pie charts and bar graphs from this information. The column is often called the marginal distribution on the row variable because it is written in the margin. The total row has a similar interpretation.

  5. In this example the real interest of the retailer is understanding types of wine sold. Thus the type of wine sold is the response variable and we put it here in the rows. The variable type of music is thought to explain the wine type sold and is thus the explanatory variable, sometimes called the treatment variable in this example and here put in the columns. To explore the possible relationship between the variables we look at each value of the explanatory variable (here in columns). Then for each number in a column we take each number and divide by the column total. We see this on the next slide. Check my work – please!

  6. Music Wine None French Italian Total French .36 .52 .36 .41 Italian .13 .01 .23 .13 Other .51 .47 .42 .47 Total 1 1 1 1 When no music was playing, wines other than French or Italian made up the majority of wines purchased. French was the second most common, followed by Italian. When French music was playing, more people bought French wine, very few people bought Italian wine, and the percent who bought other wines decreased. When Italian music was playing, French wine made up the same percent of bottles purchased as when no music was playing, but more people bought Italian wine and fewer people bought other wines. So, the music played does seem to matter. French wines sell in greater proportion when French music is played than when not, and the same is true for Italian wines and music.

More Related