1 / 20

Displaying & Describing Categorical Data

Displaying & Describing Categorical Data. Chapter 3. Look at different types of data and checking a condition before plugging ahead. Writing clear explanations in context. Independence and the importance to statistics. Simpson’s Paradox: think deeply or be mislead!!. Objective.

lot
Download Presentation

Displaying & Describing Categorical Data

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Displaying & Describing Categorical Data Chapter 3

  2. Look at different types of data and checking a condition before plugging ahead. • Writing clear explanations in context. • Independence and the importance to statistics. • Simpson’s Paradox: think deeply or be mislead!! Objective

  3. Number 1 Rule of Data Analysis? Draw a picture!!! Frequency Table: records totals and categories names!! Frequency Tables

  4. Counts are useful but sometimes we want to know the fraction or proportion of data in each category. • These counts are expressed as percentages. • Relative Frequency Table: displays percentages rather than counts Frequency Tables

  5. Both these types of tables describe the distribution across categories. Frequency Tables

  6. The area principle says that the area occupied by a part of the graph should correspond to the magnitude of the value it represents. • Violations of this principle are a common way to lie with statistics. Area Principle

  7. A bar chart displays the distribution of a categorical variable, showing the counts for each category next to each other for easy comparison. Bar Charts

  8. If we are interested in the relative proportion we could replace the counts with percentages and use a relative frequency bar chart. Bar Charts

  9. Pie Charts show the whole group of cases as a circle . They slice the circle into pieces whose sizes are proportional to the fraction of the whole in each category. Pie Charts

  10. Think before you draw!! • Always check the categorical condition. • If you want to make a relative frequency bar chart or a pie chart, make sure the categories do not overlap. • If they overlap you can still make the chart but your percentages will not add up to 100. What should you pick!!

  11. To look at two categorical variables together, we often arrange the counts in a two-way table. • A contingency table displays counts and, sometimes percentages of individuals falling into named categories on two or more variables. The table categorizes the individuals on all variables at once to reveal possible patterns in one variable that may be contingent on the category of the other. Contingency Tables

  12. In a contingency table, the distribution of either variable alone is called the marginal distribution. The counts or percentages are the totals found in the margins (last row or column) of the table. Contingency Tables

  13. In Jan 2007, a Gallup poll asked 1008 Americans age 18 and over whether they planned to watch the upcoming Super Bowl. The pollster also asked those who planned to watch whether they were looking forward more to seeing the football game or the commercials. The results are summarized in the table Example

  14. What’s the marginal distribution of the responses? Example

  15. The distribution of a variable restricting the “Who” to consider only a smaller group of individuals is called a conditional distribution. • In a contingency table, when the distribution of one variable is the same for all categories of another, we say that the variables are independent. Conditional Distributions

  16. How do the conditional distributions of interest in the commercials differ for men and women? Conditional Distributions

  17. Does it seem that there’s an association between interest in Super Bowl TV coverage and a person’s sex? Example

  18. Segmented Bar Charts treat each bar as the “whole” and divides it proportionally into segments corresponding to the percentage in each group. Segmented Bar Charts

  19. When averages are taken across different groups, they can appear to contradict the overall averages. Simpson’s Paradox

  20. It’s the last inning of the important game. Your team is a run down with bases loaded and two outs. The pitcher is due up, so you’ll be sending in a pinch hitter. There are 2 batters available on the bench. Whom should you send to bat? Example

More Related