1 / 40

Unit 1 Introduction to Statistics

Unit 1 Introduction to Statistics. Math 3 Ms. C. Taylor. Warm-Up. How do I find the mean and median of a data set?. Introduction to Statistics. How do I understand statistics as a process for making inferences about population parameters based on a random sample from that population?

Download Presentation

Unit 1 Introduction to Statistics

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Unit 1 Introduction to Statistics Math 3 Ms. C. Taylor

  2. Warm-Up • How do I find the mean and median of a data set?

  3. Introduction to Statistics • How do I understand statistics as a process for making inferences about population parameters based on a random sample from that population? • How do I recognize the purposes of and difference among statistical data gathering methods?

  4. What is/are statistics? • Statistics is a way of reasoning, along with a collection of tools and methods, designed to help us understand the world. • Statistics are particular calculations made from data

  5. Population Data or Sample Data? • Population data is used when you are gathering data from every individual of interest. • Ex: Asking the entire football team a question • Sample data is used when you are gathering data from some of the individuals of interest. • Ex: Asking only the offensive line a question and apply it to the entire football team

  6. Population Data or Sample Data? • You want to know the average GPA of North Forsyth students, so you ask all of the students in all of your classes. • Population • Sample

  7. Parameter vs. Statistic • A statistic is a descriptive measure computed from a sample of data. • A parameter is a descriptive measure computed from an entire population of data. • Inferential statistics enables you to make an educated guess about a population parameter based on a statistic computed from a sample randomly drawn from that population.

  8. Correlation vs. Causation • Correlation: The degree to which two or more measurements on the same group of elements show a tendency to vary together. • Causation: The degree to which one causes the other to happen. • Correlation does not imply causation • Example: There is correlation between the number of people wearing shorts and temperature. More people wearing shorts doesn’t cause higher temperatures!

  9. Ways to Gather Data • Survey – a questionnaire used to collect interesting data on a certain topic from a sample of people. • EX: You want to find out how many students in your class had a summer job. • EX: The government wants to determine average household income in the United States. • EX: You want to know if tattoos have an influence on a person’s GPA.

  10. Ways to Gather Data • Observational Study – we observe individuals and measure variables of interest but do not attempt to influence the responses. Observational Studies may show a correlation between variables, but cannot always guarantee causation. • EX: A study of child care enrolled 1364 infants in 1991 and planned to follow them through their sixth year in school. In 2003, the researchers published an article finding that “the more time children spent in child care from birth to age four-and-a-half, the more adults tended to rate them, both at age four-and-a-half and at kindergarten, as less likely to get along with others, as more assertive, as disobedient, and as aggressive.”

  11. Ways to Gather Data • Experiment – we deliberately impose some treatment on (that is, do something to) individuals in order to observe their responses. Experiments can carry more convincing evidence of a cause and effect relationship. • EX: “Take the Pepsi Challenge” – in the 80’s Pepsi had a huge marketing scheme that had people do a blind taste test to see which soda they preferred – Pepsi or Coke. • EX: Does Vitamin C reduce the causes of getting a common cold?

  12. Warm-Up • Determine if the following is a survey, experiment, or observational study. • Students are asked about their favorite music. • Walking in a classroom to see if technology is efficiently used. • Deciding if Claritin or Allegra is the best allergy medicine.

  13. Sampling • When conducting a survey, experiment, or observational study, it is almost impossible to survey everyone in a population so people use various sampling methods to gather information. • One major concern about sampling methods is whether it is a biased or unbiased method to gather information.

  14. Sampling Methods • Random sampling: when everyone in a population has an equal chance of being chosen in the experiment. • Stratified sampling: when the population is first divided into similar categories and the number of members in each category is determined. • Systematic sampling: when you determine a method for which to choose members of the population (assign numbers to the population and then choose every 5th person to participate) • Cluster sampling: when you randomly put the population into clusters and then choose a cluster randomly and then randomly choose people in that cluster to participate.

  15. Example if selecting 10 animals from 25 dogs, 15 cats, and 10 rabbits • Random sampling: when everyone in a population has an equal chance of being chosen in the experiment. Randomly selecting 10 from all 50 animals • Stratified sampling: when the population is first divided into similar categories and the number of members in each category is determined. Select 5 from 25 dogs, 3 from 15 cats and 2 from the rabbits • Systematic sampling: when you determine a method for which to choose members of the population (assign numbers to the population and then choose every 5th person to participate) Give every animal a random number and then choose every 5th number • Cluster sampling: when you randomly put the population into clusters and then choose a cluster randomly and then randomly choose people in that cluster to participate. Randomly put the animals into 2 groups of 25, choose a group, and then choose 10 from that selected group.

  16. Biased Questions • Some questions may use language that people can associate with emotions: • How much of your time do you waste on Facebook? • Do you prefer the wonderful math class or boring Shakespeare Class? • Some questions may refer to a majority or supposed authority: • Would you agree with the NCAE that teachers should be paid more for earning their master’s degree? • Phrased awkwardly: • Do you disagree with people who oppose the ban on smoking in public places?

  17. Sampling Bias • Occurs when one or more sub groups of a population are either over represented or under represented when conducting a survey or experiment. It must be random and fair selection • Voluntary: People voluntarily turn in a survey • Convenience: Questioner stays in one place • Exclusion: Only asking certain members • Under representation: Not getting 1/6 or at least 30 people • Non-randomness: Calling the first five people on every page of the phonebook. • Self-selection: People choose groups • Lack of double-blinding: Sampler knows which group/product the person is selecting

  18. Errors in Summarizing Data • No causation of effect: The cause could be affected by something other than what is being studied. (correlation only) • (Ex: Frog with no legs are deaf) • No causation of accurate population: Applying your results to the population incorrectly. • (Ex: Just because 85% of this class like math, it doesn’t mean that 85% of all students at this school like math.) • www.tylervigen.com shows correlations no causations

  19. Resources used: • "Next: Introduction to Data and Measurement Issues Surveys and Samples." CK-12 Foundation. N.p., n.d. Web. 21 Aug. 2013. • Yates, Daniel S., David S. Moore, and Daren S. Starnes. The Practice of Statistics: TI-83/84/89 Graphing Calculator Enhanced. New York: W.H. Freeman, 2008. Print. • Greg Fisher – Mount Tabor High School • Christina Holst – Parkland High School • Wendy Bartlett – Parkland High School

  20. Warm-Up • What percentage of the data lie within one standard deviation of the mean? • What percentage of the data lie within two standard deviations of the mean? • What percentage of the data lie within three standard deviations of the mean?

  21. Standard Deviation • First, find the mean (average) of the data set • The way to get the variance is to subtract the mean from EACH value • Square the each of step 2’s results • Find the mean of the squared results (variance) • Take the square root of the variance

  22. Warm-Up • What is the standard deviation for the set of data? 2, 4, 3, 6, 5

  23. Probability • Denoted by P(Event) P(E) = favorable outcomes total outcomes • This method for calculating probabilities is only appropriate when the outcomes of the sample space are equally likely.

  24. Experimental Probability • The relative frequency at which a chance experiment occurs • Flip a fair coin 30 times & get 17 heads i.e. 17/30

  25. Basic Rules of Probability • Rule 1.Legitimate Values For any event E, 0 < P(E) <1 • Rule 2.Sample space If S is the sample space, P(S) = 1

  26. Rules Continued • Rule 3.Complement For any event E, P(E) + P(not E) = 1 • Rule 4.Addition If two events E & F are disjoint, P(E or F) = P(E) + P(F) (General) If two events E & F are not disjoint, P(E or F) = P(E) + P(F) – P(E & F)

  27. Example #1 • A large auto center sells cars made by many different manufacturers. Three of these are Honda, Nissan, and Toyota. (Note: these are not simple events since there are many types of each brand.) Suppose that P(H) = .25, P(N) = .18, P(T) = .14. • Are these disjoint events? yes • P(H or N or T) = • P(not (H or N or T) =

  28. Example #2 • Musical styles other than rock and pop are becoming more popular. A survey of college students finds that the probability they like country music is .40. The probability that they liked jazz is .30 and that they liked both is .10. What is the probability that they like country or jazz? • P(C or J) =

  29. Independent • Two events are independent if knowing that one will occur (or has occurred) does not change the probability that the other occurs • A randomly selected student is female - What is the probability she plays soccer for PWSH? • Independent • A randomly selected student is female - What is the probability she plays football for PWSH? • Not Independent

  30. Rules Continued • Rule 5.Multiplication If two events A & B are independent, • General rule:

  31. Example #3 • A certain brand of light bulbs are defective five percent of the time. You randomly pick a package of two such bulbs off the shelf of a store. What is the probability that both bulbs are defective? • Can you assume they are independent? P(D & D)=

  32. Conditional Probability • A probability that takes into account a given condition P P

  33. Example #4 • In a recent study it was found that the probability that a randomly selected student is a girl is .51 and is a girl and plays sports is .10. If the student is female, what is the probability that she plays sports? • P

  34. Warm-Up • What is the probability of the following events? • Probability that I roll a 4. • Probability that I get a heads. • Probability that I roll a 1.

  35. Probability • Probabilities are written as: • Fractions from 0 to 1 • Decimals from 0 to 1 • Percents from 0% to 100%

  36. Probability If an event is certain to happen, then the probability of the event is 1 or 100%. If an event will NEVER happen, then the probability of the event is 0 or 0%. If an event is just as likely to happen as to not happen, then the probability of the event is ½, 0.5 or 50%.

  37. Probability Impossible Unlikely Equal Chances Likely Certain 0 0.5 1 0% 50% 100% ½

  38. Fundamental Counting Principle • The fundamental counting principle says that the number of possible combinations is the product of the items multiplied together. • Example: Ms. Taylor has 12 shirts, 5 pairs of jeans, and 6 pairs of shoes to choose from, how many outfits are possible? • 12 * 5 * 6 = 360 outfits

  39. Permutations • Order DOES matter! • Example: I have 245 students that competed in a mathematics game. There is a first, second, and third place winner, how many ways can I choose the winners?

  40. Combinations • Order DOESN’T matter! • Joe has 54 species of animals and he wants to pick 5 different ones, how many combinations are possible?

More Related