1 / 19

Introductory Statistics for Laboratorians dealing with High Throughput Data sets

Introductory Statistics for Laboratorians dealing with High Throughput Data sets. Centers for Disease Control. Evaluation of a Laboratory Diagnostic Procedure for Mycoplasma pneumoniae.

Download Presentation

Introductory Statistics for Laboratorians dealing with High Throughput Data sets

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Introductory Statistics for Laboratorians dealing with High Throughput Data sets Centers for Disease Control

  2. Evaluation of a Laboratory Diagnostic Procedure for Mycoplasmapneumoniae • We believe that serum levels of the immunoglobulin M antibody may have diagnostic significance for identification of Mycoplasmapneumoniae. • First thing we need to know is if people who have the pneumonia show higher serum levels of the antibody.

  3. Experimental Design • We will select two groups of subjects • Experimental Group: Persons with clinically defined pneumonia • Control Group: Asymptomatic cases • We will draw serum samples from each person and evaluate the serum level of immunoglobulin M antibody in each sample.

  4. Step 1: State the Null and Alternative Hypotheses • H0: The mean serum level for the experimental group will not be different from the mean serum level for the control group (no difference/ nothing is happening) • Ha: The mean serum level for the experimental group will be different from the mean serum level for the control group (there is a real difference/ something is happening)

  5. Select Statistical Test and Specify the Region of Rejection • We will use a t-test for two independent samples • We will have 20 people in each group (degrees of freedom = 38) • We will reject the null hypothesis if the probability of it being true is less that 5 chances in 100 (alpha = .05)

  6. Conduct Experiment and Collect Data

  7. Compute the Test Statistic

  8. Compute the Test Statistic

  9. Accept or Reject H0: • As seen in the previous table, the probability that these two means are samples from the same population (that the difference is zero) is • p = .001503 • That is less than our chosen alpha = .05 • Reject the Null hypothesis. • Conclude that the experimental group has significantly higher serum levels of IgM

  10. Effectiveness of a Program to Increase Seatbelt Use Among High School Seniors • We have developed a program for use with High School seniors to increase seatbelt use and wish to determine if the program is effective.

  11. Experimental Design • The school has a separate parking lot of seniors. There is only one entrance and the students must swipe their ID to enter or leave the lot. A security camera positioned at the entrance photographs every driver as they enter and exit. This system has been in place for a couple of years.

  12. Students and their parents will sign a release granting permission to participate in the study. • Two weeks later, unannounced, we will begin reviewing the security camera data and recording the drivers ID and if he/she was wearing a seatbelt. • We will record for 2 weeks before the program is presented. (Pretest) • All seniors will then complete the course and accompanying workbook. • Then we will record for another two weeks. (Posttest)

  13. Each student who regularly drives to school during the period (must drive at least 3 days a week during both pretest and posttest) will become subjects in the experiment. • Subjects score will be the percent of time they were wearing a seatbelt when they exited the gate • Number of times wearing seatbelt/number of times exiting * 100 • We will have a pretest score and a posttest score for each person.

  14. Step 1: State the Null and Alternative Hypotheses • H0: The mean percent seatbelt usage on the posttest will not be different from the mean percent seatbelt usage on the pretest. (The program did nothing, nothing happened). • Ha: The mean percent seatbelt usage on the posttest will be different from the mean percent seatbelt usage on the pretest. (The program changed the seatbelt usage, it did something.)

  15. Select Statistical Test and Specify the Region of Rejection • We will use a t-test for paired samples • Paired samples = repeated measures = matched samples = pretest posttest • We will reject the null hypothesis if the probability that it could be true is less than 5 chances in 100, ie: • Alpha = .05 • In this case we don’t know in advance how many subjects we will get so we can’t specify the degrees of freedom until after we finish data collection. That’s OK as long as you specify alpha.

  16. Conduct Experiment and Collect Data

  17. Compute the Test Statistic

  18. Compute the Test Statistic

  19. Accept or Reject H0: • As seen in the previous table, the probability that these two means are samples from the same population (that the difference is zero) is • p = .030838 • That is less than our chosen alpha = .05 • Reject the Null hypothesis. • Conclude that the Posttest mean is significantly higher than the Pretest mean. The program significantly increased seatbelt usage among our Highschool Seniors.

More Related