Statistical Evaluation of Diagnostic Significance and Program Effectiveness

Introductory Statistics for Laboratorians dealing with High Throughput Data sets Centers for Disease Control

Evaluation of a Laboratory Diagnostic Procedure for Mycoplasma pneumoniae • We believe that serum levels of the immunoglobulin M (IgM) antibody may have diagnostic significance for identifying Mycoplasma pneumoniae infection. • The first thing we need to know is whether people who have the pneumonia show higher serum levels of the antibody.

Experimental Design • We will select two groups of subjects • Experimental Group: Persons with clinically defined pneumonia • Control Group: Asymptomatic cases • We will draw serum samples from each person and evaluate the serum level of immunoglobulin M antibody in each sample.

Step 1: State the Null and Alternative Hypotheses • H0: The mean serum level for the experimental group will not be different from the mean serum level for the control group (no difference/ nothing is happening) • Ha: The mean serum level for the experimental group will be different from the mean serum level for the control group (there is a real difference/ something is happening)

Select Statistical Test and Specify the Region of Rejection • We will use a t-test for two independent samples • We will have 20 people in each group (degrees of freedom = 20 + 20 - 2 = 38) • We will reject the null hypothesis if the probability of obtaining a difference at least this large when the null hypothesis is true is less than 5 chances in 100 (alpha = .05)
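The test statistic for this design can be sketched in a few lines of Python. This is a minimal illustration, not the presenters' own computation: it assumes equal variances in the two groups (the classic pooled t-test), the function names `pooled_t` and `reject_null` are hypothetical, and the critical value 2.024 is the two-tailed value for 38 degrees of freedom at alpha = .05 as read from a standard t table.

```python
import math
import statistics


def pooled_t(group_a, group_b):
    """Return (t, df) for two independent samples, assuming equal variances."""
    n_a, n_b = len(group_a), len(group_b)
    df = n_a + n_b - 2  # 20 + 20 - 2 = 38 in the design above
    var_a = statistics.variance(group_a)  # sample variance (n - 1 denominator)
    var_b = statistics.variance(group_b)
    # Pool the two variances, weighting each by its own degrees of freedom.
    pooled_var = ((n_a - 1) * var_a + (n_b - 1) * var_b) / df
    std_error = math.sqrt(pooled_var * (1 / n_a + 1 / n_b))
    t = (statistics.mean(group_a) - statistics.mean(group_b)) / std_error
    return t, df


# Two-tailed critical value for df = 38, alpha = .05 (from a standard t table).
T_CRIT_DF38 = 2.024


def reject_null(t, t_crit):
    """Two-tailed region of rejection: reject H0 when |t| exceeds t_crit."""
    return abs(t) > t_crit
```

Calling `pooled_t(experimental, control)` on two lists of 20 serum levels would return the t statistic together with df = 38, and `reject_null(t, T_CRIT_DF38)` applies the region of rejection specified above.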

Conduct Experiment and Collect Data

Compute the Test Statistic

Accept or Reject H0: • As seen in the previous table, the probability of observing a difference this large if these two means are samples from the same population (that is, if the true difference is zero) is p = .001503 • That is less than our chosen alpha = .05 • Reject the null hypothesis. • Conclude that the experimental group has significantly higher serum levels of IgM
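For readers curious where such a p value comes from, the sketch below converts a t statistic and its degrees of freedom into a two-tailed probability by numerically integrating the t distribution with Simpson's rule. It is an illustrative approximation using only the Python standard library; the function names are hypothetical, and statistical software would normally do this step for you. The t statistic behind p = .001503 is not given in this transcript, so it is not reproduced here.

```python
import math


def t_pdf(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    log_norm = (math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
                - 0.5 * math.log(df * math.pi))
    return math.exp(log_norm - (df + 1) / 2 * math.log1p(x * x / df))


def two_tailed_p(t, df, steps=2000):
    """Approximate two-tailed p value for t (steps must be even)."""
    upper = abs(t)
    if upper == 0:
        return 1.0
    h = upper / steps
    # Simpson's rule for the area under the density between 0 and |t|.
    total = t_pdf(0.0, df) + t_pdf(upper, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    central_area = 2 * (h / 3) * total  # area between -|t| and +|t|
    return max(0.0, 1.0 - central_area)


def decide(p, alpha=0.05):
    """Apply the decision rule: reject H0 when p is below alpha."""
    return "reject H0" if p < alpha else "fail to reject H0"
```

With the value reported on the slide, `decide(0.001503)` returns `"reject H0"`, matching the conclusion above.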

Effectiveness of a Program to Increase Seatbelt Use Among High School Seniors • We have developed a program for high school seniors to increase seatbelt use, and we wish to determine whether the program is effective.

Experimental Design • The school has a separate parking lot for seniors. There is only one entrance, and the students must swipe their ID to enter or leave the lot. A security camera positioned at the entrance photographs every driver as they enter and exit. This system has been in place for a couple of years.

Students and their parents will sign a release granting permission to participate in the study. • Two weeks later, unannounced, we will begin reviewing the security camera data and recording each driver's ID and whether he/she was wearing a seatbelt. • We will record for two weeks before the program is presented. (Pretest) • All seniors will then complete the course and accompanying workbook. • Then we will record for another two weeks. (Posttest)

Each student who regularly drives to school during the period (must drive at least 3 days a week during both pretest and posttest) will become a subject in the experiment. • Each subject's score will be the percent of time they were wearing a seatbelt when they exited the gate • Score = (number of times wearing seatbelt / number of times exiting) * 100 • We will have a pretest score and a posttest score for each person.
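The scoring rule and the eligibility criterion above can be written out as a short sketch. The function names and the per-week day counts used as input are assumptions made for illustration; the slides do not specify how the camera records are stored.

```python
def seatbelt_percent(times_belted, times_exiting):
    """Percent of gate exits on which the driver was wearing a seatbelt."""
    if times_exiting <= 0:
        raise ValueError("subject has no recorded exits")
    return times_belted / times_exiting * 100


def is_eligible(pretest_days_per_week, posttest_days_per_week, min_days=3):
    """True if the student drove at least min_days in every recorded week.

    Each argument is a list with one entry per week of the two-week period,
    giving the number of days the student drove that week (hypothetical format).
    """
    all_weeks = list(pretest_days_per_week) + list(posttest_days_per_week)
    return all(days >= min_days for days in all_weeks)


# Illustrative values only, not data from the study:
# belted on 6 of 8 exits, drove 3 to 5 days in each of the four weeks.
score = seatbelt_percent(6, 8)          # 75.0
eligible = is_eligible([3, 5], [4, 3])  # True
```

Each eligible student ends up with two such scores, one computed from the pretest weeks and one from the posttest weeks.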

Step 1: State the Null and Alternative Hypotheses • H0: The mean percent seatbelt usage on the posttest will not be different from the mean percent seatbelt usage on the pretest. (The program did nothing, nothing happened). • Ha: The mean percent seatbelt usage on the posttest will be different from the mean percent seatbelt usage on the pretest. (The program changed the seatbelt usage, it did something.)

Select Statistical Test and Specify the Region of Rejection • We will use a t-test for paired samples • Paired samples = repeated measures = matched samples = pretest posttest • We will reject the null hypothesis if the probability of obtaining a difference at least this large when the null hypothesis is true is less than 5 chances in 100, i.e., alpha = .05 • In this case we don’t know in advance how many subjects we will get, so we can’t specify the degrees of freedom (number of subjects - 1) until after we finish data collection. That’s OK as long as we specify alpha in advance.
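Because each subject contributes both a pretest and a posttest score, the paired test works on the per-subject differences. The sketch below shows one way to compute it with the Python standard library; the function name `paired_t` and the example scores are hypothetical and are not taken from the study.

```python
import math
import statistics


def paired_t(pretest, posttest):
    """Return (t, df) for a paired-samples t-test on matched score lists."""
    if len(pretest) != len(posttest):
        raise ValueError("each subject needs both a pretest and a posttest score")
    # One difference per subject: posttest minus pretest.
    diffs = [post - pre for pre, post in zip(pretest, posttest)]
    n = len(diffs)
    mean_diff = statistics.mean(diffs)
    std_error = statistics.stdev(diffs) / math.sqrt(n)
    # Degrees of freedom depend on how many subjects qualified: n - 1.
    return mean_diff / std_error, n - 1


# Made-up percent scores for four subjects, for illustration only.
pre_scores = [50, 60, 70, 80]
post_scores = [55, 70, 72, 95]
t_stat, df = paired_t(pre_scores, post_scores)
```

The degrees of freedom fall out of the data as `n - 1`, which is why they cannot be fixed until data collection is finished, while alpha is set beforehand.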

Conduct Experiment and Collect Data

Compute the Test Statistic

Accept or Reject H0: • As seen in the previous table, the probability of observing a difference this large if these two means are samples from the same population (that is, if the true difference is zero) is p = .030838 • That is less than our chosen alpha = .05 • Reject the null hypothesis. • Conclude that the posttest mean is significantly higher than the pretest mean. The program significantly increased seatbelt usage among our high school seniors.
