# Statistics - PowerPoint PPT Presentation

1 / 37

Statistics. Summary numbers, or indices, that result from an analysis of data (numbers) – pg. 2 All the procedures and tools used to organize and interpret facts, events, and observations that can be expressed numerically – pg. 2. Imaginary World. Snap. Crackle. Imaginary World #2. Snap.

I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.

Statistics

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.

- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -

### Statistics

• Summary numbers, or indices, that result from an analysis of data (numbers) – pg. 2

• All the procedures and tools used to organize and interpret facts, events, and observations that can be expressed numerically – pg. 2

Snap

Crackle

Snap

Crackle

if

and

or

then

Pop

### Variables

• n. an element, feature, or factor that is liable to vary or change

• adj. not consistent or having a fixed pattern; liable to change

### The Goal of Statistics(and all of science)

• Figure out how things vary together.

• If you succeed, you can answer the questions:

• How, why, when, where, what

Q: Why is little Sally afraid of dogs?

A: What are the variables, and how do they vary together?

(the presence or absence of dogs, Sally’s emotional state, Sally’s neural structure as it relates to these other events, Sally’s history and how it relates to the above elements, etc.)

Q: Why does my roommate drink my milk and leave the empty jug in the refrigerator?

### Making Sense of the Variability

Break it down:

• Independent Variables: the variables we are manipulating (or using for categorization)

• E.g., shock intensity

• Dependent Variables: the variables we are measuring

• E.g., amplitude of scream

IV

DV

### Unaccounted For Variables

• Variables that we do not know the status of

• Because we didn’t bother to include them

• E.g., eye color

• Because we couldn’t include them (even if we wanted to)

• E.g., exact electro-chemical state of the nervous system before, during, and after the shock

### Example 1

• Octavian is interested in the effect of breathing exercises on time spent on task.

• What is the independent variable?

• Breathing exercises

• What is the dependent variable?

• Breathing exercises

### Example 2

• Adelpha wants to know if pronunciation can be improved if one practices speaking with marbles in ones mouth.

• What is the independent variable?

• Clarity of pronunciation

• Practice with marbles

• What is the dependent variable?

• Clarity of pronunciation

• Practice with marbles

### Unaccounted For Variables: What To Do?

• The influence of unaccounted for variables can be controlled for by designing your study in such a way that unaccounted for variables are not likely to influence one group or condition more than any other group or condition.

• How is this accomplished?

### Design

• Random Selection

• Random = each member of the population has an equal chance of being selected.

• If you stop here, you can’t make causal statements.

• Random Assignment

• Random = each selected participant has an equal chance of being exposed to each condition.

• This is the only way you can make causal statements.

• This is an experiment.

### Example

• Romulus selects the first 50 students that walk into the Bernhard Center to participate in a study on WMU students. The students are asked if they are smokers or nonsmokers and then they fill out a short survey that is supposed to measure their general level of anxiety.

• Do we have random selection , random assignment , neither , or both ?

### What’s wrong with it?

• No random selection: not all students go to the Bernhard Center on a regular basis, and many might only go late in the afternoon. (selection bias)

• No random assignment: students were already in the category of interest, smoker or nonsmoker, so the IV was not manipulated. (not an experiment)

### So what?

• No random selection: means that you cannot say anything about the target population (WMU students).

• No random assignment: means that you cannot make causal statements about the IV on the DV (e.g., you can’t say that smoking caused students to have a higher level of anxiety).

• This experiment is a big failure, no matter how good the statistical analysis is.

### What’s the Lesson?

Design Trumps Analysis

• Whenever possible, a study should include both random selection and random assignment.

• If you can’t randomly assign, your study might still be useful, but you can’t make causal statements.

### Statistics and Variability

Variability ≠ Randomness

Randomness is an Illusion

### Complexity

• Variability is brought about by COMPLEXITY, not randomness.

• Human behavior is very complex (i.e., there are many relevant variables), but because it is orderly, we can predict and influence behavior.

• Statistics makes this prediction and influence much easier.

### Population vs. Sample

Population

Every person or object in a group (past, present, future, often infinite)

• Random

• simple

• stratified

Sample

Part of the population

Biased

Summary Measure = Parameter

Ex: μ = pop. mean

σ2 = pop. variance

σ = pop. standard deviation

Summary Measure = Statistic

_

Ex: X = sample mean

s2 = sample variance

s = sample standard deviation

Notice: Greek symbols are used for parameters, Roman symbols for statistics

### Why Sample?

Population

Inference

Sampling

Sample

We want to make inferences about the population.

### Inference

Definition: a conclusion reached on the basis of evidence [in our case, sample data] and reasoning [in our case, statistical analysis]

### Why not take data onthe population?

• The population might be huge.

• If the population is U.S. adults over age 65 you would need to take data on about 40 million people (also see below).

• Part of the population might be inaccessible.

• A study on infants born this year should apply to infants born next year.

• A study on heroin addicts should apply to people who were addicted to heroin but died of an overdose.

• But if you can get population data, you should.

• If I am interested in how many hours students in my class spend studying for quizzes, I can (and should) take population data.

INFER

### Inferential vs. Descriptive Statistics

Inferential Statistics: using statistical methods to make inferences about the population given sample data

Ex: How likely is it that my depression intervention results (from my sample) will generalize to the population?

Descriptive Statistics: using statistical methods to describe a set of data

Ex: What is the mean depression score on the Hamilton Rating Scale for clients I am working with?

DESCRIBE

### Sampling Error

Sampling error: the error caused by observing a sample instead of the population.

• Represented by the formula:

• statistic – parameter = sampling error

• Ex:

• The ONLY way to avoid sampling error entirely is to take data on the entire population

• What can we do to reduce sample error?

1. Random Sampling

2. Large Samples

3. Multiple Samples

### 4. Stratified Random Sampling

• Decide which characteristic of the population you will use for stratification. (Ex: sex)

• Randomly sample from each strata.

• The number drawn from each strata should be proportional to the number in each strata in the population.

Ex: Total pig population (16 pigs):

You think the color of the pig might be an important variable, but you can only afford to do the study with half of the pigs. How many black and how many pink pigs would you include in your study?

That’s right! Six pink pigs and two black pigs.

### Try This

Male: 3,500,000

What should your

Stratified sample

look like if your

total sample size is

100?

Population:

People age 16-65

living in the U. A. E.

Female: 2,000,000

U. A. E.

I wrote the number of females as a fraction of the total population, and put my unknown over my sample size on the other side.

Here’s how I solved it:

So, your sample should look like this:

Male: 64

Then I solved for X.

Female: 36

### With and Without Replacement

• Sampling with replacement means that after each person/object is selected, it is returned to the population so that it could be selected again.

• This is done so that the probability of an individual being selected remains constant.

• Sampling without replacement means that after each person/object is selected, it is not returned to the population and, therefore, cannot be selected again.

• In most behavioral studies, this is the only way to sample. Why is that?

### Measurement

Definition: the assignment of numbers or labels to objects or events

Labels? How is that measurement?

### 1. Nominal Scales

• Naming of objects or events

• Naming ≠ measurement

• Naming allows us to categorize and deal with individuals or groups separately

• Labeling is required to get frequency data, which is type of measurement

### Nominal Scales

• If we took the last seven digits of each of your phone numbers and averaged them, what could we learn?

• NOTHING. Be careful, just because it’s a number doesn’t mean it’s not a name.

• What if we wrote down the area code of each student in class and found the number of times each area code occurs (frequency)?

• This would work! Let’s try it.

### 2. Ordinal Scales

• Naming objects or events and putting them in order

• Ranking or rank ordering

### Ordinal Scales

4

3

2

1

?

?

?

Ordinal scales give us limited data.

### Interval Scales

• Naming of objects or events and putting them in order using units with equal intervals

• The distance between 1 and 2 should be exactly the same as the distance between 2 and 3 or 3 and 4.

• Common example: temperature in degrees Fahrenheit or Celcius

### Interval Scales

• In psychology: whether or not Likert scales are interval or ordinal scales is an issue of debate.

• Why do you think this might be?

Note: We will treat them as interval scales in this course, but only for convenience.

### Ratio Scales

• Naming of objects or events and putting them in order using units with equal intervals on a scale with atrue zero

• Ask yourself, does zero really mean nothing?

• Common example: temperature in degrees Kelvin

### Homework

• Prepare for Quiz #2 over Chapters 1 & 2.