Lecture 3 descriptives graphing lecturer james neill
Download
1 / 87

Research Methods & Design in Psychology - PowerPoint PPT Presentation

Lecture 3 Descriptives & Graphing Lecturer: James Neill Research Methods & Design in Psychology Overview Univariate descriptives & graphs Non-parametric vs. parametric Non-normal distributions Properties of normal distributions Graphing relations b/w 2 and 3 variables

loader
I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
capcha

Download Presentation

Research Methods & Design in Psychology

An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -

Presentation Transcript


Lecture 3 descriptives graphing lecturer james neill l.jpg

Lecture 3

Descriptives & Graphing

Lecturer: James Neill

Research Methods & Design in Psychology


Overview l.jpg
Overview

Univariate descriptives & graphs

Non-parametric vs. parametric

Non-normal distributions

Properties of normal distributions

Graphing relations b/w 2 and 3 variables


Empirical approach to research l.jpg
Empirical Approach to Research

A positivistic approach ASSUMES:

the world is made up of bits of data which can be ‘measured’, ‘recorded’, & ‘analysed’

Interpretation of data can lead to valid insights about how people think, feel and behave


What do we want to describe l.jpg
What do we want to Describe?

Distributional properties of variables:

Central tendency(ies)

Shape

Spread / Dispersion


Basic univariate descriptive statistics l.jpg
Basic Univariate Descriptive Statistics

Central tendency

Mode

Median

Mean

Spread

  • Interquartile Range

  • Range

  • Standard Deviation

  • Variance

Shape

  • Skewness

  • Kurtosis


Basic univariate graphs l.jpg
Basic Univariate Graphs

Bar Graph – Pie Chart

Stem & Leaf Plot

Boxplot

Histogram


Measures of central tendency l.jpg
Measures of Central Tendency

Statistics to represent the ‘centre’ of a distribution

Mode (most frequent)

Median (50th percentile)

Mean (average)

Choice of measure dependent on

Type of data

Shape of distribution (esp. skewness)


Measures of central tendency8 l.jpg
Measures of Central Tendency

Mode

Median

Mean

Nominal

X

Ordinal

X

X

Interval

X

X

X

Ratio

X?

X

X


Measures of dispersion l.jpg
Measures of Dispersion

Measures of deviation from the central tendency

Non-parametric / non-normal:range, percentiles, min, max

Parametric:SD & properties of the normal distribution


Measures of dispersion10 l.jpg
Measures of Dispersion

Range, Min/Max

Percentiles

SD

Nominal

Ordinal

X

Interval

X

X

X?

Ratio

X

X

X


Describing nominal data l.jpg
Describing Nominal Data

Frequencies

Most frequent?

Least frequent?

Percentages?

Bar graphs

Examine comparative heights of bars – shape is arbitrary

Consider whether to use freqs or %s


Frequencies l.jpg
Frequencies

Number of individuals obtaining each score on a variable

Frequency tables

graphically (bar chart, pie chart)

Can also present as %





Bar chart do you believe in god l.jpg
Bar chart: Do you believe in God?





Slide20 l.jpg
Mode

Most common score - highest point in a distribution

Suitable for all types of data including nominal (may not be useful for ratio)

Before using, check frequencies and bar graph to see whether it is an accurate and useful statistic.


Describing ordinal data l.jpg
Describing Ordinal Data

Conveys order but not distance (e.g., ranks)

Descriptives as for nominal (i.e., frequencies, mode)

Also maybe median – if accurate/useful

Maybe IQR, min. & max.

Bar graphs, pie charts, & stem-&-leaf plots


Stem leaf plot l.jpg
Stem & Leaf Plot

  • Useful for ordinal, interval and ratio data

  • Alternative to histogram


Box whisker l.jpg
Box & whisker

  • Useful for interval and ratio data

  • Represents min. max, median and quartiles


Describing interval data l.jpg
Describing Interval Data

Conveys order and distance, but no true zero (0 pt is arbitrary).

Interval data is discrete, but is often treated as ratio/continuous (especially for > 5 intervals)

Distribution (shape)

Central tendency (mode, median)

Dispersion (min, max, range)

Can also use M & SD if treating as continuous


Describing ratio data l.jpg
Describing Ratio Data

Numbers convey order and distance, true zero point - can talk meaningfully about ratios.

Continuous

Distribution (shape – skewness, kurtosis)

Central tendency (median, mean)

Dispersion (min, max, range, SD)



The four moments of a normal distribution l.jpg
The Four Moments of a Normal Distribution

Mean

<-Kurt->

<-SD->

<-Skew

Skew->


The four moments of a normal distribution28 l.jpg
The Four Moments of a Normal Distribution

Four mathematical qualities (parameters) allow one to describe a continuous distribution which as least roughly follows a bell curve shape:

  • 1st = mean (central tendency)

  • 2nd = SD (dispersion)

  • 3rd = skewness (lean / tail)

  • 4th = kurtosis (peakedness / flattness)


Mean 1 st moment l.jpg
Mean (1st moment )

  • Average score

  • Mean =  X / N

  • Use for ratio data or interval (if treating it as continuous).

  • Influenced by extreme scores (outliers)


Standard deviation 2 nd moment l.jpg
Standard Deviation (2nd moment )

  • SD = square root of Variance

    =  (X - X)2

    N – 1

  • Standard Error (SE) = SD / square root of N


Skewness 3 rd moment l.jpg
Skewness (3rd moment )

  • Lean of distribution

  • +ve = tail to right

  • -ve = tail to left

  • Can be caused by an outlier

  • Can be caused by ceiling or floor effects

  • Can be accurate (e.g., the number of cars owned per person)


Skewness 3 rd moment32 l.jpg
Skewness (3rd moment )

  • Negative skew

  • Positive skew




Kurtosis 4 th moment l.jpg
Kurtosis (4th moment )

  • Flatness or peakedness of distribution

  • +ve = peaked

  • -ve = flattened

  • Be aware that by altering the X and Y axis, any distribution can be made to look more peaked or more flat – so add a normal curve to the histogram to help judge kurtosis


Kurtosis 4 th moment36 l.jpg
Kurtosis (4th moment )

Red = Positive (leptokurtic)

Blue = negative (platykurtic)


Key areas under the curve for normal distributions l.jpg
Key Areas under the Curve for Normal Distributions

  • For normal distributions, approx. +/- 1 SD = 68%+/- 2 SD ~ 95%+/- 3 SD ~ 99.9%



Types of non normal distribution l.jpg
Types of Non-normal Distribution

  • Bi-modal

  • Multi-modal

  • Positively skewed

  • Negatively skewed

  • Flat (platykurtic)

  • Peaked (leptokurtic)




Rules of thumb in judging severity of skewness kurtosis l.jpg
Rules of Thumb in Judging Severity of Skewness & Kurtosis

  • View histogram with normal curve

  • Deal with outliers

  • Skewness / kurtosis <-1 or >1

  • Skewness / kurtosis significance tests










Skewed distributions the mode median mean l.jpg
Skewed Distributions& the Mode, Median & Mean

+vely skewed mode < median < mean

Symmetrical (normal) mean = median = mode

-vely skewed mean < median < mode



More on graphing visualising data l.jpg
More on Graphing(Visualising Data)


Edward tufte l.jpg
Edward Tufte

Graphs:

Reveal data

Communicate complex ideas with clarity, precision, and efficiency


Tufte s guidelines 1 l.jpg
Tufte's Guidelines 1

Show the data

Substance rather than method

Avoid distortion

Present many numbers in a small space

Make large data sets coherent


Tufte s guidelines 2 l.jpg
Tufte's Guidelines 2

  • Encourage eye to make comparisons

  • Reveal data at several levels

  • Purpose: Description, exploration, tabulation, decoration

  • Closely integrated with statistical and verbal descriptions


Tufte s graphical integrity 1 l.jpg
Tufte’s Graphical Integrity 1

Some lapses intentional, some not

Lie Factor = size of effect in graph size of effect in data

Misleading uses of area

Misleading uses of perspective

Leaving out important context

Lack of taste and aesthetics


Tufte s graphical integrity 2 l.jpg
Tufte's Graphical Integrity 2

Trade-off between amount of information, simplicity, and accuracy

“It is often hard to judge what users will find intuitive and how [a visualization] will support a particular task” (Tweedie et al)









Distortive variations in scale l.jpg
Distortive Variations in Scale California 1964-1990


Distortive variations in scale67 l.jpg
Distortive Variations in Scale California 1964-1990


Restricted scales l.jpg
Restricted Scales California 1964-1990


Restricted scales69 l.jpg
Restricted Scales California 1964-1990



People histogram l.jpg
People Histogram Variables (Bivariate)


Separate graphs l.jpg
Separate Graphs Variables (Bivariate)



Clustered bar chart l.jpg
Clustered bar chart Variables (Multivariate)


19 th vs 20 th century causes of death l.jpg
19 Variables (Multivariate)th vs. 20th century causes of death


Demographic distribution of age l.jpg
Demographic distribution of age Variables (Multivariate)


Where partners first met l.jpg
Where partners first met Variables (Multivariate)


Line graph l.jpg
Line graph Variables (Multivariate)


Line graph79 l.jpg
Line graph Variables (Multivariate)


Causes of mortality l.jpg
Causes of Mortality Variables (Multivariate)


Bivariate normality l.jpg
Bivariate Normality Variables (Multivariate)


Exampes of more complex graphs l.jpg
Exampes of More Complex Graphs Variables (Multivariate)


Sea temperature l.jpg
Sea Temperature Variables (Multivariate)


Sea temperature84 l.jpg
Sea Temperature Variables (Multivariate)



Links l.jpg
Links Variables (Multivariate)

  • Presenting Data – Statistics Glossary v1.1 - http://www.cas.lancs.ac.uk/glossary_v1.1/presdata.html

  • A Periodic Table of Visualisation Methods - http://www.visual-literacy.org/periodic_table/periodic_table.html

  • Gallery of Data Visualization

  • Univariate Data Analysis – The Best & Worst of Statistical Graphs - http://www.csulb.edu/~msaintg/ppa696/696uni.htm

  • Pitfalls of Data Analysis – http://www.vims.edu/~david/pitfalls/pitfalls.htm

  • Statistics for the Life Sciences –http://www.math.sfu.ca/~cschwarz/Stat-301/Handouts/Handouts.html


ad
  • Login