Statistical Argument and Safe Designs for Researchers

Quantitative Methods for Researchers Paul Cairns paul.cairns@york.ac.uk

Objectives • Statistical argument • Safe designs • A whizz through some stats • Time for questions

Statistical Argument • Inference is an argument form • Prediction is essential • Alternative hypothesis • “X causes Y” • No prediction – measuring noise

Gold standard argument • Collect data • Data variation could be chance (null) • Predictthe variations (alternative) • Statistics give probabilities • Unlikely predictions “prove” your case

Implications • Must have an alt (testable) hyp • No multiple testing • No post hoc analysis • Need multiple experiments

Silver standard argument • Collect data • Data variations could be chance (null) • Are there “real” patterns in the data? • Use statistics to suggest (unlikely) patterns • Follow up findings with gold standard work

Fishing: This is bad science • Collect lots of data • DVs and IVs • Data variations could be chance • Test until a significant result appears • Report the tests that were significant • Claim the result is important

Statistical pit… • … is bottomless! • Safe designs • One (or two) IV • Two (or three) conditions • One primary DV • Other stuff is not severely tested

Choosing a test • What’s the data type? • Do you know the distribution? • Within or between • What are you looking for?

Seeing location • Boxplots • Median, IQR, • “Range” • Outliers

Distributions • Theoretical stance • Must have this! • Not inferred from samples

Parametric tests • Normal distribution • Two parameters • Null = one underlying normal distribution • Differences in location (mean)

t-test: null vs alternate

t-test • Two samples • Two means • Are means showing natural variation? • Compare difference to natural variation

Effect size • How interesting is the difference? • 2s difference in timings • Significance is not same as importance • Cohen’s d

ANOVA • Parametric • Multiple groups • Why not do pairwise comparison? • Get an F value • Follow up tests

ANOVA++ • Multiple IV • So more F values! • Within and between • Effect size, η2 • Amount of variance predicted by IV

Non-parametric tests • Unknown underlying distribution • Heterogeneity of variance • Non-interval data • Usually test location • Effect size is tricky!

Basic tests • Mann-Whitney • Wilcoxon • Kruskal-Wallis • Friedman • No accepted two-way tests

Choosing a test For your fantasy abstract, what test would you choose? Why? Would you change your design?

Questions • Specific problems • Specific tests • Other tests?

Useful Reading • Cairns, Cox, Research Methods for HCI: chaps 6 • Rowntree, Statistics Without Tears • Howell, Fundamental Statistics for the Behavioural Sciences, 6thedn. • Abelson, Statistics as Principled Argument • Silver, The Signal and the Noise

Multivariate • Multiple DV • Multivariate normal distribution • Normal no matter how you slice • MANOVA • Null = one underlying (mv) normal distribution

Issues • Sample size • Assumptions • Interpretation • Communication

Monte Carlo • Process but not distribution • Generate a really large sample • Compare to your sample • Still theoretically driven!

Example • Event = 4 heads in a row from a set of 20 flips of a coin • You have sample of 30 sets • 18 events • How likely? • Get flipping!

Statistical Argument and Safe Designs for Researchers

Statistical Argument and Safe Designs for Researchers

Presentation Transcript

Quantitative Methods for Business

Quantitative Methods

Quantitative Methods

Quantitative Methods

Quantitative Methods

Quantitative Methods

Quantitative Methods

Quantitative Methods

Quantitative Methods

Quantitative Methods for Researchers

Quantitative Methods for Researchers

Quantitative Methods for Researchers

Quantitative Methods for Researchers

Quantitative Methods

Quantitative Methods

Quantitative Methods

Quantitative Methods

QUANTITATIVE METHODS FOR MANAGERS

Quantitative Methods

Quantitative Methods

Quantitative Methods

Quantitative Methods