The Wisdom of Crowds in the Aggregation of Rankings

The Wisdom of Crowds in the Aggregation of Rankings Mark Steyvers Department of Cognitive Sciences University of California, Irvine Joint work with: Michael Lee, Brent Miller, Pernille Hemmer

Rank aggregation problem • Goal is to combine many different rank orderings on the same set of items in order to obtain a “better” ordering • Example applications • Combining voters rankings: social choice theory • Information retrieval and meta-search* • *e.g. Lebanon & Mao (2008); Klementiev, Roth et al. (2008; 2009), Dwork et al. (2001)

Example ranking problem in our research What is the correct chronological order? Abraham Lincoln Ulysses S. Grant time Ulysses S. Grant Rutherford B. Hayes Rutherford B. Hayes James Garfield Abraham Lincoln Andrew Johnson James Garfield Andrew Johnson

Aggregating ranking data ground truth group answer ? A B C D = A B C D Aggregation Algorithm A D B C D A B C B A D C A C B D A B D C

Generative Approach latent truth ? ? ? ? Generative Model A D B C D A B C B A D C A C B D A B D C

Wisdom of crowds phenomenon • Aggregating over individuals often leads to an estimate that is among the best individual estimates (or sometimes better) Galtons Ox (1907): Median of individual weight estimates came close to true answer

Approach • No communication between individuals • There is always a true answer (ground truth) • ground truth only used in evaluation • Unsupervised weighting of individuals* • exploit relationship between expertise and consensus • experts tend to be closer to the truth and therefore reach more similar judgments • Incorporate prior knowledge about latent truth • discount a priori bad rankings • * Klementiev, Roth et al. (2008, 2009); Dani, Madani, Pennock et al. (2006). Bayesian truth serum (Prelec et al., 2004); Cultural Consensus Theory (Batchelder and Romney, 1986)

Overview of talk • General knowledge tasks • reconstructing order of US presidents • Thurstonian models • Sports prediction • forecasting NBA and NCAA outcomes • Thurstonian models • Episodic memory • reconstructing order of personally experienced events • Mallows model

Experiment: 26 individuals order all 44 US presidents

Measuring performance Kendall’s Tau: The number of adjacent pair-wise swaps = 1 = 1+1 = 2 Ordering by Individual A B E C D A B E C D E C D A B C D E A B True Order A B C D E

Empirical Results (random guessing) t

Unsupervised models for ranking data • Classic models: • Thurstone (1927) • Mallows (1957); Fligner and Verducci, 1986 • Diaconis (1989) • Voting methods: e.g. Borda count (1770) • We will focus on Thurstonian and Mallows models • implemented as graphical models • MCMC inference Many models were developed for preference rankings and voting situations  no known ground truth

Thurstonian Model A. George Washington B. James Madison C. Andrew Jackson Each item has a true coordinate on some dimension

Thurstonian Model A. George Washington B. James Madison C. Andrew Jackson … but there is noise because of encoding errors

Thurstonian Model A. George Washington B. James Madison C. Andrew Jackson A B C Each persons mental encoding is based on a single sample from each distribution

Thurstonian Model A. George Washington B. James Madison C. Andrew Jackson A A < C < B B C The observed ordering is based on the ordering of the samples

Thurstonian Model A. George Washington B. James Madison C. Andrew Jackson A A < B < C B C The observed ordering is based on the ordering of the samples

Thurstonian Model A. George Washington B. James Madison C. Andrew Jackson Important assumption: across individuals, variance can vary but not the means

Graphical Model of Extended Thurstonian Model Latent truth Expertise of individual Mental samples Observed ordering j individuals

Inferred Distributions for 44 US Presidents George Washington (1) John Adams (2) Thomas Jefferson (3) James Madison (4) James Monroe (6) John Quincy Adams (5) Andrew Jackson (7) Martin Van Buren (8) William Henry Harrison (21) John Tyler (10) James Knox Polk (18) Zachary Taylor (16) Millard Fillmore (11) Franklin Pierce (19) James Buchanan (13) Abraham Lincoln (9) Andrew Johnson (12) Ulysses S. Grant (17) Rutherford B. Hayes (20) James Garfield (22) Chester Arthur (15) Grover Cleveland 1 (23) Benjamin Harrison (14) Grover Cleveland 2 (25) William McKinley (24) Theodore Roosevelt (29) William Howard Taft (27) Woodrow Wilson (30) Warren Harding (26) Calvin Coolidge (28) Herbert Hoover (31) Franklin D. Roosevelt (32) Harry S. Truman (33) Dwight Eisenhower (34) John F. Kennedy (37) Lyndon B. Johnson (36) Richard Nixon (39) Gerald Ford (35) James Carter (38) Ronald Reagan (40) George H.W. Bush (41) William Clinton (42) George W. Bush (43) Barack Obama (44) error bars = median and minimum sigma

Calibration of individuals t individual t distance to ground truth s inferred noise level for each individual

Wisdom of crowds effect t

Heuristic Models • Many heuristic methods from voting theory • E.g., Borda count method • Suppose we have 10 items • assign a count of 10 to first item, 9 for second item, etc • add counts over individuals • order items by the Borda count • i.e., rank by average rank across people

Model Comparison t Borda

Other ordering tasks Ten Commandments Ten Amendments

Overview of talk • General knowledge tasks • reconstructing order of US presidents • Sports prediction • forecasting NBA and NCAA outcomes • Episodic memory • reconstructing order of personally experienced events • New directions

Human forecasting experiment • Forecast end-of-season rankings for 15 NBA teams • Eastern conference • Western conference • Participants were college undergraduates • heterogeneous population regarding basketball expertise • 172 individuals for Eastern conference • 156 individuals for Western conference • Experiment conducted Feb 2010 • teams have played about a bit over half of games in regular season

Model predictions for Eastern conference Borda Boston Cleveland Orlando Miami Detroit Chicago Philadelphia Atlanta New York New Jersey Indiana Washington Toronto Charlotte Milwaukee Actual outcome Cleveland Orlando Atlanta Boston Miami Milwaukee Charlotte Chicago Toronto Indiana New York Detroit Philadelphia Washington New Jersey Thurstonian Model Cleveland Boston Orlando Miami Atlanta Chicago Detroit Charlotte Toronto Philadelphia Washington Indiana New York Milwaukee New Jersey

East 73% t 93% West t 87% 94%

Calibration Results East West t t s s

Heuristics: who will win more games? Chicago Bulls Charlotte Bobcats vs Won 0 championships Team in existence for 6 years Won 6 championships Team in existence for 44 years Related to work on “fast and frugal heuristics” by Gigerenzer et al.

Heuristic ranking by #championships won Actual outcome Cleveland Orlando Atlanta Boston Miami Milwaukee Charlotte Chicago Toronto Indiana New York Detroit Philadelphia Washington New Jersey #championships Boston Chicago Philadelphia Detroit Indiana New York New Jersey Atlanta Washington Milwaukee Miami Orlando Cleveland Toronto Charlotte

Informative Priors on Expertise • Individuals who closely follow heuristic orderings are probably not experts • Set hyperparameters of variance prior based on distance to heuristic ordering prior for individual who closely follows heuristic ordering s

Graphical Model j individuals

East 73% 93% t 96% West t 87% 94% 96%

Forecasting NCAA tournament (March Madness) • 64 US college basketball teams are placed in a set of four seeded brackets, and play an elimination tournament. • Midwest bracket:

Data • Predictions from 16,718 Yahoo users • Each individual predicts the winner of all games • We use the predictions for the first four rounds (60 games total) • Two scoring systems • Number of correct predictions • Points: • 1 point per correct winner in 1st round • 2 points in 2nd • 4 points in 3rd • 8 points in 4rd

Data and Results of Heuristic Strategies majority rule 73% priorseeding 61% Obama 83% #correct predictions majority rule 71% priorseeding 66% Obama 47% points individuals

Thurstonian Model Team A Team B Team C • Each team has a mean on a single “strength” dimension • Each person has single variance

Thurstonian Model Team A Team B Team C A B wins over A B • The probability a person will choose team A over team B is the probability their strength for team A will be sampled above team B

Thurstonian Model Team A Team B Team C B C wins over B C • The probability a person will choose team A over team B is the probability their strength for team A will be sampled above team B

Modeling Results Thurstonian modelinform. priors 90% Thurst model 83% majority rule 73% priorseeding 61% #correct predictions Thurst. modelinform. priors 81% Thurst. model 78% majority rule 71% priorseeding 66% points individuals

Overview of talk • General knowledge tasks • reconstructing order of US presidents • Sports prediction • forecasting NBA and NCAA outcomes • Episodic memory • reconstructing order of personally experienced events

Recollecting Order from Episodic Memory Study this sequence of images

How good is your memory? Place the images in the correct sequence (by reading order) B C D A F H E G J I

Problem • What if we have only a small number of individuals? • How can we guard against individuals with poor memory? • Idea: “smooth” the inferred group ordering with a prior

Approach • Empirically measure the prior orderings over events • Experiment: a separate group of individuals orders the images without seeing original video • Use this data to construct a prior on the group ordering

Mallows Model Kendall tau distance ω latent truth θj expertise for person j observed ranking for person j yj (memory data)

Mallows Model with an informative prior on the latent truth ωo prior on orderings θ* ω latent truth θoj θj expertise for person j observed ranking for person j yoj yj (prior knowledge data) (memory data)

Results when picking K worst “witnesses” uniform prior informative prior t Number of “witnesses” (K)

The Wisdom of Crowds in the Aggregation of Rankings

The Wisdom of Crowds in the Aggregation of Rankings

Presentation Transcript

Prediction Markets and the Wisdom of Crowds

The Statue That Didn’t Look Right and the Wisdom of Crowds

Search and the New Economy Wisdom of the Crowds

Prediction Markets: Tapping the Wisdom of Crowds

The Wisdom of Crowds

Crowd Funding- The Wisdom of Crowds - are you a part of it?

Prediction Markets and the Wisdom of the Crowds

The Wisdom of crowds and stupidity of herds Richard Zeckhauser Harvard University

The Wisdom of Crowds: Can Online Clinician Engagement Impact Health Technology Assessment?

Wisdom of the Crowds – What to make of web-based sentiment?

The Wisdom of Crowds in the Aggregation of Rankings

The Wisdom of

Wisdom of Crowds and Rank Aggregation

Human computation, and the wisdom of crowds

The Wisdom of Proverbs

HEPS challenges the wisdom of the crowds: the PEAK-Box Game. Try it yourself!

Quality of rankings in the spotlight

Sustainability and the wisdom of crowds

Prediction Markets and the Wisdom of Crowds

Prediction Markets and the Wisdom of the Crowds

The Wisdom of Crowds

The Aggravation of Aggregation?