Research Challenges in Recommender Systems / Survey of the Netflix Contest

By Rachsuda Jiamthapthaksin 10/09/2009. Research Challenges in Recommender Systems / Survey of the Netflix Contest. Edited by Christoph F. Eick. Recommender Systems (RSs). Goal: To help users find items that they are likely to appreciate (and buy/lease) in huge catalogues.


Presentation Transcript


  1. By Rachsuda Jiamthapthaksin 10/09/2009 Research Challenges in Recommender Systems / Survey of the Netflix Contest Edited by Christoph F. Eick

  2. Recommender Systems (RSs) • Goal: To help users find items that they are likely to appreciate (and buy/lease) in huge catalogues.

  3. The recommendation problem • Let • $C$ be the set of all users, • $S$ be the set of all possible items that can be recommended, and • $u$ be a utility function that measures the usefulness of item $s$ to user $c$, $u: C \times S \to R$. • For each user $c \in C$, find the item $s'_c \in S$ that maximizes the user's utility: $\forall c \in C,\ s'_c = \arg\max_{s \in S} u(c, s)$ (1).
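As a minimal sketch of equation (1), the snippet below picks, for each user, the item with the highest utility. The function name `recommend`, the toy users and items, and the utility values are hypothetical and are not part of the original slides; it also assumes the utility of every (user, item) pair is already known, which is not the case in a real recommender.

```python
from typing import Callable, Dict, Hashable, Iterable


def recommend(
    users: Iterable[Hashable],
    items: Iterable[Hashable],
    utility: Callable[[Hashable, Hashable], float],
) -> Dict[Hashable, Hashable]:
    """For each user c, return the item s in S that maximizes utility(c, s)."""
    item_list = list(items)  # materialize S so it can be scanned once per user
    return {c: max(item_list, key=lambda s: utility(c, s)) for c in users}


# Hypothetical toy utilities u(c, s); the values are made up for illustration.
toy_utility = {
    ("alice", "film_a"): 2.0,
    ("alice", "film_b"): 4.5,
    ("alice", "film_c"): 3.0,
    ("bob", "film_a"): 1.0,
    ("bob", "film_b"): 2.5,
    ("bob", "film_c"): 5.0,
}

best = recommend(
    users=["alice", "bob"],
    items=["film_a", "film_b", "film_c"],
    utility=lambda c, s: toy_utility[(c, s)],
)
print(best)
```

In practice most utilities are unknown and have to be estimated first, which is the prediction task the contest addresses.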

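  4. Netflix Recommender System Scenario • [Figure not preserved; its legend defines a symbol, lost in extraction, as "unknown".] • Remark: Typically, there are a lot of these "unknown" symbols.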

  5. Survey of the Netflix Contest • The Netflix Prize competition offers a grand prize of US $1M for an algorithm that is 10% more accurate than "Cinematch", the system Netflix uses to predict customers' movie preferences. • The best score will win a $50K Progress Prize.

  6. The Basic Structure of the Contest • Provide 100 million ratings that 480K anonymous customers gave to 17K movies. • Withhold 3M of the most recent ratings and ask the contestants to predict them. • Assess each contestant's 3M predictions by comparing them with the actual ratings. • Evaluation metric: the root mean squared error (RMSE).
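The evaluation metric can be sketched as follows: RMSE is the square root of the mean of the squared differences between predicted and actual ratings, so lower is better. This is an illustrative Python version, not the `rmse.pl` script shipped with the dataset; the function name `rmse` and the sample ratings are assumptions made up for the example.

```python
import math
from typing import Sequence


def rmse(predictions: Sequence[float], actuals: Sequence[float]) -> float:
    """Root mean squared error between predicted and actual ratings."""
    if len(predictions) != len(actuals):
        raise ValueError("predictions and actuals must have the same length")
    if not predictions:
        raise ValueError("at least one rating is required")
    # Sum of squared differences, averaged, then square-rooted.
    squared_error = sum((p - a) ** 2 for p, a in zip(predictions, actuals))
    return math.sqrt(squared_error / len(predictions))


# Hypothetical predictions against actual 1-to-5 star ratings.
predicted = [3.5, 4.0, 2.0, 5.0]
actual = [4, 4, 1, 5]
score = rmse(predicted, actual)
print(f"RMSE = {score:.4f}")
```

Because the errors are squared before averaging, a few large misses raise the score more than many small ones.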

  7. Netflix Dataset (1) • The data were collected between October, 1998 and December, 2005 and reflect the distribution of all ratings received during this period. • The ratings are on a scale from 1 to 5 (integral) stars. • The date of each rating and the title and year of release for each movie id are also provided.

  8. Netflix Dataset (2) • training_set.tar (2 GB) • movie_titles.txt (575 KB) • qualifying.txt (51,224 KB) • probe.txt (10,530 KB) • rmse.pl (1 KB)
