1 / 31

All That Movies

All That Movies. BIFE Presentation. Agenda. Brief Introduction Dashboard Showcase Other Interesting Discoveries Technical Design Team Work. Brief Introduction. Cereal Killers. Jiang Yongli. Peng Cen. Xia Bing. Motivation. What we want to do is all about movies

siusan
Download Presentation

All That Movies

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. All That Movies BIFE Presentation

  2. Agenda • Brief Introduction • Dashboard Showcase • Other Interesting Discoveries • Technical Design • Team Work

  3. Brief Introduction

  4. Cereal Killers Jiang Yongli Peng Cen Xia Bing

  5. Motivation • What we want to do is all about movies • Analyze movies and movie business from different perspectives • Give suggestions for different kinds of people

  6. Target Customers • Movie fans • Movie Journalists • Movie companies • Directors and Actors/Actresses • Investors

  7. Data source • IMDB website • 11,000+ movies from 1898 to present • 81,000+ actors/actresses • 4,700+ directors • 11,000+ movie companies

  8. Dashboard Showcase

  9. Other Interesting Discoveries

  10. Sex Battle Males’ Favorite Females’ Favorite

  11. Our Parents’ Favorite

  12. And many more… • Directors born in Maryland are fond of Comedy movies (36/55 movies) and have no interest in Animation movies (0/55 movies) • Directors born in Rome love Horror movies (40/83 movies) and hate Romance movies (4/83 movies) • …

  13. Technical Design

  14. Working Progress

  15. ETL Processing • Data scale: • 131,000+ web pages • Crawler: • Simulate HTTP request • Extraction: • XPath + Regular Expressions • Save to DB: • ODBC + SQL

  16. Logical Data Model • Time Hierarchy Year Month of Year Quarter Month Day

  17. Logical Data Model (continued) • Geography Hierarchy Continent Country Language State City Movie

  18. Logical Data Model (continued) • Production Hierarchy Birth Country Birth Date Gender Performer Director Movie Company Genres

  19. Data Warehouse Schema • 16 Look Up Tables

  20. Data Warehouse Schema (continued) • 2 Fact Tables

  21. Data Warehouse Schema (continued) • 6 Relationship Tables • MOVIE_DIRECTOR • MOVIE_PERFORMER • MOVIE_GENRES • MOVIE_COUNTRY • MOVIE_LANG • MOVIE_COMPANY

  22. Project/Report/Dashboard Design • 25 Tables including one Data Mart table • 21 Attributes • 53 Facts • 3 User Hierarchies • 72 Metrics • Used smart metric, level metric, evaluation order, derived metric, view filter, conditional metric, report as filter, etc. in our reports • Widgets used: Interactive Stack Graph, Interactive Bubble Graph, Media, Data Cloud, Heat Map, Time Series Sliders, etc. • Miscellaneous selectors

  23. Problems We Met • Media widget automatically shrinks image whenever we resize it We set filling color the same as the border color and put it in another container with same filling color to make this not obvious.

  24. Problems We Met • We cannot use dynamic text for different attributes with the same name(e.g. Director’s Birth Date and Performer’s Birth Date), even if we use {[dataset name]}:{[object name]}. • We use grid to show these attributes and using formatting tricks.

  25. Problems We Met • View Filter on most Metrics is not valid in dashboard. • We try to make sophisticated level metric and report filter to solve the problem.

  26. Problems We Met • Flash mode always timeout when loading after we merged all dashboards together. • We divided our dashboards into two.

  27. Problems We Met • And many more problems… • And many more solutions…

  28. Team Work

  29. Cooperation • Face to face discussion • Communicator • Email • Shared Folders • Shared Intelligence Server • Everyone took part in each section more or less

  30. Work Foucuses • Xia Bing: • Team leader, ETL process, recommended directors and performers dashboard and related reports • Jiang Yongli: • Warehouse design, project building, movie business dashboard and related reports • Peng Cen: • Logical model design, top and bottom movies dashboard and related reports, dashboard formatting

  31. Thanks Do Not Imitate! We Are Professional!

More Related