1 / 20

Techniques for Visualizing Massive Data Sets

Techniques for Visualizing Massive Data Sets. Leilani Battle , Mike Stonebraker. Context. Visualization System. query. result. Database. Problem. Performance Vis systems don’t scale well for big d ata Or are turning into databases Over-plotting M akes visualizations unreadable

stash
Download Presentation

Techniques for Visualizing Massive Data Sets

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Techniques for Visualizing Massive Data Sets Leilani Battle, Mike Stonebraker

  2. Context Visualization System query result Database

  3. Problem • Performance • Vis systems don’t scale well for big data • Or are turning into databases • Over-plotting • Makes visualizations unreadable • Waste of time/resources

  4. Solution: Resolution Reduction Visualization System query Resolution Reduction Layer queryplan query modified query queryplan result reduced result Database

  5. ScalaR • Scalable vis system for data exploration • Web front-end • Uses SciDB (www.scidb.org) • Visualizes query results • Performs Resolution Reduction

  6. Demo of ScalaR

  7. Array Browser • Collaboration with: • Brown: Justin DeBrabant, Stan Zdonik, UgurCetintemel • Stanford: Zhicheng Liu, Jeff Heer • Google Maps-style exploration experience • Fetches subsets of the data (aka data tiles)

  8. Array Browser Example

  9. Array Browser Architecture

  10. Demo of Array Browser

  11. Future Work: Prefetching • Goal: Reduce user-wait time by prefetching tiles • Cache tiles in the tile buffer • Need algorithms to decide what to pre-fetch

  12. User Behavior Predictor (Seer) • Learn common query sequences from user traces P P

  13. Statistical Analysis Predictor • Look for statistical similarities in tiles • Try to guess what’s important based on patterns P P P

  14. Using Multiple Predictors • Run multiple predictors (or experts) in parallel • Compare predictions to user’s actual behavior • Use predictions from best performing expert • May change over time based on user’s goals

  15. Other Challenges • Lots if interesting problems left to address • Best eviction policy for the tile buffer? • How to share data between multiple users? • More predictors?

  16. Questions?

  17. Sagittarius Gemini Dogs Cats

  18. Prefetching Experts • User behavior predictor (Seer) • Learn common query sequences from user traces • Stats analysis predictor • Look for statistical similarities in tiles • Try to guess what’s important based on patterns

More Related