1 / 16

How Big is Big Data? And NoSQL Databases

How Big is Big Data? And NoSQL Databases . University of California, Berkeley School of Information IS 257: Database Management. Announcement. Change to presentations Presentation is now OPTIONAL for extra credit Now will be during finals week Monday Dec. 16 th from 1-4

burton
Download Presentation

How Big is Big Data? And NoSQL Databases

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. How Big is Big Data? And NoSQL Databases University of California, Berkeley School of Information IS 257: Database Management

  2. Announcement • Change to presentations • Presentation is now OPTIONALfor extra credit • Now will be during finals week • Monday Dec. 16th from 1-4 • Final report also due then (or before)

  3. Review Big Data (introduction) More on Big Data and what it means RDBMS vsNoSQL databases Lecture Outline

  4. Big Data and Databases • “640K ought to be enough for anybody.” • Attributed to Bill Gates, 1981

  5. The Grid: On-Demand Access to Electricity Quality, economies of scale Time Source: Ian Foster

  6. Big Data and Databases • We have already mentioned some Big Data • The Walmart Data Warehouse • Information collected by Amazon on users and sales and used to make recommendations • Most modern web-based companies capture EVERYTHING that their customers do • Does that go into a Warehouse or someplace else?

  7. Why the Grid?(1) Revolution in Science • Pre-Internet • Theorize &/or experiment, aloneor in small teams; publish paper • Post-Internet • Construct and mine large databases of observational or simulation data • Develop simulations & analyses • Access specialized devices remotely • Exchange information within distributed multidisciplinary teams Source: Ian Foster

  8. Why the Grid?(2) Revolution in Business • Pre-Internet • Central data processing facility • Post-Internet • Enterprise computing is highly distributed, heterogeneous, inter-enterprise (B2B) • Business processes increasingly computing- & data-rich • Outsourcing becomes feasible => service providers of various sorts Source: Ian Foster

  9. How Big is Big Data • How big is big? 1 Kilobyte 1,000 bits/byte 1 megabyte 1,000,000 1 gigabyte 1,000,000,000 1 terabyte 1,000,000,000,000 1 petabyte 1,000,000,000,000,000 1 exabyte 1,000,000,000,000,000,000 1 zettabyte 1,000,000,000,000,000,000,000

  10. What is Big Data? • Ran across some interesting slides from a decade ago that already frame the problem and did a fair job of predicting where we are today • Slides by Jim Gray and Tony Hey : “In Search of Petabyte Databases” ca. 2001

  11. Summary from Gray & Hey • DBs own the sweet-spot: • 1GB to 100TB • Big data is not in databases • HPTS crowd is not really high performance storage (BIG DATA) • Cost of storage is people: • Performance goal:1 Admin per PB • From Jim Gray and Tony Hey : “In Search of Petabyte Databases” ca. 2001

  12. Why People? One row of one of Google’s data centers

  13. What counts as Big Data • More OPS (other people’s slides) • Taming the Big Data Fire Hose • by John Hugg, VoltDB

  14. RDBMS vsNoSQL • From a course at Dalhousie Univ. in Canada (including slides from Keith W. Hare “A Comparison of SQL and NoSQLDatabases”)

  15. You can buy Big Data… • Oracle will be happy to sell you systems (hardware and software) to manage your exabytes… Oracle Exadata Database Machine X3-8

  16. And NoSQL too… • Oracle Big Data Appliance • With Oracle NoSQLDatabase (BerkeleyDB)

More Related