1 / 11

Big Data

Big Data. Internet2 Day Presentation Dr. Greg Newby, ARSC http://people.arsc.edu/~newby. Mandelbrot Set. Big data are getting Bigger! Networks aren’t keeping up Even small users need big data The biggest data are really, really big!

estansberry
Download Presentation

Big Data

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Big Data Internet2 Day Presentation Dr. Greg Newby, ARSC http://people.arsc.edu/~newby Mandelbrot Set Big Data at Internet2 Day 2006

  2. Big data are getting Bigger! Networks aren’t keeping up Even small users need big data The biggest data are really, really big! Big data projects, big systems, and big networking capability are requirements for Alaskans to benefit Big Data in a Nutshell Fiber Optic Cable Bundle Big Data at Internet2 Day 2006

  3. Basic Lingo Byte (8 bits each): one per character (this sentence has 68 bytes) Kilobyte = 1000 bytes. A text email. Megabyte = 1million bytes. A digital photo. Gigabyte = 1billion bytes. A 30 minute DVD movie Terabyte = 1trillion bytes. 1500 music CDs Petabyte = 1quadrilion bytes. 8 days of UAF’s Internet2 traffic at 100% utilization. 41 minutes of the largest 40Gb Internet2 connection (256 times faster!) Exabyte = 1quintillion bytes. PetaFLOP computers will produce this much output! Big Data at Internet2 Day 2006

  4. Sample “Big Data” Projects All are generating petabytes of data All use high-speed Internet to share data, distribute computation, provide end-user access, and as part of fundamental operations All have national & international collaborations National Virtual Observatory: Distributed astronomy Earth System Grid: Computational earth science Large Hadron Collider: High-energy physics 1992 NSFNet Data Rates Big Data at Internet2 Day 2006

  5. National Virtual Observatory Ongoing sky survey data (visible; radio; x-ray) from many observatories Shared data & processing Transparent access via portals NVO Architecture Big Data at Internet2 Day 2006

  6. Earth System Grid Many large computers running simulations Sharing output, which can be quite big! For computational grids, more bandwidth & lower latency among systems is critical (Where is AK on this map?) ESG Structure Big Data at Internet2 Day 2006

  7. High Energy Physics The CERN Large Hadron Collider (LHC) will become operational in 2007 100 times the collisions of Argonne’s accelerator (the world’s largest today), producing 100 times the data rate per experiment Several PB/year, which requires extensive post-processing to be useful LHC Assembly Big Data at Internet2 Day 2006

  8. Big Data in Alaska: for Science Climate study; Oceanography; Weather ARSC’s Augustine Forecast Big Data at Internet2 Day 2006

  9. Remote sensing with the new hyperspectral satellite 220 spectral bands, versus ~20 previously; higher spatial resolution, too • December 19, 2000 - EO-1 First Light Images • An image of Alaska (right) taken by EO-1's Advanced Land Imager (ALI) in the panchromatic (PAN) band compared with an image (left) taken by Landsat 7 under nearly identical lighting and surface conditions. http://www.gsfc.nasa.gov/topstory/2002/20020624eo1.html Big Data at Internet2 Day 2006

  10. Coming Soon:Big Data in Alaska for …. Medicine: Remote imagery & diagnosis; medical collaboration K-12 education: live remote scientific instruments (AlaskaScope); remote instruction; larger virtual classrooms Government: eGovernment; govdocs; communication to government leaders Entertainment: interaction & entertainment Industry: distributed organizations; consulting; internationalization Carbon Nanotubes, via WikiPedia Big Data at Internet2 Day 2006

  11. More bandwidth needed to participate in Big Data activities Continued large computer systems, informed personnel, scientific expertise, leadership commitment More bandwidth! Fill in the “bandwidth map” for Alaska communities More bandwidth! What’s next for Big Data in Alaska? WCI Fibre Route Big Data at Internet2 Day 2006

More Related