1 / 14

Geo-Distributed Cloud Computing

Geo-Distributed Cloud Computing. Motivation. Data intensive multi-sensor computations. Bandwidth trumps computing. Need Data Reductions Many Emerging Problems. Proposal. Cloud to distribute data geographically Map Reduce paradigm as a programming model

gotzon
Download Presentation

Geo-Distributed Cloud Computing

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Geo-Distributed Cloud Computing

  2. Motivation • Data intensive multi-sensor computations. • Bandwidth trumps computing. • Need Data Reductions • Many Emerging Problems

  3. Proposal • Cloud to distribute data geographically • Map Reduce paradigm as a programming model • Exhibits properties that scale well over long distances • Provide heterogeneous multicore processing environments • Programming strategies for geo-distributed software • Ad-hoc Membership

  4. Strategy 1 • Single Global Sun Grid Engine installation • Single Global Hadoop Distributed Filesystem installation • Jobs submitted to SGE are distributed among nodes provided by various institutions • SSH login

  5. Institution A Institution B Institution C Compute Nodes Compute Nodes Compute Nodes SGE Master Layout

  6. (Dis)Advantages • Advantages • Software already exists • Simple installation • Disadvantages • Requires central authority • Might not maximize data locality

  7. Strategy 2 • Separate installations at each institution • Controller connects installations • Initiates separate Map/Reduce for spatially local data • Data transferred from one system as needed

  8. Institution A Institution B Institution C Compute Nodes Compute Nodes Compute Nodes Cloud Service Cloud Service Cloud Service Layout

  9. Advantages • Institutions in full control over own nodes • Independent Map/Reduce operations at each institution • Reduced bandwidth between sites • Controller provides simpler programming

  10. Conclusions • Isolated compute resources for local cloud machinery  • UCSD 8 QS20 • UMBC 8 JS20 • Begun installation of SGE and Hadoop • By mid-June, cloud with SGE and Hadoop operational on isolated resources

  11. Thank You

More Related