1 / 22

Cloudifying Source Code Repositories: How much does it cost?

Cloudifying Source Code Repositories: How much does it cost?. Hadi Salimi, Distributed Systems Labaratory, School of Computer Engineering, Iran University of Science and Technology hsalimi@iust.ac.ir Fall 2010. What’s the Cloud Computing ???. Large scale

doria
Download Presentation

Cloudifying Source Code Repositories: How much does it cost?

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Cloudifying Source Code Repositories:How much does it cost? Hadi Salimi, Distributed Systems Labaratory, School of Computer Engineering, Iran University of Science and Technology hsalimi@iust.ac.ir Fall 2010

  2. What’s the Cloud Computing ??? • Large scale • Application-specific architectures • Developed for in-house use • Available for general usage • Inexpensive, even for small or medium scale deployments

  3. What is Revision Control? • Repository for data (source code) • All changes are tracked by date and author • Branching and merging • Why move it to the cloud? • Resilient storage • No physical server to administrate • Scale to larger communities (SourceForge)

  4. Available Tools • Subversion, revision control system • Free, open-source • Very popular • Rigid consistency model

  5. Available Tools (Cont’d) • Amazon S3, cloud storage service • Eventual consistency • Yahoo ZooKeeper, coordination service • Free, open-source

  6. Alternative solutions Cloud Computing P2P • Subversion etc. • Repository stored persistently in the cloud • One true, consistent repository exists • GIT etc. • Repository stored at every client • Many repository copies, converging eventually

  7. Outline • Costs of using cloud storage for revision control • Architecture of a simple solution • Performance evaluation

  8. How to Measure Costs • Each revision stored as two files on disk • Revision data • Revision properties • Calculate bandwidth, per-transaction, and storage costs of pushing each revision into S3 over time

  9. Storage Costs

  10. Storage Trends

  11. Outline • Costs of using cloud storage for revision control • Architecture of a simple solution • Performance evaluation

  12. Today’s architecture for source code revision control...

  13. A cloud-basedarchitecture... EC2 EC2 S3 S3 S3

  14. Rev. 31337 Two simultaneous commits… Followed by an update… Leads to data loss! EC2 EC2 Rev. 31337 Rev. 31337 S3 S3 S3

  15. EC2 EC2 S3 S3 S3

  16. Commit Process ZooKeeper

  17. Outline • Costs of using cloud storage for revision control • Architecture of a simple solution • Performance evaluation

  18. Usage Observations • Apache Foundation • 1 repository, 74 projects • Average 1.10 commits per minute • Maximum 7 commits per minute • Debian community • 506 repositories • Average 1.12 commits per minute • Maximum 6 commits per minute

  19. Results Checkouts (Reads) Commits (Writes) • Adding servers improves the user experience

  20. Conclusion • Storing source code repositories in the cloud is feasible… • …and very inexpensive • Only minor changes to existing revision control systems are necessary to robustly take advantage of cloud storage

  21. Questions or Comment

More Related