Protecting VMware Data Off-site

Protecting VMware Data Off-site. “Tape vs. Cloud Options” Bill Evans, Arkeia Software “Case Study from University of Chicago” Tom Indelli, Senior System Administrator. Data Loss and Data Protection. Causes of Data Loss Strategies for Data Protection Replication Backup.

Presentation Transcript


  1. Protecting VMware Data Off-site “Tape vs. Cloud Options” Bill Evans, Arkeia Software “Case Study from University of Chicago” Tom Indelli, Senior System Administrator

  2. Data Loss and Data Protection • Causes of Data Loss • Strategies for Data Protection • Replication • Backup

  3. Causes of Data Loss Source: Kroll Ontrack Inc., 2011

  4. Data Protection Strategy #1: Replication • Replication • Additional copy of current data (files, images, objects) • Replication Options • Disk (or RAID) • Synchronous • Expensive: data replicated before transaction completes • Asynchronous • Less expensive: data replication lags behind • Replication Benefits • Offsite data storage • Immediate failover • Replication faithfully copies all errors; over 50% of data loss is unprotected [Chart labels: Protected, Unprotected]

  5. Data Protection Strategy #2: Backup • Backup • Multiple Point-in-time “Restore Points” • Backup Options • Tape or Disk or Cloud • Hourly, Daily, Weekly, Monthly, Quarterly, Yearly • Backup Benefits • Recovery to time in the past • Offsite data storage

  6. Backup Requirements • Secure • Off-site • Off-line • Frequent Restore Points • Restore Point Objectives (RPO) to minimize data loss [Timeline: -1, -2, -7, -14, -30, -60, -180, -365 days] • Rapid Restore Time • Restore Time Objectives (RTO) to minimize down-time [Timeline: +4, +3, +2, +1, +0.5, +0.1 hours]

  7. Off-site Storage • How to choose? • Costs • Fixed • Variable • Backup window • Time-to-restore (RTO) • Reliability • Convenience [Diagram: Backup Agents, Backup Servers, WAN]

  8. Off-site Storage [Diagram: Backup Agents and Backup Servers; copy is moved offsite]

  9. Off-site Storage Strategies • Why is Off-site Storage Important? • Loss, theft, site destruction • Strategies • Tapes on trucks • Replication to the cloud • Costs [Chart: Cloud vs. Tape; axis: Data Volume Protected]

  10. University of Chicago: VMware Backup Strategy Tom Indelli Senior Systems Administrator University of Chicago

  11. Organization • University of Chicago • Physical Sciences Division • Activities • Theoretical Chemistry (e.g. Molecular Dynamics) • Theoretical Physics • Science Education

  12. Deployment #1: Data & Servers • Analyses are computationally intensive; physical platforms deliver best performance • Data • Theoretical Chemistry & Molecular Dynamics • Simulations of atoms using “trajectory files” • 20,000 atoms to 100,000 atoms • Jobs run up to 48 hours • Simulate less than 50 nanoseconds of interactions • Most operation is “batch”, performed on 100-node compute clusters • Protected Servers • 2 Red Hat and 1 MacOS file servers • File servers hold inputs to and results of simulations • 44TB source data

  13. Deployment #1: Data Protection • Compression occurs in Arkeia agent, before backups are moved on the LAN [Diagram: Red Hat EL 6.0, Red Hat EL 6.0, MacOS X; Arkeia Backup Server v9 on RHEL; 2Gbps LAN] • Backup Server Solution • Arkeia Network Backup v9 on Red Hat 6.0 • 100TB disk (backup target DAS) • Backup Strategy • Backup to Disk • Weekly full, nightly incremental • Agents backed up concurrently • Offsite Strategy • None

  14. Deployment #2: Data & Servers • Data • Web servers • Management software & data • Support software & data • Uninterrupted operation is critical • Protected Servers • 2 ESXi 4.1 hosts with vCenter 4.1 (facilitates upgrades) • 15 - 20 virtual machines • 3TB source data

  15. Deployment #2: Data Protection • Compression occurs in Arkeia agent, before backups are moved over LAN [Diagram: VM A.1, VM A.2, VM A.3 on Hypervisor #A; VM B.1, VM B.2, ANB VM on Hypervisor #B; Arkeia Backup Server v9 on RHEL; 2Gbps LAN] • Backup Server Solution • Backup Strategy • Backup to Disk (20TB EqualLogic SAN) • Weekly full, nightly incremental • Three groups of backups performed in sequence • Replicate to Tape Library (Dell PowerVault TL-2000 with LTO4 drive) • Offsite Strategy • Tapes moved to another office

  16. Deployment #2: Backups = 19 LTO4 Cartridges

  17. Deployment #2: vStorage Usage • Backups via vCenter • Backups use Changed Block Tracking (CBT) • Full backups (“Thin full” with CBT) • Incremental backups • Restores • Perform occasional full-image restores • Have tested single-file restores
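
The idea behind Changed Block Tracking on the slide above can be pictured with a toy model. This is a conceptual sketch only: the `TrackedDisk` class and its methods are hypothetical names invented for illustration and do not correspond to the vSphere or Arkeia APIs. It shows why an incremental backup only needs the blocks written since the previous backup.

```python
class TrackedDisk:
    """Toy disk that remembers which blocks were written (hypothetical model)."""

    def __init__(self, n_blocks, block_size=4):
        self.blocks = [bytes(block_size)] * n_blocks  # all-zero blocks
        self.changed = set()                          # indices written since last backup

    def write(self, index, data):
        self.blocks[index] = data
        self.changed.add(index)

    def full_backup(self):
        # In this toy model a full backup copies every block and resets tracking.
        self.changed.clear()
        return list(self.blocks)

    def incremental_backup(self):
        # Copy only the blocks changed since the previous backup.
        delta = {i: self.blocks[i] for i in sorted(self.changed)}
        self.changed.clear()
        return delta


disk = TrackedDisk(n_blocks=8)
disk.full_backup()
disk.write(2, b"abcd")
disk.write(5, b"wxyz")
delta = disk.incremental_backup()
print(delta)  # only blocks 2 and 5 are copied
```

Because the change set is maintained at write time, the backup never has to scan the whole disk to find what changed.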

  18. Costs of Tape vs. Cloud for 18TB (does not include costs of bandwidth) • Tape • 22 LTO4 tapes (18TB) @ $30/cartridge = $660 • 1 TL-2000 = $10,000 (amortized over 3 years) • One-year cost = $4,000 + tape shuffling • Public Cloud • 18TB @ $0.125/GB/month (Amazon) = $2,300/month • One-year cost = $28,000
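
The arithmetic on this slide can be reproduced in a few lines. The prices and counts are the slide's own; the 1 TB = 1024 GB convention is an assumption, chosen because it matches the slide's rounded $2,300/month. Bandwidth and tape-shuffling labor are left out, as on the slide.

```python
# Figures from the slide; GB_PER_TB is an assumption.
TAPE_COUNT = 22
TAPE_PRICE_USD = 30
LIBRARY_PRICE_USD = 10_000
LIBRARY_AMORTIZATION_YEARS = 3
DATA_TB = 18
CLOUD_USD_PER_GB_MONTH = 0.125
GB_PER_TB = 1024


def tape_cost_one_year():
    """Media cost plus one year of the amortized tape library."""
    media = TAPE_COUNT * TAPE_PRICE_USD
    library = LIBRARY_PRICE_USD / LIBRARY_AMORTIZATION_YEARS
    return media + library


def cloud_cost_per_month():
    """Monthly storage charge for the full data set."""
    return DATA_TB * GB_PER_TB * CLOUD_USD_PER_GB_MONTH


def cloud_cost_one_year():
    return 12 * cloud_cost_per_month()


print(f"Tape, one year:  ${tape_cost_one_year():,.0f}")
print(f"Cloud, monthly:  ${cloud_cost_per_month():,.0f}")
print(f"Cloud, one year: ${cloud_cost_one_year():,.0f}")
```

Under these assumptions the results are about $3,993, $2,304, and $27,648, which the slide rounds to $4,000, $2,300, and $28,000.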

  19. Summary • UChicago has both virtual and physical environments • Physical systems are a better fit for some workloads • Want one backup solution to protect both environments • Off-site storage is required • Off-line is a bonus • vSphere Changed Block Tracking • Accelerates incremental backups • Reduces storage

  20. Thank You Tom Indelli Senior Systems Administrator tindelli@uchicago.edu

  21. Hybrid Cloud Backup • Why Hybrid? • Data Volume Limits • Cloud Infrastructure Requirements

  22. “Hybrid” Cloud Backup • Perform backup on LAN • Fast backups, fast restores • Replicate backups to cloud for safe-keeping • Secure data [Diagram: Backup Agents, Backup Server; Step 1, Step 2]

  23. “Hybrid” Cloud Backup • Full Backup • If time < one week: Over the WAN • If time > one week: Via portable media • Daily Incremental Backup • If time < 24 hours: Over the WAN • If time > 24 hours: Impossible • Incremental backup size limits cloud backup: incremental size is 0.01% to 20% of full backup [Diagram: Backup Agents, Backup Server]
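
The decision rule on this slide comes down to a transfer-time estimate. The sketch below is illustrative: the 100 Mbps uplink, the 80% link efficiency, and the function names are assumptions, not figures from the presentation. The 3TB and 44TB data sizes are taken from the two deployments described earlier.

```python
def transfer_hours(data_gb, uplink_mbps, efficiency=0.8):
    """Hours to send data_gb over a WAN link (decimal units: 1 GB = 8000 megabits)."""
    megabits = data_gb * 8 * 1000
    seconds = megabits / (uplink_mbps * efficiency)
    return seconds / 3600


def full_backup_route(full_gb, uplink_mbps):
    """Slide rule: under one week goes over the WAN, otherwise portable media."""
    one_week_hours = 7 * 24
    return "WAN" if transfer_hours(full_gb, uplink_mbps) < one_week_hours else "portable media"


def incremental_feasible(full_gb, change_rate, uplink_mbps):
    """Slide rule: a daily incremental must finish within 24 hours."""
    return transfer_hours(full_gb * change_rate, uplink_mbps) < 24


# 3TB virtual environment, assumed 100 Mbps uplink, 1% daily change.
print(full_backup_route(3000, 100), incremental_feasible(3000, 0.01, 100))
# 44TB physical environment, same link, 20% daily change.
print(full_backup_route(44000, 100), incremental_feasible(44000, 0.20, 100))
```

With these assumed numbers the 3TB environment fits over the WAN for both full and incremental backups, while the 44TB environment at a 20% change rate does not fit in a 24-hour window, which is the "maximum data-protection limit" the presentation refers to.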

  24. Cloud Strategies: Replication Window [Diagram: Backup Agents, Backup Server; Full Backup; Incremental (1%) Backup]

  25. Role of Deduplication in Backup • Shrinks Data • Reduces Storage • Shortens Backup Window • Data Scenarios • Primary Data • Secondary Data • Across/Within Files (e.g. PPT files) • Over Time (e.g. outlook.pst) • Across computers (e.g. word.exe)
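
One way to see how deduplication shrinks data across computers is a fixed-block scheme keyed by content hash. This is a minimal sketch, not Arkeia's algorithm: the 4096-byte block size and the `dedupe` and `restore` names are assumptions made for the example.

```python
import hashlib

BLOCK_SIZE = 4096  # assumed block size for illustration


def dedupe(streams, block_size=BLOCK_SIZE):
    """Split each stream into fixed blocks and store each unique block once."""
    store = {}    # digest -> block bytes
    recipes = {}  # stream name -> ordered list of digests
    for name, data in streams.items():
        recipe = []
        for offset in range(0, len(data), block_size):
            block = data[offset:offset + block_size]
            digest = hashlib.sha256(block).hexdigest()
            store.setdefault(digest, block)
            recipe.append(digest)
        recipes[name] = recipe
    return store, recipes


def restore(name, store, recipes):
    """Rebuild a stream from its recipe."""
    return b"".join(store[digest] for digest in recipes[name])


# The same "executable" present on two computers.
program = b"A" * 8192 + b"B" * 4096
store, recipes = dedupe({"pc1": program, "pc2": program})
print(len(store), "unique blocks stored for", 2 * len(program), "bytes of source data")
assert restore("pc2", store, recipes) == program
```

Here six source blocks reduce to two stored blocks, since the duplicate copy on the second computer and the repeated block within the file are each stored only once.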

  26. Hybrid Cloud Recovery • Storage-only vs. Storage-and-Server • File Recovery vs. Disaster Recovery

  27. Cloud Recovery Strategies • Data are Secure • Deduplicated • Compressed • Encrypted • How to Recover/Extract? [Diagram: Backup Agents, Backup Server]

  28. Cloud Recovery Strategies • How to Recover/Extract? • Restore (via big pipe) to servers in cloud • Restore (via portable media) to new location [Diagram: Backup Agents, Backup Servers]

  29. Hybrid Cloud Backup Summary • Alternative to Tape • …But Maximum Data-Protection Limit • Imposed by incremental backup size • Primary Cost of Hybrid Cloud • Bandwidth • (Then target disk) • Pay Attention to Recovery Strategy • Instantiate in Cloud • Recovery on Portable Media

  30. Arkeia Software • Company • Founded 1996; HQ in San Diego • Products • Arkeia Network Backup Suite • Backup/Recovery • Disaster Recovery • Virtual and Physical Environments • vSphere (with CBT), Hyper-V, XenServer • Linux, Windows…AIX, BSD, HP-UX, MacOS, Netware, Solaris (200+ platforms) • Software, Appliances, Virtual Appliances • Disk, Tape, Cloud • Customers • 7,000 mid-market customers in 70 countries • Enterprises, Governments, Service Providers

  31. Please Contact Me • Bill Evans • Arkeia Software • bill.evans@arkeia.com • Resources for last-mile internet for data centers and enterprises • Manon Buettner, Principal • Nuvalo • manon@nuvalo.com • +1 408-605-6455 • Jo Peterson, Regional Manager • Teleproviders • jo@teleproviders.com • +1 949-268-2633

  32. Detail 1 of 3: “Incrementals Forever” Traditional Backup Policy • How Does it Work? • Initially, one full backup • Subsequently, “incrementals forever” • How to recover disk space at target? • “Synthetic backups” [Timeline: t; Day 0 1 2 3 4 5 6 7 8 9 …]
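
A synthetic backup, as commonly understood, folds the oldest incrementals into the existing full backup at the target so that their space can be reclaimed without re-reading the source. The sketch below models each backup as a dictionary of path to content, with `None` marking a deletion; this representation and the `synthesize_full` name are assumptions for illustration, not the product's on-disk format.

```python
def synthesize_full(full, incrementals):
    """Merge a full backup with later incrementals, oldest first."""
    synthetic = dict(full)
    for incremental in incrementals:
        for path, content in incremental.items():
            if content is None:
                synthetic.pop(path, None)  # file was deleted on the source
            else:
                synthetic[path] = content  # file was added or changed
    return synthetic


day0_full = {"/data/a.txt": "v0", "/data/b.txt": "v0"}
day1_inc = {"/data/a.txt": "v1"}
day2_inc = {"/data/b.txt": None, "/data/c.txt": "v2"}

new_full = synthesize_full(day0_full, [day1_inc, day2_inc])
print(new_full)
```

Once the synthetic full for day 2 exists, the day 0 full and the day 1 and day 2 incrementals can be expired, at the cost of losing those earlier restore points.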

  33. Detail 2 of 3: Multiple Sources • Deduplication consolidation • Static storage cannot resolve duplicates • Deduplication vs. Encryption • Dedupe → Compress → Encrypt [Diagram: Backup Agents, Backup Servers]

  34. Detail 3 of 3: WAN Bandwidth • Data Compression • File-grain compression • Examples: LZ-77, JPEG, MPEG • Inter-file deduplication • Examples: SIS, fixed-block, variable-block, progressive-dedupe • TCP Optimization? • Warnings: • Latency Optimization ≠ Bandwidth Optimization • No compression of compressed or random data
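
The warning about random data is easy to check with Python's standard `zlib` module. In this sketch the sample inputs and the `compression_ratio` helper are made up for the demonstration; random bytes stand in for any high-entropy input, such as encrypted data.

```python
import os
import zlib


def compression_ratio(data):
    """Compressed size divided by original size (lower is better)."""
    return len(zlib.compress(data, 9)) / len(data)


repetitive = b"backup restore point " * 5000  # highly redundant input
random_like = os.urandom(len(repetitive))     # high-entropy input of the same size

print(f"repetitive data: {compression_ratio(repetitive):.4f}")
print(f"random data:     {compression_ratio(random_like):.4f}")
```

The repetitive input shrinks to a small fraction of its size, while the random input stays at roughly its original size or slightly larger. This is also consistent with putting encryption last in a dedupe, compress, encrypt pipeline, since well-encrypted output looks random.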

  35. Data Loss Universe
