1 / 16

Data Storage Systems: A Survey

This survey explores the integration of SSDs in large-scale storage systems, including their benefits, challenges, and potential future directions. It covers different storage architectures, the use of SSDs in cloud storage, and case studies of SSD integration in HPC clusters.

rocha
Download Presentation

Data Storage Systems: A Survey

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Data Storage Systems: A Survey Abdullah Aldhamin July 29, 2013 CMPT 880: Large-Scale Multimedia Systems and Cloud Computing Course Project

  2. Motivation • Research interest in storage systems, specifically in SSDs

  3. Outline • Objective • Overview • Solid-State Drives Use Cases

  4. Objective • Storage system architectures in enterprise data centers. • What is cloud storage? • Integrating flash-based solid-state drives in large-scale storage systems.

  5. Overview • Different storage architectures in data centers: • Block I/O interface (DAS and SAN) • File I/O interface (NAS) • Is there a “better” solution? • Shortcomings for today’s computing…

  6. Overview • Cloud Storage: • What is cloud storage? • Object-based storage • Example: Windows Azure Storage (WAS) • Some research problems

  7. Overview • Flash-based solid state drive: • What is it? • Pros and Cons. • How can we integrate it in large-scale storage systems? • Future direction

  8. Integrating SSDs in Large-Scale Storage Systems • Considerations and Facts: • Non-uniform read access latencies, correlated with workload access pattern • Internal drive-specific operations impacts the performance • Internal fragmentation leads to performance degradation • More writes  write amplification  bad wear leveling • Performance vs. Lifetime vs. Cost

  9. Integrating SSDs in Large-Scale Storage Systems

  10. Gordon: SSD-only HPC Cluster • The first HPC cluster designed with SSD-only storage • Optimized to utilize SSDs high bandwidth, for data-intensive applications

  11. Gordon … (Cont’d) • Design goals: • Reduce performance gap between processor and I/O in large-scale data-intensive computing • Improve the system performance • Less power

  12. Gordon… (Cont’d) • How SSD is integrated? • Replaced conventional hard disks with SSDs • Major device-level modification: New flash translation layer

  13. Gordon… (Cont’d) • Costly $$$ • Not suitable for widespread adoption • Requires major device level modification • Optimized for specific workloads

  14. Hybrid: Griffin • Griffin hybrid storage system • SSD is an end-point store for the data • Uses HDDs as write-back buffers • Log-structured HDDs to buffer incoming writes • Extends SSD lifetime

  15. Hybrid: Hystor • Hybrid storage system • SSD is used to improve I/O performance • Read cache • Write-back buffer • Challenge: • What data to should be cached to gain from SSD performance and improve performance? • Minimum system changes.

  16. Conclusion • The choice of I/O interface allows for different storage access features • Cloud storage continues to grow to accommodate for the overflowing of data collected • Solid state drive has become an instrumental player in storage systems: but how can we best use it?

More Related