Energy-efficient, Thermal-aware Data Placement, Replication, and Scheduling in Data Centers

Energy-efficient, Thermal-aware Data Placement, Replication, and Scheduling in Data Centers Amol Deshpande SamirKhuller Department of Computer Science and UMIACS University of Maryland at College Park

Motivation: Data Centers • Large data centers are a key to handle rapidly growing data management needs • Consume increasingly large amounts of energy both for computing itself, and for cooling • Trend toward higher density of components raises many new challenges w.r.t. thermal issues and energy costs • Hotspots: cooling systems cannot deal effectively with hotspots • Temperature constraints: component temperatures cannot exceed hard thresholds – higher failure rates • Spatial effects: temperature increase at a machine affects temperatures at components nearby • Temperature-dependent power draw: leakage power increases exponentially with temperature

Rethinking Optimization • Data replication, placement, and migration • Given an expected workload, find a data placement that results in better energy efficiency and avoids thermal hotspots • Energy efficiencyclustering related data items together • But may result in thermal hotspots • How to use the inherent replication in data centers to optimize for these new, often conflicting, optimization goals • Task assignment and scheduling • Energy efficiency switchon as few machines as possible • But thermal balancing spread out tasks over time and space • Controlling disk and processor speeds • New hardware often comes with knobs to control performance • How to use those to achieve energy efficiency w/o affecting performance?

Preliminary Results • Energy-efficient scheduling • Goal: Given performance constraints, minimize the total activation cost, i.e., turn on as few machines as possible, to execute the workload • Designed approximation algorithms with provable bounds [KLS’10, LK’11] • Workload-aware data placement and replication • Can be modeled as a hypergraph partitioning problem • Designed several algorithms that try to minimize the number of machines involved in answering a query [KDK’11] HMetis: State-of-the-art Hypergraph Partitioning Algorithm LMBR: A greedy algorithm that does sophisticated local moves Significant energy savings possible by doing workload-driven optimization

Challenges • How to model power consumption as a function of load and temperature? • Too much variance across different hardware platforms • Hardware components often have their own mechanisms to handle undesirable situations (e.g., throttling down if temperature too high) • How to model the temperature in a data center? • Spatial effects are best modeled using computational fluid dynamics • Infeasible for large-scale data centers • Temporal (cooling) effects can be modeled using Fourier’s Law • Unclear if optimization problems can be solved under that model • Need simpler models that approximate the behavior sufficiently well • Developing robust abstractions that are useful across a variety of hardware platforms and component mixes • Infrastructure and/or simulation frameworks for evaluation

Energy-efficient, Thermal-aware Data Placement, Replication, and Scheduling in Data Centers

Energy-efficient, Thermal-aware Data Placement, Replication, and Scheduling in Data Centers

Presentation Transcript

DENS: Data Center Energy-Efficient Network-Aware Scheduling

Energy Aware Lossless Data Compression

Towards Thermal Aware Workload Scheduling in a Data Center

Utility-Function-Driven Energy-Efficient Cooling in Data Centers

Reliable and Efficient Data Placement in a Grid Environment

Leveraging Renewable Energy in Data Centers

Nectar: Efficient Management of Computation and Data in Data Centers

Storage Class Memory Architecture for Energy Efficient Data Centers

Energy and heat-aware metrics for data centers

Thermal Aware Workload Scheduling with Backﬁlling for Green Data Centers

Cooling-Aware and Thermal-Aware Workload Placement for Green HPC Data Centers

Energy Efficiency in Cloud Data Centers: Energy Efficient VM Placement for Cloud Data Centers

Thermal-aware Task Placement in Data Centers

Energy-aware Hierarchical Scheduling of Applications in Large Scale Data Centers

Thermal Aware Data Management in Cloud based Data Centers

Thermal Management of Heterogeneous Data Centers

Quality-Aware Replication of Multimedia Data

Managing and Scheduling Data Placement (DaP) Requests

Cost- and Energy-Aware Load Distribution Across Data Centers

Energy Efficiency in Data Centers

Data Center Energy-Efficient Network-Aware Scheduling

Managing and Scheduling Data Placement (DaP) Requests in GRID