
Thermal-aware Task Placement in Data Centers



Presentation Transcript


  1. Thermal-aware Task Placement in Data Centers SANDEEP GUPTA Department of Computer Science and Engineering School of Computing and Informatics Ira A. Fulton School of Engineering Arizona State University Tempe, Arizona, USA sandeep.gupta@asu.edu

  2. Tempe, Fulton School of Engineering & CSE

  3. IMPACT (Intelligent Mobile Pervasive Autonomic Computing & Technologies) Lab
Research Goals
• Enable context-aware pervasive applications
• Dependable distributed sensor networking
Projects
• Wireless Solutions for Smart Sensors in Biomedical Applications (NSF – ITR)
• Context-Aware Middleware for Pervasive Computing (NSF – NMI)
• Thermal Management of Data Centers (SFAz, NSF)
• Location-Based Access Control (CES)
• Identity Assurance (NSF, CES)
• Mobility-Tolerant Multicast (NSF)
• Ayushman – Infrastructure Testbed for Sensor-Based Health Monitoring (Mediserve Inc.)
Group
• Faculty: Dr. Sandeep K. S. Gupta
• 1 postdoc + 7 PhD + 2 MS + 1 UG
Department of Computer Science & Engineering, Tempe, Arizona. http://impact.asu.edu

  4. IMPACT Research: use-inspired research in pervasive computing & wireless sensor networking
• ID Assurance. Goal: protect people's identity & consumer computing from viral threats. Features: PKI-based; non-tamperable, non-programmable personal authenticator; hardware- and VM-based trust management.
• Intelligent Container. Goal: container monitoring for homeland security; dynamic supply chain management. Features: integration of RFID and environmental sensors; energy management; communication security.
• Mobile Ad-hoc Networks. Goal: protocols for mobile ad-hoc networks. Features: energy efficiency; increased lifetime; data aggregation; localization; caching; multicasting.
• Thermal Management for Data Centers. Goal: increasing computing capacity for data centers; energy efficiency. Features: online thermal evaluation; thermal-aware scheduling.
• Pervasive Health Monitoring. Goal: pervasive health monitoring; evaluation of medical applications. Features: secure, dependable, and reliable data collection, storage, and communication.
• Criticality-Aware Systems. Goal: evaluation of crisis response management. Features: theoretical model; performance evaluation; access control for crisis management.
Areas: medical devices; mobile, pervasive, and embedded sensor networks.
BOOK: Fundamentals of Mobile and Pervasive Computing, McGraw-Hill, Dec. 2004.

  5. Growth Trends in Data Centers
• Power density increases
  • Circuit density increases by a factor of 3 every 2 years
  • Energy efficiency increases by a factor of 2 every 2 years
  • Effective power density increases by a factor of 1.5 every 2 years [Kenneth Brill: The Invisible Crisis in the Data Center]
• Maintenance/TCO rising
  • Data center TCO doubles every three years
  • By 2009, the three-year cost of electricity will exceed the purchase cost of the server
  • Virtualization/consolidation is a one-time/short-term solution [Uptime Institute]
• Thermal management accounts for an increasing portion of expenses
• Thermal-aware solutions becoming prominent
• Increasing need for thermal awareness

  6. Motivation
• Cooling is the chief driver of increased data center construction cost, costing up to $5000 per square foot in initial purchase price. [1]
• Cooling is one of the leading contributors to ongoing total cost of ownership, costing one-half to one watt for every watt spent on computation. [2]
• Eliminating even 25% of total cooling costs can translate to a $1-$2 million annual cost reduction in a single large data center. [3]

  7. Related Work (extended domain)
Thermal-aware solutions exist along a software dimension and a physical dimension:
• Application: thermal-aware data center job scheduling
• Middleware: thermal-aware VM
• O/S: CPU load balancing
• Firmware: dynamic voltage scaling, dynamic frequency scaling, fan speed scaling
• Circuitry: redundancy
Physical dimension: IC, case/chassis, room.

  8. Contributions
• Developed thermal models of data centers
  • Analytical thermal models using theoretical thermodynamic formulations
  • Online thermal models using machine learning techniques
• Designed thermal-aware task placement algorithms
  • A genetic-algorithm-based task placement algorithm that minimizes heat recirculation among the servers and the peak inlet temperature
• Created a software architecture for dynamic thermal management of data centers
• Developed CFD models of real-world data centers for testing and validation of the thermal models and task placement algorithms

  9. Thermal issues in dense computer rooms (i.e., data centers, computer clusters, data warehouses)
• Heat recirculation: hot air from the equipment air outlets is fed back to the equipment air inlets
• Hot spots: an effect of heat recirculation; areas in the data center with alarmingly high temperature
• Consequence: cooling has to be set very low to keep all inlet temperatures in the safe operating range
Courtesy: Intel Labs

  10. Conceptual overview of thermal-aware task placement
• Task placement determines the temperature distribution
• The temperature distribution determines the equipment's peak air inlet temperature
• The peak air inlet temperature determines the upper bound on the CRAC temperature setting
• The CRAC temperature setting determines its efficiency (Coefficient of Performance; source: HP)
• The lower the peak inlet temperature, the higher the CRAC efficiency
Bottom line: there is a task placement that maximizes cooling efficiency. Find it!
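
As an illustration of the CRAC-efficiency point, here is a sketch using the COP curve HP reported for a chilled-water CRAC unit (see reference [3] below); treat the curve and the wattages as assumptions for this example, not measurements from these slides:

```python
def crac_cop(t_sup_celsius):
    """COP curve for a chilled-water CRAC unit as reported by HP
    (reference [3]); treated here as an illustrative assumption."""
    return 0.0068 * t_sup_celsius**2 + 0.0008 * t_sup_celsius + 0.458

def cooling_power(computing_watts, t_sup_celsius):
    """Power the CRAC draws to remove `computing_watts` of heat:
    P_cooling = P_computing / COP(T_sup)."""
    return computing_watts / crac_cop(t_sup_celsius)

# Raising the supply temperature cuts cooling power noticeably
# (232 kW is the full-utilization load of the simulated data center).
for t in (15.0, 20.0, 25.0):
    print(f"T_sup={t:.0f} C  COP={crac_cop(t):.2f}  "
          f"cooling for 232 kW of computing: {cooling_power(232e3, t) / 1e3:.0f} kW")
```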

  11. Prerequisites for thermal management
• Task profiling: CPU utilization, I/O activity, etc.
• Equipment power profiling: CPU consumption, disk consumption, etc.
• Heat recirculation modeling
• Task management technologies
• Need for a comprehensive research framework

  12. Thermal management research framework
• Characterization: characterize the power consumption of a given workload (CPU, memory, disk, etc.) on given equipment
• Thermal models: enable online, real-time thermal-aware job scheduling; fast (analytical, not CFD-based); non-invasive (machine learning); model the thermal impact of multicore systems
• Thermal-aware job scheduling: an online job scheduling algorithm that minimizes peak air inlet temperature, thus minimizing the cost of cooling
http://impact.asu.edu/ (Sandeep Gupta, Qinghui Tang, Tridib Mukherjee, Michael Jonas, Georgios Varsamopoulos)

  13. Task profiling: measurements at the ASU HPC Data Center (one chassis)

  14. Power Model and Profiling
• Power consumption is mainly affected by CPU utilization
• Power consumption is linear in CPU utilization: P = aU + b
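
To make the model concrete, a minimal sketch of fitting the coefficients a and b by least squares; the utilization/power samples below are hypothetical, not the ASU measurements:

```python
import numpy as np

# Hypothetical profiling samples: CPU utilization (fraction) vs. measured power (W).
utilization = np.array([0.0, 0.2, 0.4, 0.6, 0.8, 1.0])
power_watts = np.array([150.0, 180.0, 210.0, 242.0, 270.0, 301.0])

# Least-squares fit of the linear power model P = a*U + b.
a, b = np.polyfit(utilization, power_watts, deg=1)
print(f"P = {a:.1f} * U + {b:.1f} W")

# Predict power at 50% utilization.
print(f"Predicted power at U=0.5: {a * 0.5 + b:.1f} W")
```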

  15. Linear Thermal Model
T_in = T_sup + D × P
where T_in is the vector of inlet temperatures, T_sup the vector of supplied (CRAC) air temperatures, D the heat distribution matrix, and P the power vector.
• Heat recirculation coefficients: analytical, matrix-based
• Properties of the model: granularity at the air inlets (discrete/simplified); assumes steady air flow
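
A minimal sketch of evaluating the model, assuming a hypothetical 3-node heat distribution matrix D and power vector P (and a single scalar supply temperature, broadcast to all inlets):

```python
import numpy as np

# Hypothetical heat distribution matrix D (degrees C of inlet rise per watt).
# Entry D[i][j] captures how node j's power raises node i's inlet temperature.
D = np.array([
    [0.010, 0.004, 0.001],
    [0.004, 0.010, 0.004],
    [0.001, 0.004, 0.010],
])

T_sup = 15.0                            # CRAC supply temperature (degrees C)
P = np.array([250.0, 180.0, 300.0])     # per-node power consumption (W)

# Linear thermal model: T_in = T_sup + D @ P
T_in = T_sup + D @ P
print("Inlet temperatures:", T_in)
print("Peak inlet temperature:", T_in.max())
```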

  16. Benefit: fast thermal evaluation
• CFD approach: give workload, run a CFD simulation (days), extract temperatures (Courtesy: Flometrics)
• Linear model approach: give workload, compute the vector T_sup + D × P (seconds), which yields the temperatures

  17. Thermal-aware Task Placement Problem
Given an incoming task, find a task partitioning and placement of subtasks that minimizes the (increase of) peak inlet temperature. With the power model P = aU + b, where U is the utilization vector, the inlet temperatures become:
T_in = T_sup + D × (aU + b)
• XInt algorithm: an approximate solution via a genetic algorithm; take a feasible solution and perform mutations until a set number of iterations is reached
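
The slides do not give XInt's internals beyond "mutate a feasible solution", so the following is a hedged genetic-algorithm sketch in that spirit, not the authors' implementation; the matrix D, the power coefficients, and the GA parameters are all illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(42)

# --- Illustrative model parameters (assumptions, not values from the slides) ---
N = 10                      # number of chassis
a, b = 150.0, 120.0         # linear power model P = a*U + b per chassis (W)
T_sup = 15.0                # CRAC supply temperature (degrees C)
# Hypothetical heat distribution matrix D (degrees C of inlet rise per watt).
D = 0.002 * rng.random((N, N)) + np.diag(np.full(N, 0.008))
TOTAL_UTIL = 4.0            # total utilization to spread across the chassis

def peak_inlet(U):
    """Peak inlet temperature under placement U: max of T_sup + D(aU + b)."""
    return (T_sup + D @ (a * U + b)).max()

def random_placement():
    """A feasible placement: utilizations in [0, 1] summing to ~TOTAL_UTIL."""
    return np.clip(rng.dirichlet(np.ones(N)) * TOTAL_UTIL, 0.0, 1.0)

def mutate(U):
    """Shift some utilization from chassis i to chassis j, staying feasible."""
    U = U.copy()
    i, j = rng.choice(N, size=2, replace=False)
    delta = min(U[i], 1.0 - U[j]) * rng.random()
    U[i] -= delta
    U[j] += delta
    return U

# Simple evolutionary loop: mutate random survivors, keep the best placements.
population = [random_placement() for _ in range(20)]
for _ in range(500):
    parents = [population[rng.integers(len(population))] for _ in range(20)]
    population = sorted(population + [mutate(p) for p in parents], key=peak_inlet)[:20]

print(f"Best peak inlet temperature: {peak_inlet(population[0]):.2f} C")
```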

  18. Contrasted scheduling approaches
• Uniform Outlet Profile (UOP): assign tasks so as to achieve a uniform outlet temperature distribution; assign more tasks to nodes with low inlet temperature (a water-filling process)
• Minimum computing energy: assign tasks so as to keep the number of active (powered-on) chassis as small as possible; the server with the coolest inlet temperature goes first
• Uniform Task (UT): assign all chassis the same amount of tasks (power consumption); all nodes experience the same power consumption and temperature rise
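
For contrast with the GA sketch above, a sketch of the minimum-computing-energy (coolest-inlet-first) heuristic under the same illustrative model; the treatment of idle chassis is simplified, as noted in the comments:

```python
import numpy as np

rng = np.random.default_rng(0)

# Same illustrative model as the GA sketch above (all values are assumptions).
N = 10
a, b = 150.0, 120.0
T_sup = 15.0
D = 0.002 * rng.random((N, N)) + np.diag(np.full(N, 0.008))
TOTAL_UTIL = 4.0

def inlet_temps(U):
    """Inlet temperatures: T_sup + D(aU + b). Note: idle chassis still draw
    b watts in this sketch; the slides' variant powers unused chassis off."""
    return T_sup + D @ (a * U + b)

# Minimum computing energy heuristic: load the coolest-inlet servers to 100%
# first, keeping the number of active chassis as small as possible.
U = np.zeros(N)
remaining = TOTAL_UTIL
for i in np.argsort(inlet_temps(U)):     # coolest idle inlet first
    U[i] = min(1.0, remaining)
    remaining -= U[i]
    if remaining <= 0:
        break

print("Placement:", np.round(U, 2))
print(f"Peak inlet temperature: {inlet_temps(U).max():.2f} C")
```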

  19. Simulated Environment
• Used Flometrics Flovent
• Simulated a small-scale data center
• Physical dimensions: 9.6 m × 8.4 m × 3.6 m
• Two rows of industry-standard 42U racks
• CRAC supply at 8 m³/s
• 10 racks, each equipped with 5 chassis
• 1000 processors in the data center
• 232 kW at full utilization

  20. Performance Results
• XInt outperforms the other algorithms
• Data centers almost never run at 100%
• Plenty of room for benefits!

  21. Performance Results (continued)
• XInt outperforms the other algorithms
• Data centers almost never run at 100%
• Plenty of room for benefits!

  22. Power Vector Distribution
XInt contradicts the "rule of thumb" of placing load at the bottom.

  23. Supply Heat Index (SHI)
• A metric developed by HP Labs; quantifies the overall heat recirculation of a data center
• XInt consistently has the lowest SHI
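
The slide does not state the formula. A common statement of HP Labs' SHI is the ratio of heat picked up before the inlets (recirculation) to the total heat picked up by the outlets, SHI = Σ(T_in − T_sup) / Σ(T_out − T_sup); the sketch below assumes that formulation and uses hypothetical temperatures:

```python
import numpy as np

def supply_heat_index(t_in, t_out, t_sup):
    """SHI under the assumed HP Labs formulation: fraction of total heat
    that recirculates into the inlets. 0 means no recirculation."""
    t_in, t_out = np.asarray(t_in), np.asarray(t_out)
    return (t_in - t_sup).sum() / (t_out - t_sup).sum()

# Hypothetical inlet/outlet temperatures (degrees C) for 4 chassis.
t_sup = 15.0
t_in = [18.0, 19.5, 17.2, 21.0]
t_out = [28.0, 30.5, 26.8, 33.0]
print(f"SHI = {supply_heat_index(t_in, t_out, t_sup):.3f}")
```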

  24. Conclusions
• Thermal-aware task placement can significantly reduce heat recirculation
• XInt's performance thrives at around 50% CPU utilization; not much can be done at 100% utilization
• Cooling savings can exceed 30% (in comparison to the other schemes)
• Cost of operation reduces by 15% (given an initial 1:1 ratio of computing to cooling)

  25. Related Work in Progress
• Relaxing simplifying assumptions: equipment heterogeneity [INFOCOM 2008]; stochastic task arrival
• Thermal maps through machine learning: automated, non-invasive, cost-effective [GreenCom 2007]
• Implementations: thermal-aware Moab scheduler; thermal-aware SLURM; SiCortex product thermal management

  26. Algorithm Assumptions
• HPC model in mind
  • Long-running jobs (finish time is effectively the same: infinity)
  • One-time arrival (starting time is the same)
  • Utilization homogeneity (same utilization throughout a task's length)
  • Non-preemptive, non-movable tasks
• Data center equipment homogeneity: in power consumption and computational capability
• Cooling is self-controlled

  27. Thank You • Questions? • Comments? • Suggestions? http://impact.asu.edu/

  28. References
[1] AMD. Power and Cooling in the Data Center (34146A_PC_WP_en.pdf).
[2] HP Labs. Going beyond CPUs: The Potential of Temperature-Aware Solutions for the Data Center.
[3] HP Labs. Making Scheduling Cool: Temperature-Aware Workload Placement in Data Centers.

  29. Additional Slides

  30. Functional model of scheduling
• Tasks arrive at the data center
• The scheduler figures out the best placement, i.e., the placement that has minimal impact on peak inlet temperatures
• The scheduler assigns tasks accordingly

  31. Architectural View
Monitoring processes provide measurements; a machine learning component creates/updates the thermal model; the scheduler (Moab, SLURM) dispatches jobs, with control and report links connecting the scheduler to the thermal model and the monitoring processes.

  32. A simple thermal model
• Basic idea: we don't need an extensive CFD model; we only need to know the effect of recirculation at specific points
• Express recirculation as "coefficients"
[Figure: recirculation among nodes N1-N5. Courtesy: Intel Labs]

  33. Recirculation coefficients: a fast thermal model
• Reduce/simplify the "thermal map" concept to the points of interest: the equipment air inlets
• Coefficients can be computed from CFD models/simulations
• Matrix A of recirculation coefficients: a_ij is the portion of heat exhausted from node i that directly goes to node j

  34. Data Center Thermal Management
Thermal-aware solutions at various levels. Software dimension: application-level data center job scheduling; thermal-aware VM (middleware); O/S-level CPU load balancing; firmware-level dynamic voltage scaling, dynamic frequency scaling, and fan speed scaling; circuitry redundancy. Physical dimension: IC, case/chassis, room.
Increasing need for thermal awareness
• Power density increases: circuit density increases by a factor of 3 every 2 years; energy efficiency increases by a factor of 2 every 2 years; effective power density increases by a factor of 1.5 every 2 years [Kenneth Brill: The Invisible Crisis in the Data Center]
• Maintenance/TCO rising: data center TCO doubles every three years; by 2009, the three-year cost of electricity will exceed the purchase cost of the server; virtualization/consolidation is a one-time/short-term solution
• Thermal management accounts for an increasing portion of expenses; thermal-aware solutions becoming prominent
A dynamic thermal-aware control platform is necessary for online thermal evaluation.
Thermal issues
• Heat recirculation: increases as equipment density exceeds the cooling capacity as planned
• Hot spots: an effect of heat recirculation
• Impact: cooling has to be set low enough to keep all inlet temperatures in the safe operating range
Opportunities & Challenges
• Data centers don't run at full utilization
• Can choose among multiple CPUs to allocate a job; different thermal impact per CPU
• Need for fast thermal evaluation
• Temporal and spatial heterogeneity of data centers: in equipment, in workload
[Chart: annual cost ($1M-$100M scale) of cooling vs. computation per year, with and without thermal-aware management]

  35. Different demands for cooling capacity
Scheduling impacts the cooling setting.
[Figure: inlet temperature distribution without cooling and with cooling, for Scheduling 1 and Scheduling 2 at a 25 °C setting]

  36. Results (1)
• Recirculation coefficients are consistent with data center observations
• Large values are observed along the diagonal
• Strong recirculation among neighboring servers, and between bottom servers and top servers
[Figure: recirculation coefficient matrix among chassis positions 1-10]
