QoS Aware Scheduling in a Cluster-Based Web Server

QoS Aware Scheduling in a Cluster-Based Web Server Jiani Guo Architecture Lab Department of Computer Science and Engineering University of California, Riverside

Reference • Performance Guarantees for Cluster-Based Internet Services, ICDCS 2003 Chang Li, Gang Peng, Kartik Gopalan Tzi-cker Chiuh State University of New York at Stony Brook

Web Services Courtesy: Performance Guarantees for Cluster-Based Internet Services, Chang Li.

Differentiated Service • A system is said to be capable of affording differentiated service among service classes if • The system permits its resources to be proportioned among the service classes • Given sufficient request load, a service class receives at least as much resources as were assigned to it irrespective of the load on other service classes • Resources not used by some service class may be distributed among other service classes. • QoS Metrics • The number of generic URL requests per second • A generic URL request represents an average web site access which is assumed to take 10 msec of CPU time, 10 msec of disk channel usage time and 2000-bytes of network bandwidth • For example: QoS requirement is 50 GRPS, which means 500 msec of CPU time, 500 msec of disk access time and 100 Kbytes of the network bandwidth

Scheduling Framework

Request scheduling • Request selection • Weighted round robin (WRR) • No idea about the resource a request will consume on dispatching it • Predict per-request resource usage using history • Feedback to correct the prediction • Server node selection • Load balancing among server nodes (Least Load First) • Select a node based on resource usage accounting • What to account • CPU, disk and network bandwidth • Accounting granularity • Per-request • Per-server • Per process-set Courtesy: Performance Guarantees for Cluster-Based Internet Services, Chang Li.

Performance Isolation

Performance deviation from ideal reservation Averaging Interval (secs)

My Previous Work on Scheduling:Scheduling Multimedia Jobs among Servers

Transcoding Workload A media unit is a Group Of Pictures(GOP) of MPEG stream • A media unit can be transcoded independently by any Worker in the cluster. Transcoding one media unit is considered an independent job. • No communication is required among jobs. • Each job consumes similar amount of processing time. • Consecutive media units in a stream are preferred to be processed in order.

Find an available Computing Server fetch a unit Send the unit Load Balancing Schemes Computing Server • How to take QoS into consideration? • Streams make reservations • Received service is proportional to the reservations Computing Server Media Server Unit Buffer . . . Retriever Scheduler Computing Server

Framework of Fair Scheduling

QoS Aware Scheduling in a Cluster-Based Web Server

QoS Aware Scheduling in a Cluster-Based Web Server

Presentation Transcript

A Combinatorial Procurement Auction for QOS-Aware Web Services Composition

Power-aware scheduling

Web Server Load Balancing/Scheduling

Server Cluster and LVS based Cluster

QoS Based on Context-Aware Middleware in Wireless Sensor Network

Scheduling in Server Farms

Locality-Aware Request Distribution in Cluster-based Network Servers

QoPS: A QoS based Scheme for Parallel Job Scheduling

Scheduling in Web Server Clusters

Preference-Aware Query and Update Scheduling in Web-databases

Web-based Irrigation Scheduling

p-Jigsaw: A Cluster-based Web Server with Cooperative Caching Supports

A QoS-Aware Multicast Routing Protocol

Energy Efficient Web Server Cluster

QoS-Aware Memory Systems

QoS-Aware Dependency Management for Component Based Systems

Cluster scheduling

Performance Analysis of Preemption-aware Scheduling in Multi-Cluster Grid Environments

Power-aware scheduling

QoS Aware