Size based scheduling policies with inaccurate scheduling information l.jpg
This presentation is the property of its rightful owner.
Sponsored Links
1 / 40

Size-Based Scheduling Policies with Inaccurate Scheduling Information PowerPoint PPT Presentation


  • 110 Views
  • Uploaded on
  • Presentation posted in: General

Size-Based Scheduling Policies with Inaccurate Scheduling Information. Dong Lu * , Huanyuan Sheng + , Peter A. Dinda * * Prescience Lab, Dept. of Computer Science + Dept. of Industrial Engineering & Management Science Northwestern University Evanston, IL 60201 USA. Outline.

Download Presentation

Size-Based Scheduling Policies with Inaccurate Scheduling Information

An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -

Presentation Transcript


Size based scheduling policies with inaccurate scheduling information l.jpg

Size-Based Scheduling Policies with Inaccurate Scheduling Information

Dong Lu*, Huanyuan Sheng+, Peter A. Dinda*

*Prescience Lab,Dept. of Computer Science

+Dept. of Industrial Engineering & Management Science

Northwestern University

Evanston, IL 60201 USA


Outline l.jpg

Outline

  • Review of size-based scheduling

  • Motivation

  • Simulation Setup

  • Simulation Results

  • New applications

  • Research Summary by subjects


Non size based scheduling l.jpg

Non-size-based scheduling

  • FCFS, PS, etc.

  • FCFS: First Come First Serve

    • Intuitive

    • Easiest to implement

  • PS: Processor Sharing

    • Fair: all jobs accept equal resources

    • Also easy to implement

Problem: Unaware of job size information, which results in big mean response time


Review of size based scheduling l.jpg

Review of size-based scheduling

  • SRPT, FSP, etc.

  • Utilize the job size (processing time, service time) information for scheduling

    • Optimal in mean response time

    • Fair?

    • Easy to implement?

We use Job Size to refer to the Processing Time (Service Time) of the job


Shortest remaining processing time srpt l.jpg

Shortest Remaining Processing Time (SRPT)

  • Always serve the job with minimum remaining processing time first, Preemptive scheduling

  • Yields minimum mean response time [Schrage, Operations Research, 1968]

  • Performance gains of SRPT over PS do not usually come at the expense of large jobs, in other words, it is Fair for heavy-tail job size distribution [Bansal and Harchol-Balter, Sigmetrics ‘01]

  • Easy to implement?

    • With accurate a priori job size information, YES

    • Otherwise, NO


Fair sojourn protocol fsp l.jpg

Fair Sojourn Protocol (FSP)

  • Combined SRPT with PS, preemptive scheduling

  • Mean response time is close to that of SRPT; and more fair than PS [Friedman, et al, Sigmetrics ‘03]

  • Easy to implement?

    • With accurate a priori job size information, YES

    • Otherwise, NO


Motivation l.jpg

Motivation

  • Size-based scheduling requires accurate knowledge of job sizes

  • In practice, a priori job size information is not always

  • available

  • All the previous work assumes perfect knowledge of job sizes a priori

  • How does performance depend on

  • quality of job size information?


Correlation l.jpg

Correlation

We study the performance of Size-based

schedulers as a function of the correlation

coefficient (Pearson’s R) between actual job sizes and estimated job sizes.


Outline9 l.jpg

Outline

  • Review of size-based scheduling

  • Motivation

  • Simulation Setup

  • Simulation Results

  • New applications

  • Research Summary by subjects


Simulation setup trace generator l.jpg

Simulation Setup: Trace generator

Correlation (Pearson’s R)

Distribution A

Distribution B

Trace Generator

  • X Y

  • 100

  • 300

  • . .

  • . .

  • . .

  • Correlated random pairs of X and Y

    • X has distribution A

    • Y has distribution B

    • X and Y are correlated to R


Simulation setup trace generator11 l.jpg

Simulation Setup: Trace generator

  • Algorithm: “Normal-To-Anything”

    • First developed by Cario and Nelson, on INFORMS Journal on Computing 10, 1 (1998).

    • We simplified the algorithm and first introduced it into the simulation studies of computer systems


Scatter plot of example traces l.jpg

Scatter plot of example traces

Y

Y

X

X

R=0.78

R=0.13


Simulation setup performance metrics l.jpg

Simulation Setup: Performance metrics

  • Performance metrics

    • Mean response time: Sojourn time, Turn-around time

    • Slowdown: the ratio of response time to its size. Fairness metric


Simulation setup simulator l.jpg

Simulation Setup: Simulator

  • Simulator

    • Written in C++

    • Supports M/G/1 and G/G/n/m queuing model

  • Simulator validation

    • Little’s law

    • Repeat the simulations in the FSP paper [Friedman, et al, Sigmetrics ‘03]

    • Compare with available theoretical results [Bansal and Harchol-Balter, Sigmetrics ‘01]


Simulation setup scheduling policies l.jpg

Simulation Setup: Scheduling Policies

  • PS: Processor sharing

  • Size-based scheduling policies

    • SRPT: Ideal SRPT scheduler

    • SRPT-E: SRPT scheduler using estimated job size

    • FSP: Ideal Fair Sojourn Protocol

    • FSP-E: FSP scheduler using estimated job size

Each simulation is repeated 20 times and we present the average


Outline16 l.jpg

Outline

  • Review of size-based scheduling

  • Motivation

  • Simulation Setup

  • Simulation Results

  • New applications

  • Research Summary by subjects


Simulation results mean response time l.jpg

Simulation Results: Mean response time


Simulation results slowdown r 0 0224 l.jpg

Simulation Results: Slowdown (R=0.0224)


Simulation results slowdown r 0 239 l.jpg

Simulation Results: Slowdown (R=0.239)


Simulation results slowdown r 0 4022 l.jpg

Simulation Results: Slowdown (R=0.4022)


Simulation results slowdown r 0 5366 l.jpg

Simulation Results: Slowdown (R=0.5366)


Simulation results slowdown r 0 7322 l.jpg

Simulation Results: Slowdown (R=0.7322)


Simulation results slowdown r 0 9779 l.jpg

Simulation Results: Slowdown (R=0.9779)


Simulation results conclusions l.jpg

Simulation Results: Conclusions

  • Performance heavily depends on correlation

    • SRPT-E and FSP-E can outperform PS given an effective job size estimator

  • Crossover point of performance metrics is a function of correlation

    • Also of job size distributions (See TR NWU-CS-04-33)


Outline25 l.jpg

Outline

  • Review of size-based scheduling

  • Motivation

  • Simulation Setup

  • Simulation Results

  • New applications

  • Research Summary by subjects


New applications web server scheduling tr nwu cs 04 33 l.jpg

New Applications: Web server scheduling (TR NWU-CS-04-33)

  • Is file size a good estimator of a job’s service time (processing time)? Not Really (R  0.14)

File Size

Service time (wall clock time)


New applications web server scheduling l.jpg

New Applications: Web server scheduling

  • Domain-based estimator: much more accurate prediction of the service timeat low overhead


New applications p2p server side scheduling lcr 04 l.jpg

New Applications: P2P server side scheduling (LCR ’04)

  • “Server side” of current file sharing P2P applications superficially similar to web server

    • Both send back files upon requests.

  • However, P2P application can’t even know the file size accurately a priori

    • Partial downloads

  • Our ongoing work shows that SRPT-E performs well using our time-series based job size estimators.


New applications network backup system scheduling l.jpg

New Applications: Network backup system scheduling

  • Incremental backup copies only the files that have been created or modified since a previous backup

  • With Incremental backup, the actual job sizes is difficult to know until the backup finishes

  • We believe that SRPT-E or FSP-E can be applied with time series based job size predictors


Summary l.jpg

Summary

  • Performance of size-based scheduling policies depends on correlation between size estimates and actual sizes

    • Fairness, mean response time, etc.

  • Estimator must preserve ordering of job sizes for high performance

    • Performance degrades as correlation degrades

  • Effective new estimators for Web and P2P


For more information l.jpg

For MoreInformation

  • Prescience Laboratory

    • http://plab.cs.northwestern.edu

  • Home Page of Dong Lu

    • http://www.cs.northwestern.edu/~donglu/


Outline32 l.jpg

Outline

  • Review of size-based scheduling

  • Motivation

  • Simulation Setup

  • Simulation Results

  • New applications

  • Research Summary by subjects


Research summary by subjects l.jpg

Research Summary by subjects

  • Grid Computing

  • Internet Measurement and Prediction

  • Queuing and Scheduling

  • Fat-Tree Based End System Multicast

  • Wireless Ad Hoc Networks

  • Incentivized Protocol Design for Peer-to-Peer Systems

  • Parallel Computing


Grid computing l.jpg

Grid Computing

  • Dong Lu, Peter Dinda, “GridG: Generating Realistic Computational Grids”, ACM SIGMETRICS Performance Evaluation Review (Per), Volume 30, Number 4, 2003.

  • Dong Lu, Peter Dinda, “Synthesizing Realistic Computational Grids,” Proceedings of the 15th ACM/IEEE Supercomputing (SC 2003), Phoenix, AZ, November 2003.

  • Peter Dinda, Dong Lu, “Nondeterministic queries in a relational Grid information service”, Proceedings of the 15th ACM/IEEE Supercomputing (SC 2003), Phoenix, AZ, November 2003.

  • Dong Lu, Peter Dinda, Jason Skicewicz ”Scoped and Approximated queries in a relational Grid Information Service”, Proceedings of 4th IEEE/ACM International Workshop on Grid Computing (Grid 2003), November, 2003, Phoenix, AZ.

  • Bin Lin, Peter Dinda, Dong Lu, ”User-driven Scheduling of Interactive Virtual Machines”, Proceedings of Grid 2004, PITTSBURGH, PA, November, 2004.


Internet measurement and prediction l.jpg

Internet Measurement and Prediction

  • Dong Lu, Y. Qiao, Peter Dinda, and F. Bustamante, ”Characterizing and Predicting TCP Throughput on the Wide Area Network”, Proceedings of the 25th IEEE International Conference on Distributed Computing Systems (ICDCS 2005), June 2005, Columbus, Ohio. To appear.

  • Dong Lu, Yi Qiao, Peter Dinda, Fabian Bustamante, ”Modeling and Taming Parallel TCP on the Wide Area Network”, Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS 2005), April 4-8, 2005, Denver, Colorado.


Queuing and scheduling l.jpg

Queuing and Scheduling

  • Dong Lu, Huanyuan Sheng, Peter Dinda, “Size-Based Scheduling Policies with Inaccurate Scheduling Information,” Proceedings of MASCOTS 2004, October 2004, Volendam, The Netherlands.

  • Dong Lu, Peter Dinda, Yi Qiao, Huanyuan Sheng, Fabian Bustamante, “Applications of SRPT Scheduling with Inaccurate Scheduling Information”, (short paper) Proceedings of MASCOTS 2004, October 2004, Volendam, The Netherlands.

  • Yi Qiao, Dong Lu, Fabian Bustamante, Peter Dinda, ”Looking at the Server-Side of Peer-to-Peer Systems”, Proceedings of the 7th ACM Workshop on Languages, Compilers, and Run-time Systems for Scalable Computers (LCR 2004), October 2004, Houston, Texas.

  • Dong Lu, Huanyuan Sheng, and Peter Dinda, “Effects and Implications of File Size/Service Time Correlation on Web Server Scheduling Policies”, Technical Report NWU-CS-04-33, Department of Computer Science, Northwestern University, April, 2004. In Submission.


Fat tree based end system multicast fatnemo l.jpg

Fat-Tree Based End System Multicast: FatNemo

  • Stefan Birrer, Dong Lu, Fabian Bustamante, Yi Qiao, Peter Dinda, “FatNemo: Building a Resilient Multi-Source Multicast Fat-Tree”, Proceedings of the Ninth International Workshop on Web Content Caching and Distribution (WCW 2004), October 2004, Beijing, China. Also appeared in LNCS, Vol. 3293/2004, pp. 182-196.

    • Long version in submission


Wireless ad hoc networks l.jpg

Wireless Ad Hoc Networks

  • Dong Lu, Haitao Wu, Qian Zhang, Wenwu Zhu, “PARS: Stimulating Cooperation for Power-Aware Routing in Ad-Hoc Networks”. Proceedings of the 40th IEEE International Conference on Communications (ICC 2005), May 2005, Seoul, Korea. To appear.


Incentivized protocol design for peer to peer systems l.jpg

Incentivized Protocol Design for Peer-to-Peer Systems

  • Dong Lu, Yi Qiao, Peter Dinda, Fabian Bustamante, “MultiTorrents: Bandwidth Optimized Hybrid Multicast for Incentivized P2P File Sharing”. In Submission.


Parallel computing l.jpg

Parallel Computing

  • Dong Lu, Peter Dinda ”Virtualized Audio: A Highly adaptive Interactive High Performance Computing Application”, Proceedings of the 6th Workshop on Languages, Compilers, and Run-time Systems for Scalable Computers (LCR 2002), Washington, DC, 2002. Also to appear in LNCS.


  • Login