1 / 11

OSCAR

Workload Management. OSCAR. Jeremy Enos OSCAR Annual Meeting January 10-11, 2002. Topics. Current Batch System – OpenPBS How it Works, Job Flow OpenPBS Pros/Cons Schedulers Enhancement Options Future Considerations Future Plans for OSCAR. OpenPBS. PBS = Portable Batch System

elia
Download Presentation

OSCAR

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Workload Management OSCAR Jeremy Enos OSCAR Annual Meeting January 10-11, 2002

  2. Topics • Current Batch System – OpenPBS • How it Works, Job Flow • OpenPBS Pros/Cons • Schedulers • Enhancement Options • Future Considerations • Future Plans for OSCAR

  3. OpenPBS • PBS = Portable Batch System • Components • Server – single instance • Scheduler – single instance • Mom – runs on compute nodes • Client commands – run anywhere • qsub • qstat • qdel • xpbsmon

  4. OpenPBS - How it Works • User submits job with “qsub” • Execution host (mom) must launch all other processes • mpirun • ssh/rsh/dsh • pbsdsh • Output • spooled on execution host (or in user’s home dir) • moved back to user node (rcp/scp)

  5. OpenPBS – Job Flow User Node (runs qsub) Job output rcp/scp Server (queues job) Execution host (mother superior) Scheduler (tells server what to run) Compute Nodes

  6. OpenPBS – Monitor (xpbsmon)

  7. OpenPBS - Schedulers • Stock Scheduler • Pluggable • Basic, FIFO • Maui • Plugs into PBS • Sophisticated algorithms • Reservations • Open Source • Supported • Redistributable

  8. OpenPBS – in OSCAR2 • List of available machines • Select PBS for queuing system • Select one node for server • Select one node for scheduler • Select scheduler • Select nodes for compute nodes • Select configuration scheme • staggered mom • process launcher (mpirun, dsh, pbsdsh, etc)

  9. Pros Open Source Large user base Portable Best option available Modular scheduler Cons License issues 1 year+ devel lag Scalability limitations number of hosts number of jobs monitor (xpbsmon) Steep learning curve Node failure intolerance Not developed on Linux OpenPBS – On the Scale

  10. OpenPBS –Enhancement Options • qsub wrapper scripts/java apps • easier for users • allows for more control of bad user input • 3rd party tools, wrappers, monitors • Scalability source patches • “Staggered moms” model • large cluster scaling • Maui Silver model • “Cluster of clusters” diminishes scaling requirements • never attempted yet

  11. Future Considerations for OSCAR • Replace OpenPBS (with what? when?) • large clusters are still using PBS • Negotiate better licensing with Veridian • Continue incorporating enhancements • test Maui Silver, staggered mom, etc. • 3rd party extras, monitoring package

More Related