1 / 18

Research Computing Environment at the University of Alberta

Research Computing Environment at the University of Alberta. Diego Novillo Research Computing Support Group University of Alberta April 1999. Computing Environment. SGI Origin 2000, 42 CPUs, 10Gb RAM Mix of interactive and batch jobs 2 CPUs for interactive activity

brock
Download Presentation

Research Computing Environment at the University of Alberta

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Research Computing Environment at the University of Alberta Diego Novillo Research Computing Support Group University of Alberta April 1999

  2. Computing Environment • SGI Origin 2000, 42 CPUs, 10Gb RAM • Mix of interactive and batch jobs • 2 CPUs for interactive activity • 40 CPUs used by batch jobs • Batch jobs managed by LSF (Platform)

  3. How is the system being used?

  4. Monthly System Utilization (CPU days) Theoretical max

  5. Average wait time in queue (hours) Need to balance parallel jobs Started using load thresholds

  6. System usage by job type

  7. Some thoughts on usage • Scalar use is predominant (so far) • We are starting to push the system • Jobs are waiting too long in the queue • Need to modify queue policies • Lower runtime limits • Checkpoint/restart • Limit on number of jobs submitted

  8. Using LSF

  9. Job queues • Parallel queue  par • High priority • Slot-based: up to 32 processors • Jobs are never suspended • Sequential queue  nic • Low priority • threshold-based: up to 95% system utilization • Jobs can be preempted by parallel jobs

  10. Job queues II • Two special queues • npseq • For sequential jobs that do not wish to be preempted • Very low priority • Only 4 slots available • special • Jobs that need to run longer than system limit • Only 1 slot available • Must be approved by committee

  11. Fairshare system • Jobs are scheduled according to priorities • Priorities are dynamic and based on • Number of shares • Past usage (currently 2 weeks of history) • Type of job (parallel jobs higher priority) • Resource availability also important

  12. Getting started • Complete LSF documentation online http://www.ualberta.ca/CNS/RESEARCH/LSF/ • Man pages also available • Add one line to your login files source /usr/local/lsf/cshrc.lsf (C shell) or . /usr/local/lsf/profile.lsf (Bourne shell)

  13. Submitting jobs % bsub [options] pgm args -q name Which queue to use -n num How many processors -o file Output file • Queue defaults to ‘nic’. • If no output file is given, results are mailed to you.

  14. Watching jobs % bjobs [options] -l All the details -p Only pending jobs (and why) -a All jobs (even finished ones) -uall All the jobs in the system jobid Just the job with this id

  15. Manipulating jobs % bkill jobid Kills the job (can also send signal) % bstop jobid Suspends the job (even if not running) % bresume jobid Resumes the job

  16. Getting usage statistics • We keep monthly stats in our web page http://www.ualberta.ca/CNS/RESEARCH/ • For current information % bacct [opts] Total usage for your jobs. Can specify dates and jobs % priorities (or bhpart -r) Lists all the priorities for different groups

  17. Monitoring load on the system % bqueues Shows queues and how loaded they are % lsload Quick glance at the load on the system • Also GUI tools (xlsbatch, xlsmon) • Please use sparingly as they add to interactive load on the system.

  18. Contact Information • Visit our home page http://www.ualberta.ca/CNS/RESEARCH/ • Questions and comments Research.Support@Ualberta.CA

More Related