New features in PATROL version 3

Michael Jung (TU-Berlin), Waltraut Niepraschk (DESY)

System overview

Patrol actions and resources control

Configuration

WWW Interface

Patrol usage

New features in PATROL 3.0


Patrol 3.0

  • based on SLAC patrol by C. Boheim

  • modified and extended by M. Jung

  • WWW interface in Javascript available

    supported architectures

    AIX, IRIX, SunOS, Linux, Solaris, DEC-Unix, HP-UX

    easy adaptation to new architectures by specifying patterns for the output of certain system commands
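The idea of adapting to a new architecture through output patterns can be sketched as follows. This is a minimal illustration in Python, not patrol's actual code or pattern syntax: the table `DF_PATTERNS`, the helper `parse_df_line`, the architecture key `example_arch` and its column order are all assumptions made up for the example.

```python
import re

# Hypothetical per-architecture patterns for one line of `df` output.
# Supporting a new architecture would only mean adding one more entry.
DF_PATTERNS = {
    # device, total, used, available, use%, mount point
    "linux": re.compile(
        r"^(?P<dev>\S+)\s+\d+\s+\d+\s+\d+\s+(?P<pct>\d+)%\s+(?P<mount>/\S*)$"
    ),
    # Invented layout for illustration only: mount point, device, use%
    "example_arch": re.compile(
        r"^(?P<mount>/\S*)\s+(?P<dev>\S+)\s+(?P<pct>\d+)%$"
    ),
}


def parse_df_line(arch, line):
    """Return (mount_point, percent_used), or None if the line does not match."""
    match = DF_PATTERNS[arch].match(line.strip())
    if match is None:
        return None
    return match.group("mount"), int(match.group("pct"))
```

Lines that do not match, such as the `df` header line, simply yield `None`, so the same loop can be run over the full command output.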

Patrol resources control

Obtaining information on

processes and daemons (ps)

file systems (df)

file sizes (ls)

services and ports (netstat)

hosts (uptime)

return codes or timing (timeout) of arbitrary commands

Resource checks are based on value and change of value

(as compared to last run of patrol)

Tests

on limits (value, value+delta, (val1, val2, val3+delta))

with relops (>, <, =, !=, =~/regexp/)

Patrol actions

If tests fulfil specified criteria, perform actions:

mail (to users, to admins)

kill (processes)

nice (processes)

restart (daemons)

write (to syslog)

execute system commands

execute snippets of perl code (access to patrol internal variables)

Mail texts, system commands and perl snippets are defined in blocks for easier reference

Patrol configuration

  • patrol actions are defined as rules in a configuration file

  • rules act on targets (identified by hostname, ostype, netgroup, ...)

    rule format: rule_type target resource condition action

    rule types:

    F    file system              HL   host, load limit
    D    daemon                   HT   host, uptime limit
    SP   system port              HU   host, user limit
    PC   process, cpu limit       CC   command, exit code
    PM   process, memory limit    CT   command, time out
    PT   process, time limit      W    file size
    PN   process, number limit    SP   service port
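To make the five-field rule format concrete, here is a minimal parsing sketch. It assumes that the fields are separated by tab characters and that all five fields are present (possibly empty); the slides do not state the real separator. The names `Rule`, `parse_rule` and `RULE_TYPES` are hypothetical and not part of patrol.

```python
from dataclasses import dataclass

# Rule type codes as listed on the slide (SP appears there twice).
RULE_TYPES = {
    "F": "file system", "D": "daemon", "SP": "system port",
    "PC": "process, cpu limit", "PM": "process, memory limit",
    "PT": "process, time limit", "PN": "process, number limit",
    "HL": "host, load limit", "HT": "host, uptime limit",
    "HU": "host, user limit", "CC": "command, exit code",
    "CT": "command, time out", "W": "file size",
}


@dataclass
class Rule:
    rule_type: str
    target: str
    resource: str
    condition: str
    action: str


def parse_rule(line):
    """Split one rule line into its five fields (assumed tab-separated)."""
    fields = [field.strip() for field in line.split("\t")]
    if len(fields) != 5:
        raise ValueError(f"expected 5 fields, got {len(fields)}")
    rule = Rule(*fields)
    if rule.rule_type not in RULE_TYPES:
        raise ValueError(f"unknown rule type: {rule.rule_type}")
    return rule
```

Keeping the action as one opaque string means that an action list such as `mail(admin, MF1)` can contain spaces and commas without affecting the field split.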

Configuration examples

  • restart sshd and notify admins by email

    D    *    sshd    restart("sshd"), mail(admin, MD)

  • renice some jobs on IRIX systems (not Codine batch jobs)

    PC    [irix]    !{cod_}    >50%    nice(8), mail($user, admin, MPC)

  • watch the /usr1 file system on host hydra

    F    hydra    /usr1    >95%    mail(admin, MF1)

  • notify admins, if load on aisa machines is above 2

    HL    aisa[0-9]    >2    mail(admin, MHL)

  • notify admins, if /etc/check has nonzero return code (netgroup hps)

    CC    (hps)    /etc/check    >0    mail(admin, MCC)
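The target field in these examples takes several forms. Read together with the example descriptions, `*` appears to mean all hosts, `[irix]` an ostype, `(hps)` a netgroup, and anything else a hostname or hostname pattern. The sketch below encodes that reading; the function `target_matches` is a hypothetical name, and treating a plain target as a regular expression that must match the whole hostname is an assumption.

```python
import re


def target_matches(target, hostname, ostype, netgroups):
    """Decide whether a rule target applies to the given host."""
    if target == "*":
        return True  # wildcard: every host
    if target.startswith("[") and target.endswith("]"):
        # [ostype]: compare operating system type, ignoring case
        return target[1:-1].lower() == ostype.lower()
    if target.startswith("(") and target.endswith(")"):
        # (netgroup): host must be a member of the named netgroup
        return target[1:-1] in netgroups
    # Otherwise: hostname or hostname pattern (assumed to be a full-match regexp)
    return re.fullmatch(target, hostname) is not None
```

With this reading, `target_matches("aisa[0-9]", "aisa3", "irix", set())` applies the load rule to host aisa3, while a host named hydra2 would not be covered by the target `hydra`.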

WWW Interface

  • patrol runs periodically (cron) on all hosts to be checked, no communication between hosts, no central information retrieval

  • WWW interface runs periodically on a single host (WWW server)

    • gathers information on all hosts over a (configurable) period of time

    • consists of a perl script (part of patrol) and Javascript HTML files (generated by patrol)

    • provides a global view of the system as well as information on (configurable) subgroups and on individual hosts

    • can also process and display data from other (monitoring) tools

  • see screen dump of our system in routine use
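The gathering step of the WWW interface could look roughly like the sketch below: per-host results from a configurable time window are reduced to one status per host plus one overall status. The record layout `(hostname, timestamp, status)`, the status names in `SEVERITY` and the function `summarize` are assumptions for illustration; the slides do not describe how patrol stores or transfers its results.

```python
from datetime import datetime, timedelta

# Hypothetical status levels, ordered by severity.
SEVERITY = {"ok": 0, "warning": 1, "alarm": 2}


def summarize(records, now, window):
    """Return (worst status per host, overall status) for records inside the window."""
    per_host = {}
    for host, stamp, status in records:
        if now - stamp > window:
            continue  # older than the configured period: ignore
        worst = per_host.get(host, "ok")
        if SEVERITY[status] > SEVERITY[worst]:
            worst = status
        per_host[host] = worst
    # The global view is the most severe status seen on any host.
    overall = max(per_host.values(), key=SEVERITY.__getitem__, default="ok")
    return per_host, overall
```

Views for subgroups would follow the same pattern, with `records` filtered to the hosts of one group before calling `summarize`.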

Patrol usage at DESY Zeuthen

  • presently approximately 100 hosts controlled by patrol

  • patrol started by cron every 15 minutes

    Controlling tasks on all hosts

  • Load monitoring

  • Execution time monitoring of user processes (except batch)

  • Presence of important daemons (cron, xntp, syslog, afs, batch, …)

    Tasks on selected hosts (usually servers)

  • File system usage (/, /usr, /tmp, /home, …)

  • presence of daemons (named, sendmail, …)

    Depending on the problem appropriate actions are taken

    (mail, restart, log, …)

    Observed increased stability of services for users
