1 / 6

Status of PDC’07

Status of PDC’07. L. Betev ALICE-LCG Task Force, Aug 16, 2007. Central services - load. Central services - AliEn updates to use service aliases instead of host names Allows to extend the load balancing to all services (presently used for proxy)

purity
Download Presentation

Status of PDC’07

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Status of PDC’07 L. Betev ALICE-LCG Task Force, Aug 16, 2007

  2. Central services - load • Central services - AliEn updates to use service aliases instead of host names • Allows to extend the load balancing to all services (presently used for proxy) • Adding / upgrading of individual hosts will be transparent • Increased reliability • Job load • 300K jobs (DONE+Error) in the last 7 days • 0.5 jobs/sec ALICE-LCG TF Meeting

  3. Central services - load (2) • Most loaded hosts/services • db06a (Catalogue/Task queue DB) - average 17, peak 120 • This is the Task Queue - frequent job status changes (each job updates the DB at least 5 times) + additional remaining open connections to DB • Fixes for the above are being made • db1 (Job Optimizer, Authen, IS) - average 4, peak 50 • The Job Optimizer should go a dedicated host • The other 4 hosts are balanced very well ALICE-LCG TF Meeting

  4. Job Errors • Mostly ERROR_V due to problems with application installaton • Three sources: • Installation by JA on mixed 32/64 bit WNs/VO-box - incompatibility of libraries (3 sites) • Incomplete installation by JAs from sites with pool accounts • Shared filesystem problems during installation • Total affected - 10 sites • Fix - PackMan flag (in LDAP) allowing only the VO-box to do the packages installation for certain sites ALICE-LCG TF Meeting

  5. Resources usage ALICE-LCG TF Meeting

  6. Data transfers and staging • Smooth data transfer to GSI • One more disk server will be available tomorrow as xrootd CASTOR2 buffer at CERN • We need to step-up the installation of remote storage (difficult in August) • We should complete the instegration of the tools for data transfer and staging ALICE-LCG TF Meeting

More Related