1 / 12

CESGA Status Report

Javier Lopez, Alvaro Simon, Esteban Freire/ CESGA SA3 All Hands Meeting Barcelona. CESGA Status Report. Outline. Main Achievements since November Work on Deliverables/Milestones Issues/Problems Next Steps. Main achievements since November. Infrastructure.

Download Presentation

CESGA Status Report

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Javier Lopez, Alvaro Simon, Esteban Freire/ CESGA SA3 All Hands Meeting Barcelona CESGA Status Report

  2. Outline EGEE User Forum • Main Achievements since November • Work on Deliverables/Milestones • Issues/Problems • Next Steps

  3. Main achievements since November

  4. Infrastructure EGEE User Forum • New infrastructure based on virtual machines • This is the new common infrastructure for SA3, PPS and Production testbeds @CESGA • Fronted: • 1x Dell Poweredge 2950: 1TB RAID5 storage (golden images of all the services) • Virtual Machines: • 4x10 Dell Poweredge 1955: quad-core processors • For SA3 services we use HVM machines: they allow us to use kernel 2.4 without modification to the OS

  5. Infrastructure • Advantages • We can increase or decrease the capacity on demand • Easy to test new releases in a clean environment • Possible to roll-back failing upgrades using LVM snapshoot capability • We have produced a document explaining our infrastructure: • https://swe-wiki.egee.cesga.es/cgi-bin/moin.cgi/XEN3_Virtual_Machines_-_CESGA-EGEE • More detailed documents also available on request

  6. SGE • NOTE: This task is a joined effort between IC, LIP and CESGA • Integration in LCG CE ready • RPM packages tested • Yaim scripts developed • Documentation updated • Ready for certification • Re-Distribution of Grid Engine: • Reviewed license and sent to SA3 list for second review • Re-distribution allowed

  7. Testing SGE • Based on the Torque/maui tests developed @GRNET • Adapting the scripts • Preliminary results available • Very slow submission • Optimizing SGE configuration

  8. SGE on gLite CE • IP ready (same as in LCG) • Meeting with BLAH developer (David Rebatto) to understand the work required • Required scripts are being developed @IC • Testing will be done @CESGA • APEL ready (Dave Kant)

  9. Assigned Tasks • Task #4759: Testing SGE • In progress • Task #4600: Provide updated RPMs for SGE jobmanager and installation guide • Ready for certification

  10. Issues/Problems • Job submission tests: • Preliminary results show that optimization of default SGE configuration required • Improve SGE configuration to send back to CE stdout and stderror files • Modifications required to run SGE on para-virtual machines

  11. Next Steps • SGE is working on a lcg-CE. Next step: CERTIFICATION • Add RPMs to SA3 repository • Integrate SGE yaim scripts • Tests for SGE lcg-CE (later they will be reused for glite-CE) • SGE on glite-CE: Started on integrating support for BLAH • Other local middleware elements (GIIS, YAIM) basically remain unchanged for this glite-CE flavour. • APEL ready • Support for external SGE_QMASTER (IC and CESGA use this type of configuration in production) • GridICE sensors for SGE

  12. References EGEE User Forum • Xen Virtualization @CESGA • https://swe-wiki.egee.cesga.es/cgi-bin/moin.cgi/XEN3_Virtual_Machines_-_CESGA-EGEE • SGE Wiki Page • https://twiki.cern.ch/twiki/bin/view/LCG/ImplementationOfSGE

More Related