1 / 6

D0 Grid Data Production Initiative: Coordination Mtg

Version 1.0 (meeting edition) 28 May 2009 Rob Kennedy and Adam Lyon Attending: RDK, …. D0 Grid Data Production Initiative: Coordination Mtg. Overview. News and Summary Close-out Prep Meetings D0 CAB: 5/22. This was somewhat brief, but very positive.

chuong
Download Presentation

D0 Grid Data Production Initiative: Coordination Mtg

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Version 1.0 (meeting edition) 28 May 2009 Rob Kennedy and Adam Lyon Attending: RDK, … D0 Grid Data Production Initiative:Coordination Mtg D0 Grid Data Production

  2. D0 Grid Data Production Overview • News and Summary • Close-out Prep Meetings • D0 CAB: 5/22. This was somewhat brief, but very positive. • Initiative: 6/04. Close-out with Lessons Learned, etc. • Coordination Meetings • Re-located in WH9SE Libra • Remaining meetings: 5/24 (today), 6/04 • Agenda • News • SAMGrid and Condor Upgrades • AOB, Action Items

  3. D0 Grid Data Production Topics Remaining to Cover • 5/14: CAB Configuration  First Priority = resource string passing (M Mengel working on it) • Optimize use of CAB resources, beyond just d0farm and CAB2. (providers request) • Retain Turn-around/Response Time for Analysis (customers/users request) • Simplify Production Coordination, Improve Processing Flexibility (all request) • How to Proceed from Here? First, resolve resource string passing, then decide. • 5/21: Monitoring  Organize request(s) to fill out and maintain the monitoring plots. • Assess what we all have now, where our gaps are, what would be most cost-effective to address • See Gabriele’s white paper on D0 grid job tracking (includes monitoring, focus on OSG). (CD DocDB 3129) • May also reference Ruth’s look into Monitoring which produced an inventory (CD DocDB 3106) • How to Proceed from Here? Action Item list created. • 5/28: Condor Releases, SAMGrid Upgrade THIS WEEK • Deferred Task (due to All-CAB2 Processing) from Initiative: Release new SAMGrid with added state feature • Upgrade production release of Condor with fixes – modify Condor/VDT upgrade procedure? • How to Proceed from Here? • AOB: Transition of samgrid.fnal.gov support from FGS to FEF; Action Items • 6/04: Close-Out • Lessons Learned, Close-out Festivities Plan

  4. D0 Grid Data Production SAMGrid, Condor Upgrades • New SAMGrid version with added state feature – can proceed to release now • Reminder of what the added state feature entails. Status? • Upgrade of Condor – modify Condor/VDT upgrade procedure to allow Condor-only upgrades? • Now: New Condor released in VDT packaging after SAMGrid dev testing. • Critique: SAMGrid Dev testing tasks become bottleneck, significant effort and risk to VDT upgrades leads to very infrequent VDT (Condor) upgrades. • Some extra effort now to enable for less effort later, and more frequent Condor upgrades • 1. Condor layered on top of VDT – change to deployment configurations. • 2. SAMGrid sanity check procedure runnable by REX to test new SAMGrid/Condor combination to reduce SAMGrid developer involvement. Effort to formalize this. • 3. Practice “new release” with old software to validate procedure at some level. • How to Proceed from Here? • How to agree to the configuration/procedure? • Approximate time table for deployment?

  5. D0 Grid Data Production AOB, Action Items • Transition of samgrid.fnal.gov support from FGS to FEF • Agreement to proceed by Jason and Glenn. • FEF willing to take on machine without doing a full re-install, but would like to poke around before signing off on the transition. Machine is 2.5 years old now, so may not be an issue anyway in 6 months. • Next Step: FGS to contact Jason,Glenn when ready to arrange root access so they can assess config. • Monitoring • Rob to talk to Jason about formalizing a change request for monitoring of PBS/CAB per meeting discussion • Jason and Glenn agree that collecting monitoring requests together into a somewhat formal change request makes sense. They will entertain a prioritized request list, estimate the effort required for the requests, and then “we” can meet to go over what can/will be done. They also ask for a brief use-case for each plot to help them consider equivalent means of supporting the same usage. • Margaret to talk to Keith about possibility of using existing raw data to characterize job idle time etc. • News? • Mike to talk with Robert about data transfer related plots: distinguish external (Enstore) vs internal (Data Production) issues • News? • Other Topics? • … • Close-out Party Suggestions • Rob to arrange, looking for input. Lunch on Friday 6/5?

  6. D0 Grid Data Production Discussion Summary • Topic… • …

More Related