1 / 16

Software Sharing at Fermilab: Experiences from Run II, KITS and Fermitools

Software Sharing at Fermilab: Experiences from Run II, KITS and Fermitools. Ruth Pordes, Lauri Loebel Carpenter, Elizabeth Schermerhorn Fermilab Computing Division. Software at Fermilab is Shared as a Result of :. Orchestration Planning and design from the get-go

harkness
Download Presentation

Software Sharing at Fermilab: Experiences from Run II, KITS and Fermitools

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Software Sharing at Fermilab:Experiences from Run II, KITS and Fermitools Ruth Pordes, Lauri Loebel Carpenter, Elizabeth Schermerhorn Fermilab Computing Division

  2. Software at Fermilab is Shared as a Result of : • Orchestration • Planning and design from the get-go • Externally mandated sharing • Capturing a popular Tune • Independent use of available software and then sharing of support and experience • Discovery and Advantage of Common Use • Variation on a Theme • Acquisition or Development for single User Group or Experiment and then adoption by more than one • Availability and Dissemination; Migration of Users to new Groups or Experiments (Talking in the Cafeteria; PR on the Web)

  3. Management - of both experiments and Computing Division - encouraged use of Same DBMS system: Offered resources for central support Increase in reliance on databases for data taking and processing Benefits if Offline and Online use common systems Each Experiment needs Run Information File Catalog Calibration Data Hardware Location and State Luminosity Information Each needs Development and production repository systems Synchronization between Online and Offline 1) CDF/D0/CD Joint Project - Support DatabasesOracle for Run II

  4. Each experiment has several Oracle instances for management and support: Offline Databases Online Databases Production System: Production Repository Integration Repository Production System: Production Repository Integration Repository Synchronization Development System: Development Repository Development System: Development Repository Backup Repository (RMAN) Monitoring and Statistics Repository (OEM)

  5. Design Tools and Methodologies Backups Database Monitoring Packaging and testing of software Contact with Company Standards, Conventions, Reviews Possibility for common off-site database strategy New experiments - e.g. Minos - can take advantage of central pool of knowledge and support Central management of licenses and maintenance feasible Sharing and its Benefits

  6. Implementation Environment D0 Online - NT for Level 3, Python D0 Offline - C++, Python CDF Online - Java CDF Offline - C++ Schedule and Priority differences Affects time available to develop consensus Affects resources available for general solutions Strategic Directions D0 3 tier architecture avoid need to distribute/support Oracle Clients on all OS Corba as middle tier Database Server and code generation CDF 2 tier architecture to avoid bottlenecks and use Oracle for multiple user access control 3rd tier of flat files Automated header and C++ code generation but - sharing not 100% achievable -experiment specific infrastructure and policy

  7. 2) Common Tools Shared through Common Product Packaging, Management and Distribution • UPx family of Products • Concepts mature and accepted • Still in use for Run II; modified so no longer requires root access to install or use • Records distributions from central FTP site • In use throughout the Laboratory - da, farms, analysis servers, desktops, information and database systems, offsite collaborators • Full documentation • Central resources for maintenance and support • Consistency, standardization, automation possible

  8. Services Central Distribution Service Configuration Management for Qualifiers and Versions Framework to capture Complete Build and Execution Environment Monitoring and Census of installed and available software Strategy Include base support as part of “canned” OS distributions - e.g. Fermi RedHat Linux Unix (User) Product Support - UPx KITs Distribution Node UPS Database UPD UPD Local Machine Local Machine UPD UPS Database UPS Database UPS Database UPS Database UPC - Census

  9. Must be aware that: Benefits: • Resources to maintain infrastructure - ongoing commitment • Bookkeeping of “benefit” of central support vs “cost” of distributed support difficult • 20 x 10% = 2 FTEs? • People wary of “dictatorship” • Infrastructures can “calcify” and “constrict” • Rules and Conventions allow automation - e.g. build scripts • Many people know tools and rules - easier support coverage • Documentation and • methodologies “general”

  10. Management Support and Information - existing Recommendations Mail Archive for FAQ and Support Community Support Mail Lists

  11. Management Support and Information - future - better integrated information systems Remedy HelpDesk Tracking and FAQ Repository Links to Information Systems of Computer Hardware, Software, System Admins

  12. Desktops: ~500 Servers: ~200 Experiments: ~20 Offsite Distributions: ~50 Number of Products in Kits ~400 Number of Distribution Repositories ~5 Number of UPS databases per Experiment ~3 Infrastructure Support Staff ~ 2 FTEs Scale:

  13. How RPM, AutoRPM and UPx integrate and play together on Linux - clear that we want them to How to support “Fermi-Environment” and “Other Experiments” on same Desktop. E.g. D0 and CMS Increasing Offsite Component NT Guaranteed Support Services and Response when Load Shared Knowledge base must be accurate and complete Nimbleness in Sharing Future Needs Thought

  14. 3) Publishing Fermilab Products to the Community • Fermilab under URA/DOE - “ownership” constraints prohibit us from putting software on an open FTP server for the world. • Fermitools allows us to publish software under already agreed to Terms and Conditions (by the lawyers) • Allows publication of software for wider community and ability to offer support and help • Provides mechanism and pressure points to promote transition to more open source environment • Slowly add products as they become available - most recently Trace tool for Linux; oracle query and mail list tools - from whatever group or individuals at Fermilab • Encourages sharing of software From Fermilab as well as To Fermilab

  15. Fermitools cont. • Product infrastructure encompasses Fermitools Products - available through anonymous FTP • Hope this will facilitate moving more Fermilab s/w to public domain and open source • Encouragement to include demos and presentation materials

  16. aSummary • At Fermilab, we operate several different scenarios for sharing software - not just a single model of use • Experience shows that while achieving consensus and sharing is much work the benefits are sometimes acknowledged • Individual taste and biases can still cause divergence and support loads • Future requires better integration of existing methods with linux and NT • URLs of projects referred to: • http://RunIIcomputing.fnal.gov • http://www.fnal.gov/cd • http://www.fnal.gov/fermitools

More Related