1 / 1

Challenge : How to reprocess experiment data in 25 years, using old software version?

Long-term preservation of analysis software environment. Challenge : How to reprocess experiment data in 25 years, using old software version?. Virtual Machine. CernVM Filesystem. Read-only, globally distributed file system optimized for software distribution.

akando
Download Presentation

Challenge : How to reprocess experiment data in 25 years, using old software version?

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Long-term preservation of analysis software environment Challenge : How to reprocess experiment data in 25 years, using old software version? Virtual Machine CernVM Filesystem • Read-only, globally distributed file system optimized for software distribution. • Based on standard protocols (ex. HTTP). • Highly scalable, redundant and reliable via a multi-Tier infrastructure. • Already used in production by LHC experiments. • Linux distribution based on SL5. • Supports all popular hypervisors. • Small footprint. (300 MB) • Comes in four editions targeting different use cases. (Batch, Head node, Desktop and Basic) • Flexible contextualization. CernVM - based data preservation Assuming you have preserved your data files and recorded in the bookkeeping database the version string of CernVM you used, you will be able to recreate the same CernVM image, appropriate for a future virtualization technology and reprocess the data in the same way 25 years later, • Minimal cloud middleware is required to recreate virtual cluster for data reprocessing. • CernVM can be contextualized using a small subset of EC2 API that allows it to be deployed on public or privateclouds (OpenNebula, OpenStack, Eucalyptus etc..). • CernVM uses Conary software repository for automated, strict component versioning. • You need only the CernVM version string to rebuild CernVM image on demand. Bookkeeping Private Cloud More info : A practical approach to virtualization in HEP: http://dx.doi.org/10.1140/epjp/i2011-11013-1|| CernVM Homepage : http://cernvm.cern.ch|| May 2012

More Related