Data Preservation at Rutherford Lab


Data Preservation at Rutherford Lab. David Corney, 9th July 2010, KEK. 700+ batch workers; 500+ storage servers; 200+ service nodes; 50+ test systems; 2 x 10000-slot tape silos; 67 tape drives; 1 chassis switch; 14 stacks (50+ switches); 30+ management switches; 80+ switched PDUs.


Data Preservation at Rutherford Lab

David Corney

9th July 2010 KEK

Tier1 Fabric
  • 700+ batch workers
  • 500+ storage servers
  • 200+ service nodes
  • 50+ test systems
  • 2 x 10000-slot tape silos
    • 67 tape drives
  • 1 chassis switch
  • 14 stacks (50+ switches)
  • 30+ management switches
  • 80+ switched PDUs
  • Environmental sensors
  • Fabric Team:
    • 7 systems administrators and 1 hardware technician

  • Basic curation: tape migration to new media every 3 years (cost driven), and routine checksumming
  • No current plans for longer-term curation?
    • A good time to start making plans
  • The data is the easy bit; we’re just the data centre
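
The routine checksumming mentioned above can be illustrated with a minimal sketch. The slides do not say which algorithm or tooling the data centre uses, so SHA-256, the function names and the in-memory manifest are all assumptions made for illustration only.

```python
import hashlib
from pathlib import Path


def file_checksum(path, algorithm="sha256", chunk_size=1024 * 1024):
    """Return the hex digest of a file, reading it in fixed-size chunks."""
    digest = hashlib.new(algorithm)  # algorithm choice is an assumption
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_manifest(manifest, algorithm="sha256"):
    """Re-check files against the digests recorded when they were stored.

    `manifest` is a hypothetical mapping of file path -> expected hex digest.
    Returns the paths that are missing or whose content no longer matches.
    """
    failures = []
    for path, expected in manifest.items():
        if not Path(path).is_file() or file_checksum(path, algorithm) != expected:
            failures.append(path)
    return failures
```

Running a check like this before and after each media migration would be one way to confirm that a copy is bit-identical to its source; a real archive would normally keep the manifest in a catalogue or database rather than in memory.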
Data management for facilities at the RAL site
  • ISIS (neutron source)
    • All data ever - 30 years
  • Diamond Light Source
    • 2 years of operation, 20+ beamlines
    • Automated data handling facility and metadata capture
    • Upgrading infrastructure from SRB to Storage-D/iRODS
  • Central Laser Facility
  • Space Science Centre
Other related DP activities
  • Data preservation for Medical Research Council (Policies)
  • Data archive for Biotechnology and Biological Sciences Research Council (10 years)
  • Data archive for Natural Environment Research Council
  • Investigating commercial product from Tessella (“Safety Deposit Box” – PRONOM)
  • Site wide panel (still) developing “Data Policy”
Specific goals for Research Datasets Testbed (SCAPE)
  • Create sample shareable datasets from large scientific facilities such as neutron sources and synchrotron radiation sources, with associated instruments, serving wide scientific communities.
  • Develop preservation scenarios for typical preservation processes which represent the diversity of the scientific research lifecycle
  • Identify requirements for the tools and services (and underlying infrastructure) needed to meet the challenges of this domain
  • Implement the research data scenarios in the SCAPE environment in order to test, evaluate, demonstrate and ensure the practicality and applicability of the SCAPE developments to this problem area
OAIS update – clarifications and improvements (David Giaretta)
  • Audit & Certification built on top of OAIS concepts and ISO audit processes
    • Leads to cycle of continuous improvement
    • Audit and certification organisation being created
    • European Audit Framework being set up
  • PARSE.Insight provided huge survey responses to understand what people are concerned about and the major threats to digital holdings
  • CASPAR provided tools and infrastructure components (i.e. usable across disciplines). Validated through accelerated lifetime tests: changing hardware, software, environment and tacit knowledge of users. Used data from various science disciplines (STFC and ESA), cultural heritage (UNESCO) and contemporary performing arts (IRCAM, INA, CIANT, University of Leeds)
  • VISION to 2030 from EU High Level Expert Group on Science Data Infrastructure – to be published in Sept/Oct