200 likes | 296 Views
The Research Data Archive at NCAR supports climate and weather research by curating, organizing, and creating access to research datasets. Established in the 1960s, it offers over 600 datasets and 4M files, serving 7000 unique users annually. Key data categories include operational models, observations, and remote sensing data. The archive provides assistance through data-selection interfaces and automated processing to cater to diverse research needs. Reanalyses, observations, and analyses form the core data offerings with a focus on historical and ongoing data collections. Future plans involve expanding content, improving archives, and enhancing access options.
E N D
Data Support for Climate Research from NCAR Steven Worley Doug Schuster Joey Comeaux 4th ACRE Workshop, EC FP7 EURO4M, ERA-CLIM, AHRC Historic Weather KNMI, De Bilt, The Netherlands, 21-23 September 2011
Topics Research Data Archive at NCAR Data Service Communities Data Content Reanalyses Observations Analyses Future
Research Data Archive (RDA) at NCAR • Purpose - support climate & weather research by curating, organizing, documenting, and creating access to research datasets • To extent possible, all data and services are free and open • Take Away Metrics • Established in 1960s • 600+ datasets, 4M files, 600 TB • 7000 unique users annually • Many datasets routinely updated • daily, weekly, monthly
Research Data Archive (RDA) at NCAR Core Data Categories • Operational and Reanalysis model outputs Meteorological and Oceanographic Observations Remote Sensing Observations • Topography/Bathymetry, Vegetation, Land Use
Data Service Communities RDA Infrastructure Components, w/o Network DAV GLADE RDA Web HPSS SC
Self-help Users GLADE, 155 TB, 125 Datasets HPSS, 600 TB, All (600+) Datasets Automated Services move data from HPSS to GLADE as needed DAV GLADE RDA Web HPSS SC Self-help Users File Flow
Assisted Users, One-off Requests Assisted User DAV GLADE RDA Web • Assistance: • Data-selection interfaces and automated processing • Data processed by RDA staff HPSS SC Computational Service
NCAR Users – two pathways Reference Only DAV 1 1 NCAR Users GLADE RDA Web 2 2 1 2 HPSS SC
Data Content • Reanalyses • Observations • Analyses
Reanalyses • 10 Major Collections, providers NCEP, ECMWF, JMA, NOAA-CIRES • Start Year: aligned with satellite era, exceptions: NNR, ERA-40, 20CR • ERA-I recently pre-pended to 1979, not in RDA now • End Year: Most are ongoing • Resolution: • Highest shown here – lower resolutions also available • Increasing resolutions => more storage and need for subsetting • E.g. NCEP CFSR v1 is 80TB, online
Reanalyses • Format • Native GRIB1 or GRIB2 • Conversion – limited capability now • Subsetting • By variable/parameter – only newest ones • By date – newest ones on specific days, some capability from file organization • By vertical level – only newest ones • Geospatial – not available
Observations • 5 notable long-term collections: ICOADS, ISPD, UADB, WMSSC, NCEP Surf. Ops. • Start Year: push back to the 1600-1700’s • End Year: Mostly ongoing • Format: Many Native, conversion to ASCII and one to netCDF • Subsetting: Generally, complete temporally and geo-spatially • Truth – This is really time consuming work
Observations Examples: ICOADS, ISPD, WMSSC • Start Date: ~January 2010 • ICOADS, 30% requesting subsetting • Monthly updates have increased routine file access • ISPD, 50% users subset, probably for ASCII output • WMSSC, many users, small data volume
Observations: UADB Update • Basic Features • ~ 2,500 Total Stations from 30+ sources • Pibals, 1920-2011 • Raobs, 1943-2011 > 700 stations with 30+ years
Observations: UADB Update • Status • Homogeneous units and format • Updated monthly with NCEP operational data • Interface access; two sources 1973-2011 • http://dss.ucar.edu/datasets/ds370.0 • All data available by request • Goals • Complete access to CHUAN • “Corrected" stations, ingested, undergoing validation • Improve station history metadata • Form longer time series, upon user selection • Extend interface availability to earliest records • Complete QC research and implement • Add aircraft data • Coordinate with NCDC, bilateral completeness
Analyses • Surface: SST, SLP, ocean fluxes; Ocean 3D: T&S, and V • Start Year: push back to 1850’s at best • End Year: Mostly ongoing • Format: a mixture • Subsetting: not extensive
Future • Complete ERA-I back to 1979 • Add products from ERA-CLIM Project? • CFSR-lite • NOAA, 1949(?)-> current, T126 • Retirement of NCEP/NCAR Reanalysis • Arctic System Reanalysis, OSU/UA/NCAR • Initially, 11 years • Improvements to UADB • More subsetting and more format conversion • Expansion into interoperable web access
Wrap up Research Data Archive at NCAR • Well designed for climate research support Data Service Communities • Convenient for many worldwide Data Content: Reanalyses, Observations, Analyses • Substantial with increasingly easy access options Future • Plans to grow content, and improve current archives