1 / 22

Data Quality

Data Quality. Ed Chapman OOI Chief Systems Engineer. Goal. Address Areas for Recommendations: #2 “Data Policy, Data Quality Protocols and Procedures” and # 4 “Data sampling rate strategy development and management ” Specific topics:

kathydavis
Download Presentation

Data Quality

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Data Quality Ed Chapman OOI Chief Systems Engineer

  2. Goal Address Areas for Recommendations: #2 “Data Policy, Data Quality Protocols and Procedures” and #4 “Data sampling rate strategy development and management ” Specific topics: “Shoreside & at-sea instrument and subsystem quality/calibration procedures/protocols, automated thresholds/flags, manual data QC, exception management, and long term time-series data sampling rate management.”

  3. Shoreside & at-sea instrument and subsystem quality/calibration procedures/protocols

  4. Pre-Deployment Procedures 1. Incoming Inspection Completed for all Instruments and Platforms Verifies configuration and state as delivered 2. Calibration Records Records for each instrument or platform are archived in Vault 3. Quality Conformance Tests (QCT) Completed for all Instruments and Platforms Confirms basic functionality (“bench test”), detects failures or damage 4. Requirements Verification Completed for each instrument type or Class Validate first article against requirements and specifications 5. Platform Integration and Test Platform operation verified using platform controller End-to-End communication verified, instrument to shore station

  5. The data pipeline starts with the instruments Pre-deployment procedures (examples from Pioneer 1) Instrument acceptance • Visual inspection and inventory • Bench test for basic function • Verify against requirements/specs • Archive calibration information  Platform build • Construct according TDP • Correct problems if necessary, document changes

  6. The data pipeline starts with the instruments Pre-deployment procedures Platform Burn-in • Operate in benign environment (e.g. LOSOS high bay) • Operate in representative environment (e.g. WHOI dock) Instrument Burn-in • Verify plausible values (e.g. winds about 10 m/s from the East) • Compare like instruments (e.g. two BP instruments on tower)

  7. At-Sea Procedures: Pioneer-1 * 1102-00300 Protocols and Procedures for OOI Data Products: QA, QC, Calibration and Physical Samples Platform monitoring Full platform function available when within WiFi range Communication with shore station when out of range Shipboard underway sampling Meteorology time series from Knorr bow mast Thermosalinograph time series from Knorr system Bathymetry from echoshounder and multi-beam Shipboard CTD profiles Post-deployment casts at each of 3 sites for Pioneer-1 Seabird 9-11 with DO, Fluoro, beam x-miss, turb, PAR Physical Samples Post-deployment casts at each of 3 sites for Pioneer-1 Salinity and Oxygen completed onboard Nitrate/Nitrate, Chlorophyll and Carbon system done in shore labs

  8. At-sea protocols Deployment and post-deployment procedures Deployment documentation • Pre-deployment checklists • Mooring deployment logs Post-deployment data assessment • Adjacent CTD cast(s) (temp,sal,oxy,chl,turb) • Shipboard systems (met, surface t-sal, ADCP) • Water samples and lab analysis (sal,oxy,chl,etc)

  9. At-sea procedures Post-deployment procedures Deployment documentation • Pre-deployment checklists • Mooring deployment logs Post-deployment assessments • Adjacent CTD cast(s) (temp,sal,oxy,chl,turb) • Shipboard systems (met, surface t-sal, ADCP) • Water samples and lab analysis (sal,oxy,chl,etc) • Quick-look report • Lessons learned

  10. Automated QC Thresholds and Flags L1b and L2b Instrument Driver and Agent Permanent storage Data Product Algorithm Calibration Table Secondary Post-Deployment calibration values POLYVAL Algorithm User Secondary Post-Recovery calibration values POLYVAL Algorithm QC Flags Interpolation QC algorithms (range, spike, stuck, gradient, trend, combined) Lookup Tables

  11. Automated QC Checks • Seven QC Checks • Global Range Test • Local Range Test • Spike Test • Stuck Value Test • Trend Test • Gradient Test • Combined QC Flags

  12. When? • QC Checks are run on a periodic basis • when data is ingested from the uncabled instruments • Continuously from the cabled instruments • QC Flags are stored.

  13. Automated QC actions • PSs create Look Up Tables and values are uploaded through the UI as csv files

  14. Automated QC Updates • If new values are uploaded for any of the QC Flags those values overwrite the original values. • OOINet reruns the QC check for all data products and creates and stores new QC Flags • QC is “value added” so we don’t retain prior flags

  15. Human in the loop QC L1c and L2c Permanent storage Instrument Driver and Agent Data Product Algorithm Calibration Table Secondary Post-Deployment calibration values POLYVAL Algorithm User Secondary Post-Recovery calibration values POLYVAL Algorithm Interpolation QC algorithms (range, spike, stuck, gradient, trend, combined) Lookup Tables Human in the loop L1c

  16. Human in the Loop QC Actions • PS periodically downloads an L1 or L2 product • PS performs HITL QC locally on PS machine • PS uploads L1c or L2c values, and HITL metadata (provenance, etc.) into OOINet • User who downloads L1 or L2 product to which HITL QC has been applied will see L1c or L2c variables in the downloaded time series • Only for the time range for which the HITL QC was applied

  17. Human in the Loop QC Updates If new HITL values are uploaded for a time period that has already been uploaded those values overwrite the original values.

  18. Relationship of QC level a, b, and c

  19. Database L0 L0 L1 Data Product Algorithm L2 Data Product Algorithm Primary Calibration Function L2b L1a Secondary Calibration Functions L1b QC Algorithms QC Algorithms Human In The Loop Human In The Loop QC flags L0 L1a L1b and QC flags L1c L2b L2c GUI User

  20. exception management

  21. Long term time-series data sampling rate management

  22. Questions? Specific topics: “Shoreside & at-sea instrument and subsystem quality/calibration procedures/protocols, automated thresholds/flags, manual data QC, exception management, and long term time-series data sampling rate management.”

More Related