

Data Panel: Do you know where your data is? ASLI, 21 January 2010

Steven Worley

Bob Dattore

National Center for Atmospheric Research

AMS Ad Hoc Committee on Data Stewardship Prospectus, August 2009

Statement of Need # 4

“Develop a plan for citing data referenced in publications and preserving data links for the long term”

Committee continued - Problems

  • Data not traceably cited or even available

  • Publishers do not have rigorous process to handle references held at data centers

  • Data providers have not created and adopted a standard reference coding system

    • DOIs – e.g. ORNL

    • International DOI Foundation – initiative of German National Library of Science and Technology

    • CLADDIER (Citation, Location, and Deposition in Discipline & Institutional Repositories) – project @ BADC

  • Any data reference scheme will fail if the data are not publicly available and are not in a long-term archive

Committee continued - Ramifications

  • Without citation that accurately defines the data used, researchers cannot validate or easily advance understanding starting from a publication

    • This can slow scientific discovery and weaken the standing of published assertions as fact

Committee continued - Recommendations

  • Collaboration

    • Establish a process whereby librarians, publishers, and AMS editorial boards are teamed with data providers and data centers, and organizations already addressing this challenge (e.g. AGU, Oak Ridge National Laboratory, etc.), to develop standard schemes for referencing data in publications

Committee continued - Recommendations

  • Set Policy

    • Institute a publication policy for data stewardship in the journals by defining recommendations for authors and setting peer review criteria that focus on the adequacy of the data references

Committee continued - Recommendations

  • Awareness and Recognition

    • Use AMS statements or guidelines to emphasize the importance of stewardship and establish ways to recognize scientists who produce publicly valued data and follow the guidelines

Challenges @ archive centers, e.g. NCAR

  • Get agreement/approval organization-wide for alpha-numeric coding

    • More than five data groups serve data

    • Need coordination with library

    • Coordinate with publishers

    • Align with other organizations

  • Establish data persistence policy / requirements

    • Minimum time period for data preservation

Challenges @ data centers

  • Build an organization-wide mapping of citation tags (e.g. DOI) to URL addresses

    • URLs are fragile and may change over time, but the citation tag must remain immutable.
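
The tag-to-URL mapping described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not a description of any system in use at NCAR: the class name `TagResolver`, its methods, the tag `example-tag-0001`, and the `data.example.org` URLs are all hypothetical. The point it shows is that a tag is registered once and never changes, while the URL behind it can be updated when the data move.

```python
from dataclasses import dataclass, field


@dataclass
class TagResolver:
    """Hypothetical registry mapping immutable citation tags to current URLs."""

    _urls: dict[str, str] = field(default_factory=dict)

    def register(self, tag: str, url: str) -> None:
        # A tag may be issued only once; reusing it would break older citations.
        if tag in self._urls:
            raise ValueError(f"tag already registered: {tag}")
        self._urls[tag] = url

    def relocate(self, tag: str, new_url: str) -> None:
        # The data moved: update the URL and leave the tag untouched.
        if tag not in self._urls:
            raise KeyError(f"unknown tag: {tag}")
        self._urls[tag] = new_url

    def resolve(self, tag: str) -> str:
        # Return the location where the data live today.
        return self._urls[tag]


resolver = TagResolver()
# The tag and both URLs are made-up placeholders.
resolver.register("example-tag-0001", "https://data.example.org/old/path")
resolver.relocate("example-tag-0001", "https://data.example.org/new/path")
print(resolver.resolve("example-tag-0001"))
```

A publication would cite only the tag. Readers resolve it through the registry, so a server reorganization requires one `relocate` call rather than corrections to published papers.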

Data Evolution

  • Critical difference between data and traditional publications - data collections have a life cycle

    • New “improved” versions can be created

      • Easy case

    • Corrections small and large are made

    • Time and space domains can be appended

  • Metadata grows more comprehensive with usage/feedback, evaluations, publications

Data Evolution

  • How and when should data citation tags change? Answering the versioning question.

    • Absolute policy – impossible?

    • Sensible guidelines @ organizations

      • Across organizations?

  • Superseded versions cannot disappear

    • Once it is cited it must remain available

    • Need libraries capable of monitoring citations

      • Archives need authoritative opinion before taking action on “out-dated” versions.
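
One way to picture the rule that superseded versions cannot disappear is a registry in which every version carries its own tag and a newer version only marks the older one as superseded. The sketch below is an assumption-laden illustration, not a policy or system from the presentation: `DatasetVersion`, `VersionRegistry`, the tags `ds-v1` and `ds-v2`, and the `data.example.org` URLs are hypothetical names.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class DatasetVersion:
    tag: str                             # immutable citation tag for this version
    url: str                             # current location of this version
    superseded_by: Optional[str] = None  # tag of the newer version, if any


class VersionRegistry:
    """Hypothetical registry in which cited versions are never deleted."""

    def __init__(self) -> None:
        self._versions: dict[str, DatasetVersion] = {}

    def publish(self, tag: str, url: str, supersedes: Optional[str] = None) -> None:
        if tag in self._versions:
            raise ValueError(f"tag already in use: {tag}")
        if supersedes is not None:
            # Flag the older version, but keep it so existing citations resolve.
            self._versions[supersedes].superseded_by = tag
        self._versions[tag] = DatasetVersion(tag, url)

    def resolve(self, tag: str) -> DatasetVersion:
        # Return exactly the version that was cited, even if outdated.
        return self._versions[tag]

    def latest(self, tag: str) -> DatasetVersion:
        # Follow the chain of superseding versions to the newest one.
        version = self._versions[tag]
        while version.superseded_by is not None:
            version = self._versions[version.superseded_by]
        return version


registry = VersionRegistry()
# Tags and URLs are made-up placeholders.
registry.publish("ds-v1", "https://data.example.org/ds/v1")
registry.publish("ds-v2", "https://data.example.org/ds/v2", supersedes="ds-v1")
print(registry.resolve("ds-v1").url)  # the cited version is still reachable
print(registry.latest("ds-v1").tag)   # readers can also find the newest version
```

The registry deliberately has no delete operation. Deciding when a correction or an appended time range warrants a new tag remains the policy question raised above, and the code does not answer it.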


  • We need a data sharing movement – a three-pronged effort.

    • Funding agencies make data sharing a reviewable criterion, augmented with follow-up monitoring

    • Archive centers put immutable tags on long-term datasets

    • Publishers accept articles only if review confirms they have adequate data citations


  • Publishers need to accommodate “data papers” – not a new idea.

    • Benefits

      • Credits data providers, supporting career advancement

      • Foundation for monitoring usage of data

      • Informs users of what is available

      • Kick-starts the data citation process


  • SCOR/IODE Workshop on Data Publishing, Oostende, Belgium, 17-19 June 2008. Paris, UNESCO, 23pp. 2008. (IOC Workshop Report No. 207)

  • Lowry, R. and P. Pissierssens, A New Approach to Data Publication in Ocean Sciences, EOS, Vol. 90, Number 50, 15 December 2009

  • Policy on Referencing Data in and Archiving Data for AGU Publications,

  • Cook, R.B., Citations to Published Data Sets, Fluxletter, Vol. 1, Number 4, December 2008,

  • How to cite ORNL DAAC products, Citation Style,

    • ORNL, doi:10.3334/ORNLDAAC/547
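
As a small illustration of how a DOI-style citation tag such as the ORNL example above can be turned into a resolvable link, the sketch below strips the `doi:` prefix and prepends the public `https://doi.org/` proxy. The helper name `doi_to_url` is hypothetical, the validity check is deliberately rough, and the sketch assumes the DOI contains no characters that need percent-encoding.

```python
def doi_to_url(citation: str) -> str:
    """Convert a 'doi:10.xxxx/suffix' string to a doi.org resolver URL."""
    doi = citation.strip()
    # Drop the optional 'doi:' label used in printed citations.
    if doi.lower().startswith("doi:"):
        doi = doi[4:].strip()
    # Rough sanity check only: DOIs start with '10.' and have a prefix/suffix.
    if not doi.startswith("10.") or "/" not in doi:
        raise ValueError(f"not a DOI: {citation!r}")
    return "https://doi.org/" + doi


print(doi_to_url("doi:10.3334/ORNLDAAC/547"))
# prints https://doi.org/10.3334/ORNLDAAC/547
```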