1 / 10

Data Foundations And Terminology (DFT) IG

Data Foundations And Terminology (DFT) IG. Data ← Statements about data (metadata), document about data ^ ← Observations producing data ← Statements about Observations | (also metadata) Data Processes.

Download Presentation

Data Foundations And Terminology (DFT) IG

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Data Foundations And Terminology (DFT) IG Data ← Statements about data (metadata), document about data ^ ← Observations producing data ← Statements about Observations | (also metadata) Data Processes DFT IG Breakout Session P 10 Breakout - Th, 21 Sept, 9:00 – 10:30. Co-Chairs DFT IG : Gary Berg-Cross & Raphael Ritz

  2. Overview of Objectives for P-10 1. Updates and Continue IG discussion – Who is completing work and has vocabularies? How do they relate to each other? We now have PIDs for each term, who wants to use them? What are the psychological and linguistic dimensions involved?..... 2. Facilitate community discussion on RDA/group core concepts Help systematize the already large body of domain definition work on terms and their meaning using a rationalized “consensus” knowledge of domain experts, especially as involved in RDA’s efforts. Collaborate and coordinate with other vocabulary efforts 3. Solicit additional data use case scenarios to illustrate what areas of work they plan on using the models and vocabulary for. 4. Continued extension to Domain Vocabulary What are the needs and what services can be provided? Thesaurus service may help some but others need something stronger and may be able to leverage activities between groups. Exporting our vocabulary to graph form repository.

  3. Concept map overview of Core TermsBroadening the Discussion (Stepwise or Scope-wise) Digital Data Management including unregistrered (is a broader concept) Digital Object Management (registered, digital data) Where are datasets???

  4. Agenda DFT Objectives & Overview (Gary Berg-Cross - handout) Vocabulary Updates & liaison relation to other RDA Groups for candidate vocabulary items. EU Commission comments to the effect that concept development which would require some discussion across groups.  2. Tool Update (Raphael Ritz) 3. Working relation with MIG, Data in Context, Collections 4. Examples for various types of metadata 6. Issues and Interested Parties Discussion DXWG/DCAT 7. Quality Domain Vocabularies (Chem Research, Global Water, Biodiversity, Smart Cities etc.) 8. Next steps

  5. Some June – August Vocabulary Updates A data packet is a unit of data made into a single package that travels along a given network path. Some of the Metadata Elements e.g.: Originator refers to the person or organization that is the source of data. Temporal coordinates‎ - A time measurement about some physical entity using units defined as a specified duration or point in time. Physical coordinates‎ etc.

  6. Working Relation with MIG, DF IG & Chairs Collaboration Held virtual meetings Over the Summer and discussed at Chairs Meeting Metadata Element Set (provides some input for DFT vocabulary): 1. Unique Identifier (for later use including citation) 2. Location (URL) 3. Description 4. Keywords (terms) 5. Temporal coordinates 6. Spatial coordinates 7. Originator (organisation(s) / person(s)) 8. Project 9. Facility / equipment 10. Quality 11. Availability (license, persistence) 12. Provenance 13. Citations 14. Related publications (white or grey) 15. Related software 16. Schema 17. Medium / format “Comments” or notes from the P9 session in Barcelona are linked for each element.

  7. Other Data Management Vocabulary opportunities for collaboration, coordination, and de-duplication of effort. Despite decades of intensive work on controlled vocabularies (standardized sets of terms) problems remain with definitions that are central to RDM. The important need for clear definitions of RDM terms is widely recognized RDA’s Data Foundations and Terminology (DFT) WG is one of the earlier initiatives. Other important efforts include: Science Europe Data Glossary; Data Documentation Initiative (DDI); and Research Data Canada (RDC)/CASRAI RDM pilot evolved into a new International Research Data Management glossary (IRiDiuM) supported by RDC, CASRAI, and CODATA. Big Data at NIST etc...

  8. Quality Domain Vocabulary Development, Standardization, Registration, Harmonization and Support Growing Interest from a range of domains - Chem Research, Global Water, Biodiversity, Smart Cities etc. interest in the topic of domain vocabulary services from vocabulary registration to harmonization, for example, at P7's joint DFT and VSIG meeting . Followed up with BoF at P8 and one here Groups: Chemistry Research Data IG, Materials Science Registry WG Global Water Information IG, Quality of Urban Life IG, Agrosementics, Geospatial IG, Structural Biology IG, Biodiversity Data Integration IG, Health Data, …..

  9. Other Group Terms Provenance pattern and Prov graph are terms that might be defined for DFT to help others understand this work. Example of Prov use – getting credit for date item in a digital collections Data Description Registry Interoperability WG interest in developing a graph for controlled vocabularies

More Related