1 / 78

Metadata

Metadata. RCN Workshop Samantha Romanello Long Term Ecological Research University of New Mexico. In this session we will discuss…. Metadata: what are they? and why should they be created? Metadata standards: why do we need them? Metadata tools: what’s out there to help?

salene
Download Presentation

Metadata

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Metadata RCN Workshop Samantha Romanello Long Term Ecological Research University of New Mexico

  2. In this session we will discuss… • Metadata: what are they? and why should they be created? • Metadata standards: why do we need them? • Metadata tools: what’s out there to help? • Creating metadata: just how much work is this? • Finding and evaluating metadata: what is good? • Metadata resources: what’s out there?

  3. Metadata what are they? and why should they be created?

  4. What are metadata? Metadata? “higher level information that describe the content, quality, structure, and accessibility of a specific data set” Michener et al., 1997

  5. Example In front of you are two tuna shaped cans. How do you decide which can you would like to eat?

  6. Metadata helps you decide which can you would like to eat !

  7. Metadata are • The label • The information the label contains • Our understanding of what a label is and the information it describes

  8. Metadata • Provides the context of when, where, why, and how the data was collected • It also provides the who – some insight into the analytical framework of the scientist who collected the data

  9. Metadata is all around…

  10. Data 072998 29.5 17.0 073098 29.7 6.1 073198 29.1 0

  11. Data -- Metadata Date Temp (C) Precip. (mm) Obs. #1 072998 29.5 17.0 Obs. #2 073098 29.7 6.1 Obs. #3 073198 29.1 0

  12. www.utexas.edu/depts/grg/gcraft/notes/mapproj/gif/threepro.gifwww.utexas.edu/depts/grg/gcraft/notes/mapproj/gif/threepro.gif

  13. Value ofMetadata • Maintains internal investment in data • Provides information to data catalogs and clearinghouses • Promotes data sharing • Leads to potential research partners (e.g., promotes data discovery) • Clarifies semantics • Enables machine-processing

  14. Metadata describe: • Who? • What? • When? • Where? • How? about every facet of the data !

  15. In this session we will discuss… • Metadata: what are they? and why should they be created? • Metadata standards: why do we need them? • Metadata tools: what’s out there to help? • Creating metadata: just how much work is this? • Finding and evaluating metadata: what is good? • Metadata resources: what’s out there?

  16. Metadata Standards What are they and why do we need them?

  17. Why do we need Metadata Standards … … to reduce “information entropy” en·tro·py: a process of degradation or running down or a trend to disorder – Merriam-Webster

  18. Time of publication Specific details General details Retirement or career change Information Content Accident Death Time after Michener et al., 1997 Information Entropy over Time Information usefulness at 10 years, 20 years, 30 years…

  19. Why are metadata standardized ? • To provide a common set of understandable terms to describe data; • To facilitate entry and retrieval of metadata and data; and • To create tools which can automate entry, search and integration of data

  20. Metadata Standardization • Defines a common terminology • Allows for system “cross-walks”; that is, mapping one metadata structure to another • Format and Structure • Binary (GeoTIFF header) … Text (XML) • Proprietary (MrSID) … Open (EML) • Allows software engineers to automate • Entry • Searching • Integration • Synthesis

  21. Metadata Content Specifications • Dublin Core • NBII (National Biological Information Infrastructure) Biological Data Profile / CSDGM (Content Standards for Digital Geospatial Metadata) • ISO (International Organization for Standardization) CD 19115, Geographic information - metadata • LTER Data Table of Contents • Darwin Core • Ecological Metadata Language (EML)

  22. Ecological Metadata Language • Adopted by the LTER Information Management • Metadata specification developed by the ecology discipline for the ecology discipline • Based on prior work of Ecological Society of America and others (Michener et. al., 1997) • Seven years in development – 14 versions • EML 2.0.1 • Implemented as an XML Schema • Supports four separate modules • Dataset • Citation • Software • Protocol

  23. 5 Classes of ecological metadata descriptors • Data set • Research origin • Data set status and accessibility • Data structural • Supplemental

  24. Metadata Descriptors • What relevant data exist? • Why were those data collected and are they suitable for a particular use? • How can these data be obtained? • How are the data organized and structured? • What additional information is available that would facilitate data use and interpretation?

  25. XML: eXtensible Markup Language • Development influenced by SGML and HTML – Version 1.0 in early 1998 • A semantic language that lets you more meaningful annotate text (where HTML lets you define how text can be displayed, XML provides it with meaning). • Important for presentation, exchange, and management of information • Tools include DTD, Schema, XSL, and more…

  26. In this session we will discuss… • Metadata: what are they? and why should they be created? • Metadata standards: why do we need them? • Metadata tools: what’s out there to help? • Creating metadata: just how much work is this? • Finding and evaluating metadata: what is good? • Metadata resources: what’s out there?

  27. Metadata tools what’s out there to help?

  28. A Smorgasboard of Metadata Tools • Proprietary • Non-proprietary • On-line • Standalone • Windows • ASCII • Unix

  29. Tools for Managing Metadata • Flat-file System • Hybrid Flat-file System • Relational Databases • Oracle, PostgreSQL, mySQL • Hybrid Relational Databases • Metacat, Digital Library eXtension Service • Hierarchical Databases • Adabas, IMS • Object-Relational Databases • Birdstep, XDb, JADE

  30. Metacat Data Repository

  31. Tools for Creating Metadata • Text editors • Notepad (Windows) • Emacs, vi (UNIX, Linux, …) • XML Specific (XMLSpy, oXygen, …) • Custom software • NBII Metamaker • ESRI ArcCatalog • ecoinformatics.org Morpho

  32. ESRI ArcCatalog Metadata • Metadata properties: Derived from the data itself and automatically created by ArcCatalog • Documentation: Written by a person • Metadata editor enforces Federal Geographic Data Committee (FGDC) standards • Stored in XML format within the geodatabase • Automatically exports/transfers with coverages

  33. ESRI ArcCatalog Metadata Attribute Metadata: Properties…automatic Documentation…input

  34. ESRI ArcCatalog Metadata Metadata creation/import selections Metadata Tab Catalog list Metadata Sections Metadata Parts

  35. ESRI ArcCatalog Metadata Location Metadata

  36. ESRI ArcCatalog Metadata Import a template or existing file Edit screen

  37. Morpho • Metadata editor enforces EML 2.0 standards • Stored in XML format within Metacat server • Automatically imports Metadata and data • 5 Classes of Metadata Descriptors • Data set descriptors • Research origin descriptors • Data set status and accessibility • Data structural descriptors • Supplemental descriptors

  38. Morpho • Create & Edit Metadata • Search & Query Metadata Collections

  39. Data Set Descriptors • Identity • Identification code • Originator

  40. Data Set Descriptors Abstract Keywords Project description

  41. Research Origin Descriptors • Research methods • Experimental or sampling design

  42. Data Set Status & Accessibility Where data is located Who is the contact person How to access and use the data

  43. Data Set Status & Accessibility Dates of when the data were accessed & modified

  44. Data Structural Descriptors • Variable information • Units of measurement

  45. Data Structural Descriptors • Data type • Data format • Data anomalies

  46. Supplemental Descriptors • Data acquisition • Quality assurance • Supplemental materials • Computer programs • Archival info • Publications • History of usage

  47. In this session we will discuss… • Metadata: what are they? and why should they be created? • Metadata standards: why do we need them? • Metadata tools: what’s out there to help? • Creating metadata: just how much work is this? • Finding and evaluating metadata: what is good? • Metadata resources: what’s out there?

  48. Creating Metadata Just how much work is this?

  49. How much work is this going to be???

  50. Benefits of using Metadata • Information entropy , data longevity • Data reuse and sharing • Even original researchers need refreshing • System interoperability • Broad-based data synthesis • Compliance with funding agencies

More Related