
GEO 802, Data Information Literacy Winter 2019 – Lecture 2 Gary Seitz, MA

Discovery and Acquisition of Data. GEO 802, Data Information Literacy Winter 2019 – Lecture 2 Gary Seitz, MA. Lesson 2 Outline. Data Repositories. Discipline-related repositories. Portals for data publication. Open Data from organizations. Luis Prado from The Noun Project.


Presentation Transcript


  1. Discovery and Acquisition of Data. GEO 802, Data Information Literacy, Winter 2019 – Lecture 2. Gary Seitz, MA

  2. Lesson 2 Outline • Data Repositories • Discipline-related repositories • Portals for data publication • Open Data from organizations (Image credit: Luis Prado from The Noun Project)

  3. Re3data: Registry of Research Data Repositories

  4. Re3data: Registry of Research Data Repositories

  5. Re3data: Registry of Research Data Repositories

  6. Re3data: Registry of Research Data Repositories

  7. Exercise • Check the Registry of Research Data Repositories (www.re3data.org): • Can you find data repositories in your field? • List 5 data repositories where you think you could find data for your thesis.

  8. Directories of repositories • Open Access Directory: Data Repositories. Launched in 2008 and hosted by the Graduate School of Library and Information Science at Simmons College, the Open Access Directory is a wiki that lists links to over 50 open data repositories in the disciplines of archaeology, biology, chemistry, environmental sciences, geology, geosciences and geospatial data, marine sciences, medicine and physics, as well as multidisciplinary open data repositories.

  9. Data repositories: Zenodo • A research data repository. It was created by OpenAIRE and CERN to provide a place for researchers to deposit datasets • Integrates with GitHub to make code hosted on GitHub citable • Provides secure archiving and citability, including digital object identifiers (DOIs) • Easy access • Disadvantage: no curation, no quality control
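
To make the "easy access" point concrete, here is a minimal Python sketch of how a keyword search against Zenodo's public REST search endpoint (`https://zenodo.org/api/records`) might look. The helper names `build_search_url`, `summarize_hits` and `search_zenodo` are hypothetical and not part of the lecture, and the response layout (`hits.hits[]` with a `metadata.title` and a DOI) is an assumption based on Zenodo's documented JSON format; check the current API documentation before relying on it.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Public search endpoint of the Zenodo REST API.
ZENODO_SEARCH = "https://zenodo.org/api/records"


def build_search_url(query, size=5):
    """Build a search URL for a free-text query (hypothetical helper)."""
    return ZENODO_SEARCH + "?" + urlencode({"q": query, "size": size})


def summarize_hits(response):
    """Reduce a decoded search response to (doi, title) pairs.

    Assumes the layout {"hits": {"hits": [{"doi": ..., "metadata": {...}}]}};
    missing fields come back as None instead of raising.
    """
    rows = []
    for hit in response.get("hits", {}).get("hits", []):
        meta = hit.get("metadata", {})
        rows.append((hit.get("doi") or meta.get("doi"), meta.get("title")))
    return rows


def search_zenodo(query, size=5):
    """Run the search over the network and return (doi, title) pairs."""
    with urlopen(build_search_url(query, size), timeout=30) as resp:
        return summarize_hits(json.load(resp))
```

Calling `search_zenodo("glacier mass balance")` would return a short list of DOI and title pairs to inspect by hand. Because Zenodo applies no curation, the quality of each hit still has to be judged from the record itself.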

  10. Data repositories: Dryad • International repository of data underlying scientific and medical publications • All data files are associated with a published article, and are made available for reuse under the terms of a Creative Commons Zero waiver • Began charging submission fees in September 2013 • Data in Dryad receives a permanent, unique digital object identifier (DOI)

  11. A look at Dryad • For datasets associated with publications only • $80/data package, unless… • Journal sponsors the submission • Discipline agnostic • Some integration w/journals • Metadata (DC) http://datadryad.org/

  12. Data repositories: figshare • Repository for data and files (figures, datasets, images, audio and videos, articles (including pre-prints), posters, software and file sets) • Advantage: items are assigned a DOI; researchers can publish negative data; download statistics for hosted materials are tracked and serve in turn as a source for altmetrics; partnership with PLOS • Disadvantage: operated by Macmillan (Nature)

  13. A look at figshare http://figshare.com/

  14. Browsing in figshare

  15. Data repositories • Have a close look at the records to see how Zenodo, Dryad and figshare have made their records discoverable and accessible. Compare them. Which bibliometric data is provided?

  16. Data repositories

  17. Data repositories • DataCite: DataCite is a leading global non-profit organisation that provides persistent identifiers (DOIs) for research data and other research outputs. Organizations within the research community join DataCite as members to be able to assign DOIs to all their research outputs. • Protocols.io: An up-to-date open access repository of science methods and a collaborative protocol-centered platform to find and share life science protocols. • GitHub: GitHub is a web-based Git or version control repository and Internet hosting service. It offers all of the distributed version control and source code management (SCM) functionality of Git as well as adding its own features. It provides access control and several collaboration features such as bug tracking, feature requests, task management, and wikis for every project.

  18. Data repositories • ROAR: The aim of ROAR is to promote the development of open access by providing timely information about the growth and status of repositories throughout the world. • Research Data Australia: Research Data Australia helps you find, access, and reuse data for research from over one hundred Australian research organisations, government agencies, and cultural institutions. • ICSU World Data System: WDS aims to facilitate scientific research by coordinating and supporting trusted scientific data services for the provision, use, and preservation of relevant datasets, while strengthening their links with the research community.
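
Since a DOI is what makes a dataset citable, it can help to see how one is turned into a link and a formatted reference. The Python sketch below normalizes a DOI into its `https://doi.org/` resolver URL and prepares a request that asks the resolver for a formatted citation through DOI content negotiation (the `text/x-bibliography` media type). The function names are hypothetical, and whether a given style is returned depends on the registration agency behind the DOI, so treat this as an illustration rather than a guaranteed interface.

```python
from urllib.parse import quote
from urllib.request import Request, urlopen

# Common ways a DOI is written in papers and repository records.
_PREFIXES = (
    "https://doi.org/",
    "http://doi.org/",
    "https://dx.doi.org/",
    "http://dx.doi.org/",
    "doi:",
)


def doi_to_url(doi):
    """Return the https://doi.org/ resolver URL for a DOI in any common form."""
    doi = doi.strip()
    for prefix in _PREFIXES:
        if doi.lower().startswith(prefix):
            doi = doi[len(prefix):]
            break
    # Keep "/" readable; percent-encode characters that are unsafe in a URL.
    return "https://doi.org/" + quote(doi, safe="/")


def citation_request(doi, style="apa"):
    """Prepare (but do not send) a content-negotiation request for a citation."""
    headers = {"Accept": f"text/x-bibliography; style={style}"}
    return Request(doi_to_url(doi), headers=headers)


def fetch_citation(doi, style="apa"):
    """Send the request and return the formatted citation as text."""
    with urlopen(citation_request(doi, style), timeout=30) as resp:
        return resp.read().decode("utf-8").strip()
```

Passing the DOI of a dataset you found in Zenodo, Dryad or figshare to `fetch_citation` should give a reference string that can be pasted into a bibliography, which is a quick way to check how well a repository's metadata supports citation.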

  19. Discipline-related repositories • Ecology • Long Term Ecological Research (LTER): https://portal.lternet.edu/nis/home.jsp • EcoTrends: http://www.ecotrends.info/ • Ecological Society of America (ESA) Data Registry and Archive: http://data.esa.org/esa/style/skins/esa/index.jsp • Knowledge Network for Biocomplexity (KNB): https://knb.ecoinformatics.org/index.jsp • Oceanographic Data Repositories: provides access to several oceanographic data repositories created by the US Joint Global Ocean Flux Study and US Global Ocean Ecosystem Dynamics programs. • Global Biodiversity Information Facility: http://www.gbif.org/

  20. Discipline-related repositories • Life and Biological Sciences • Biogeographic Information and Observation System (BIOS) • Protein Data Bank: Experimentally determined structures for macromolecules (proteins and nucleic acids). The site includes search and visualization tools. • TreeBase: http://treebase.org/treebase-web/home.html

  21. Discipline-related repositories • Environmental and Geosciences • Marine Geoscience Data System (MGDS): A data portal hosted at the Lamont-Doherty Earth Observatory (Columbia University) • National Climatic Data Center (NCDC): Meteorology and paleoclimatology • National Oceanographic Data Center (NODC): Worldwide marine environmental and ecosystem data • National Snow and Ice Data Center (NSIDC): Cryospheric datasets from ground field research and satellites • DataONE (Data Observation Network for Earth): https://search.dataone.org/data • Kompetenzzentrum Forschungsdaten: https://www.komfor.net/data-portal.html • Polar Data Catalogue: https://www.polardata.ca/

  22. Discipline-related repositories • Environmental and Geosciences • DASH (University Corporation for Atmospheric Research & National Center for Atmospheric Research) • WDC (World Data Center for Climate): https://cera-www.dkrz.de/WDCC/ui/cerasearch/ • Climate Data at the National Center for Atmospheric Research: https://www.earthsystemgrid.org/home.html • ENES (European Network for Earth System Modelling): https://verc.enes.org/ • EarthChem: EarthChem operates and maintains a suite of data systems and data collections that provide access to a wide variety of solid earth data

  23. Discipline-related repositories • Environmental and Geosciences • Atmospheric radiation measurement data: focuses on obtaining continuous measurements and providing data products that promote the advancement of climate models. • CUAHSI: a list of web portals and/or websites with data or links to data on water resources. The portals generally provide data that are at a minimum national in scope, and many of the portals offer global data. • British Atmospheric Data Centre (BADC): Data Centre for the Atmospheric Sciences • KNMI Climate Explorer: http://climexp.knmi.nl/ • USGS Water Data for the Nation: https://waterdata.usgs.gov/nwis • PANGAEA Data Publisher for Earth & Environmental Science: http://www.pangaea.de/

  24. Discipline-related repositories • GIS and Geography • GeoCommons.com: GIS file repository and finding tool • Federal Geographic Data Committee: Provides access to the National Spatial Data Infrastructure (NSDI) Clearing House Network and the geodata.gov portal • http://inspire-geoportal.ec.europa.eu/ : The INSPIRE Geoportal is the central European access point to the data provided by EU Member States and several EFTA countries under the INSPIRE Directive. • Geoportal, geodata from Germany: http://www.geoportal.de/ • Geodatenkatalog: https://wiki.gdi-de.org/display/gdk/Geodatenkatalog-DE/

  25. Discipline-related repositories • Remote Sensing • GEOSS data portal: http://www.geoportal.org • Global Change Master Directory: The Global Change Master Directory, maintained by the Earth Sciences Directorate at the National Aeronautics and Space Administration (NASA), provides access to more than 25,000 earth and environmental science datasets relevant to global change and Earth science research.

  26. Discipline-related repositories • Chemistry • ORNL DAAC for Biogeochemical Dynamics: The Oak Ridge National Laboratory Distributed Active Archive Center for Biogeochemical Dynamics is one of the NASA Earth Observing System Data and Information System (EOSDIS) data centers • Cambridge Structural Database: small molecule crystal structures • ChemSpider: free-to-access collection of chemical structures and their associated information • eCrystals: X-ray crystallographic data • PubChem: NCBI's repository of bioactivity/bioassay data and information for "small" molecules (i.e. not macromolecular). Both text-based and structure-based search tools are provided

  27. Discipline-related repositories • Social Sciences • ICPSR (Inter-university Consortium for Political and Social Research) at the University of Michigan • Dataverse Network is a collection of social science research data contained in virtual data archives called "dataverses". • FORS: Swiss Centre of Expertise in the Social Sciences. FORS conducts large national and international surveys and offers data and research information services for researchers and academic institutions. • SSOAR: Social Science Open Access Repository

  28. Discipline-related repositories • Exercise • Look through discipline-related repositories in your field. • Have a close look at the records to see the ways repositories have made their records discoverable and accessible. List positive and negative aspects of the search in those repositories. • Can you already find data that you could use? Save one dataset you could maybe use.

  29. Data Search Machine: DataSearch • As of June 2016, DataSearch indexes (completely or partially) the following content sources: a) Tables, figures and supplementary data associated with papers in ScienceDirect, arXiv b) EarthChem Portal, Dryad, ICPSR, Harvard Dataverse, Mendeley Data, NeuroElectro, PANGAEA and ThermoML

  30. Data Search Machine: Google Dataset Search • How well does the Google search work, judging by your knowledge of and experience with the data repositories you have looked at?

  31. Data papers & data journals: Earth System Science Data

  32. Data papers & data journals: Geoscience Data Journal

  33. Data papers & data journals: Biodiversity Data Journal

  34. Data papers & data journals: Nature Scientific Data

  35. Data papers & data journals: Journal of Open Psychology Data

  36. Data papers & data journals: Journal of Open Research Software

  37. Data papers & data journals: Geoscientific Model Development

  38. Data papers & data journals: CODATA Data Science Journal • The CODATA Data Science Journal is a peer-reviewed, open access, electronic journal, publishing papers on the management, dissemination, use and reuse of research data and databases across all research domains, including science, technology, the humanities and the arts.

  39. Data papers & data journals: Data in Brief

  40. Data papers & data journals: Sciencematters.io

  41. Data papers & data journals • A list of further data journals is here: https://www.wiki.ed.ac.uk/display/datashare/Sources+of+dataset+peer+review • Have a look at the “About” of one of these journals. What didn’t you expect to see? • What do you think is the advantage of publishing your data in a special data journal? • What could be the advantage for the progress of science and for the public?

  42. Data Citation Index (I)

  43. Data Citation Index (II) • Started in October 2012 • There are about 350 repositories in DCI • Cross-disciplinary, main focus on science; 50% of the data are from medicine • Linked with the bibliographic record in Web of Science • Linking of peer-reviewed articles with underlying research data • Uniform metadata schema

  44. Data Citation Index (III) Data Citation Index – Descriptive Document

  45. Data Citation Index: Search - plasma membrane protein*

  46. Result list

  47. A dataset with link to its source

  48. UNdata

  49. GLOBAL OPEN DATA INDEX

  50. Google Public Data
