1 / 33

Publishing and Cataloguing Datasets It’s time everyone got involved

Publishing and Cataloguing Datasets It’s time everyone got involved. UKSG Conference 2009. Part 1. BUT FIRST, LET’S GET SERIOUS. OECD likes being cool with data. Some data to start with. TIB in Hamburg say they have archived and added DOIs to 500,000 datasets.

greeneg
Download Presentation

Publishing and Cataloguing Datasets It’s time everyone got involved

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Publishing and Cataloguing DatasetsIt’s time everyone got involved UKSG Conference 2009

  2. Part 1 BUT FIRST, LET’S GET SERIOUS OECD likes being cool with data.

  3. Some data to start with TIB in Hamburg say they have archived and added DOIs to 500,000 datasets. Yet their most-cited dataset has been cited just 3 times.

  4. So … Either no-one wants to cite data OR, having a DOI by itself isn’t enough

  5. Let us imagine for a moment . . .

  6. If an article is . . . “ A piece of data that is presented in a static, two-dimensional, form. “ Geoffrey Bilder, CrossRef, 2007

  7. Print Static, two-dimensional objects http://dx.doi.org/10.1787.280675838368 • PDF • HTML

  8. Active, two-dimensional object http://dx.doi.org/10.1787.280675838368

  9. Another active, two-dimensional object?

  10. It’s a view on a datacube

  11. In fact, it’s a view on a collection of datacubes Active, multi-dimensional object!

  12. OECD Database Active, multi-dimensional object http://dx.doi.org/10.1787.280675838368 OECD Excel TableActive, two-dimensional object OECD ArticleStatic, two-dimensional object

  13. So, instead of imagining, let’s say we built this. We’d get something like . . .

  14. . . . this. Excel – Active two- dimensional object Dataset – Active multi- dimensional object Dataset OECD Regional Database PDF – Passive two- dimensional object

  15. OK – that’s cool and OECD can do this because we have all the objects in our publishing system. But how are other publishers, authors and librarians coping with data?

  16. Chart from The Economist Source: OECD

  17. Taken from an appendix to an article published in Elsevier’s World Development Source: Acemoglu et al (2001), based on Curtin, 1989, Philip D. Curtin, Death by migration: Europe’s encounter with the tropical world in the nineteenth century, Cambridge University Press, New York (1989).Curtin 1989 and other sources. Tertiary school enrollment: School enrollment, tertiary (% of gross). Source: Barro and Lee (2000) and their databases. You can’t fault the author for trying . . . but it’s not a lot of help for a reader

  18. And Librarians, How many are cataloguing datasets in their OPACs in ways which are compatible with search systems for books and journals?

  19. Conclusion: Datasets:Scholarly Publishing’s Black Sheep?

  20. A&I & subject portals Publishers EconLitRePEc ScienceDirect OPACs Scholarly Publishing Sites for Journals and Books Library portals Network Ingenta Content Aggregators

  21. A&I & subject portals Publishers EconLitRePEc ScienceDirect OPACs Scholarly Publishing Sites for Journals and Books Library portals Network Ingenta Content Aggregators

  22. A&I & subject portals Publishers EconLitRePEc ScienceDirect OPACs Scholarly Publishing Sites for Journals and Books Library portals Network Ingenta Content Aggregators

  23. Using metadata for: Datasets In the same industry standard formats as . . .Book chapters and Journal articles

  24. Authors will be able to cite . . . Publishers will be able to link . . . Discovery systems will be able to find . . . Librarians will be able to catalogue . . . Datasets alongside published outputs . . . . . . to the benefit of Everyone

  25. A proposed example of a dataset using standard bibliographic and citation metadata. Dataset title with ISSN, DOI (& MARC) record Citation tool compatible with EndNotes et al Bibliography of Books that cite this database

  26. There are still challenges: - Dynamic data - Versioning - Preservation But, let’s round the sheep up first.

  27. OECD is: • issuing a white paper on Publishing Standards for Datasets • Speaking with CrossRef about citation standards for dynamic objects • publishing OECD datasets with ‘sheepdogs’ from mid-2009: MARC records ONIX records Citation records

  28. End of Part One QUESTIONS? DISCUSSION?

  29. Part 2 BEING COOL WITH DATA: OECD’S FIRST STEPS

  30. March 2007

  31. April 2009 Print edition Web-book on SourceOECD USB Key EditionOECD Factbook on eXplorer (new for 2009) OECD Factbook on iPhone (new for 2009)

  32. OECD Regional Statistics using NCVA’s eXplorer tool October 2008 March 2009 OECD Regional Statistics data using the eXplorer tooltool http://stats.oecd.org/OECDregionalstatistics/

  33. Other cool visualisation stuff • IMF Datamapper on www.imf.org. See also www.mappingworlds.com who provided the technology. • See Gapcasts and Trendalyzer on www.gapminder.org • The New York Timesuses a lot of dynamic graphics • USA Today built their reputation on graphics – now they’re doing it online. We like How much is $700bn? • Economist’sChart Gallery generate a lot of comment. • Data sharing sites include www.swivel.com, www.many-eyes.com an newcomers www.icharts.net and www.widgenie.com . • There are many blogs on charts or visualization such as www.flowingdata.comorwww.eagereyes.org

More Related