1 / 28

Biodiversity Informatics at the Natural History Museum

Biodiversity Informatics at the Natural History Museum. Ed Baker Terrestrial Invertebrates, Department of Life Sciences & NHM Informatics Initiative. Science as a Slow Cooker. Only the surface visible Lid kept on for extended periods of time Uses cheap cuts of raggy meat

jaafar
Download Presentation

Biodiversity Informatics at the Natural History Museum

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Biodiversity Informatics at the Natural History Museum Ed Baker Terrestrial Invertebrates, Department of Life Sciences & NHM Informatics Initiative

  2. Science as a Slow Cooker • Only the surface visible • Lid kept on for extended periods of time • Uses cheap cuts of raggy meat • Ingredient lose their nutritional value • Children at risk due to high temperatures http://ispiders.blogspot.co.uk/2011/11/realtime-web.html

  3. http://en.wikipedia.org/wiki/Slow_cooker

  4. We like data • 70 million+ specimens collected over 400 years • 350,00+ books • ??? Unpublished datasets in archive, notebooks, computes • ??? In the minds of staff

  5. How do we provide access? • Digitisation of specimens and associated data • Scanning and transcribing books, journals, archives • Providing tools for managing the data life cycle • Changing the way we publish: data publication

  6. Flowing Data Collection Curation Use Publication

  7. Flowing Data Collection Curation Sits in desk drawer or on a hard drive until…. Somebody retires Somebody dies Project is cancelled

  8. Flowing Data Collection Curation Use Data Publication Publication Re-use Re-use Re-use Re-use

  9. Flowing Data: from collection to reuse Collection Curation Use Data Publication Publication Re-use Re-use Re-use Re-use

  10. Collection Citizen Science Automated identification and monitoring Traditional taxonomic sources

  11. Flowing Data: from collection to reuse Curation Use Data Publication Publication Re-use Re-use Re-use Re-use

  12. Curation • Websites for communities to publish and curate: • Taxonomy / nomenclature • Bibliographies • Specimen information • Character matricies

  13. Flowing Data: from collection to reuse Use Data Publication Publication Re-use Re-use Re-use Re-use

  14. Use: Oboe

  15. Use: Oboe

  16. Flowing Data: from collection to reuse Data Publication Publication Re-use Re-use Re-use Re-use

  17. Publication (Data) • Datasets • Single species descriptions • Checklists • Software

  18. Flowing Data: from collection to reuse Publication Re-use Re-use Re-use Re-use

  19. Publication (Research) • Traditional research • Systematic zoology • Phylogeny • Biogeography

  20. Flowing Data: from collection to reuse Re-use Re-use Re-use Re-use

  21. The Problem of Scale • Data is being generated by tens of thousands of researchers, in thousands of institutions • Hard to find what you need • Hard to know if what you need actually exists • Impossible to go through researcher by researcher

  22. NHM Data Portal • Aggregator for NHM science data • Visualisation tools for datasets • Allows export of NHM data for re-use

  23. The Informatics Landscape >18K specimen records (local small scale coverage) >276M specimen records (worldwide coverage)

  24. The Informatics Landscape A webpage for every species Aggregate specimen and observation data globally

  25. Wikimedianin Residence • Make NHM content available under open licenses for use on Wikimedia projects (and elsewhere) • Reach of Wikipedia: BBC, Encyclopedia of Life • Wikisource: Transcription and translation crowd-sourcing

  26. Flowing Data: from collection to reuse ?

  27. "Everybody makes mistakes. And if you don't expose your raw data, nobody will find your mistakes." Jean-Claude Bradley http://bit.ly/146ugIv

More Related