1 / 23

Building a Data Sharing Community

Building a Data Sharing Community. In collaboration with. The Vertebrate Networks. Facilitate open access to specimen data on the web Enhance the value of specimen collections Conserve curatorial resources Use a design easily adapted by other disciplines with similar needs. Primary Goals.

ash
Download Presentation

Building a Data Sharing Community

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Building a Data Sharing Community

  2. In collaboration with The Vertebrate Networks

  3. Facilitate open access to specimen data on the web Enhance the value of specimen collections Conserve curatorial resources Use a design easily adapted by other disciplines with similar needs Primary Goals

  4. Performance Critical Challenge #1

  5. Critical Challenge #1

  6. Critical Challenge #1

  7. Performance Aggregation Critical Challenge #2

  8. Performance Aggregation Costs and Sustainability Critical Challenge #3

  9. ~ $200k annually reduced to ~ $20k annually Critical Challenge #3

  10. Critical Challenge #3 All you need is a Darwin Core Archive Create your DwC-A or we'll do it for you Publish it yourself or we'll host it for you No servers, no extra IT expertise needed Easy

  11. Performance Aggregation Costs and Sustainability Technological Integration Critical Challenge #4

  12. Critical Challenge #4 Big Data 157+ institutions + 377+ collections = ~100M records and growing Technical Challenge: Downloading, aggregating, caching, and serving these data from the cloud Technical Solution: "Gulo": aggregates archives in the cloud

  13. Visualization: VertNet & CartoDB

  14. Opening Doors to Innovation

  15. Progress So Far... 32 institutions (79 collections) are up 19 institutions (44 collections) in process 106 institutions (228 collections) waiting In CartoDB to date (44 archives): 3,367,773 records processed 1,606,374 mappable records 228,270 distinct, mappable coordinates 162,077 distinct scientific names

  16. Moving Forward 2012-2013: • Finish transitioning current networks into VertNet • 2012-2013: Develop User Interface for data searching • 2012-2013: Integrate with other partners and projects 2013-2014: • Develop tools for visualization, discovery, and improvement (annotations, thesaurus, phylogenetic browser) • Sustainability Workshop

  17. Dave Bloom - VertNet Coordinator dbloom@vertnet.org Laura Russell - VertNet Programmer larussell@vertnet.org Carla Cicero - VertNet PI ccicero@berkeley.edu

  18. All Aves

  19. Field Museum of Natural History

  20. Hyla regilla

  21. Hyla regilla

More Related