1 / 5

Data Handling Breakout Neil Chue Hong, OMII-UK

Data Handling Breakout Neil Chue Hong, OMII-UK. Data Handling. What’s important? Ease of management Lifetime / durability Interoperability with other systems / software Functionality Security / trust Data formats Size limitations S tandards for annotation of databases

taro
Download Presentation

Data Handling Breakout Neil Chue Hong, OMII-UK

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Data Handling BreakoutNeil Chue Hong, OMII-UK

  2. Data Handling • What’s important? • Ease of management • Lifetime / durability • Interoperability with other systems / software • Functionality • Security / trust • Data formats • Size limitations • Standards for annotation of databases • Are these different for different communities?

  3. Questions • Challenges from your research area • What does the NGS currently provide that ‘s useful? • What does the NGS need to provide? • What should we do next (as a community)?

  4. Challenges • Security of certain types of data is very important • E.g. storage of anonymous MRI image data for large scale research projects • If this is not resolved, data will stay within the lab • Data formats are all different and divergent • Need functionality to aggregate and integrate data from different formats • Policy varies between areas and internationally • Standards for annotation of databases needed

  5. NGS Wishlist • Identify and host up-to-date key databases in each field • Prevent decay , divergence and desynchonisation of locally copied datasets • Easy way for database providers to publish datasets to NGS • Map VO attributes to Unix groups so VO’s can have control on authorisation to their data • Make it easy to make data available on worker nodes when it’s needed • Provide more information for submission of data, how-to’s for common usage scenarios(e.g. SNP calling, BLAST search)

More Related