1 / 12

Interoperability of phylogenetic data

Interoperability of phylogenetic data. Weigang Qiu, Rutger Vos. Introduction. Improving interoperability at the data level. We consider two options: “ new standard ” approach: a new file format, endorsed by this meeting;

tamar
Download Presentation

Interoperability of phylogenetic data

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Interoperability of phylogenetic data Weigang Qiu, Rutger Vos

  2. Introduction • Improving interoperability at the data level. • We consider two options: • “new standard” approach: a new file format, endorsed by this meeting; • “abstraction” approach: parsing and serialization layer, intermediate data model layer;

  3. A new standard • What is to be done to achieve this? • Walk through of the steps involved; • Compare & contrast with abstraction approach;

  4. Creating a new standard • Exhaustively define and publish structure and syntax; • Create unambiguous validation procedure; • Create extension protocol: • Governed extension adoption mechanism • Versioning

  5. Commitment to maintenance • Standard governance means long term commitment of someone or some “body” (NESCENT? OBF? Us?), but: • So does abstraction approach, • …which doesn’t encourage standardization • …and trails rather than leads data trends • …neither of which provide impetus for maintenance

  6. New standard, new features • When designing standard, add attractive new features from the start: • Substitution models; • More metadata for taxa, trees, nodes, matrices, sequences, sites; • More metadata for “project” (analysis metadata, logging)

  7. Expanding abstraction architecture • Even without new features, facilitating union of existing features in abstraction approach implies complex ontology and metaformat: • Premature generalization • Analysis paralysis

  8. Implement IO in common tools • Possible early adopters: • Services: CIPRES and TreeBASE • Analysis apps: Paup*, HyPhy, MEGA, MrBayes, Mesquite • Toolkits: Bio::*

  9. Implementation of abstraction • Abstraction architecture needs to be developed separately • By whom? In what language? • More of a “complete rewrite” • Doesn’t tap into phylogenetics community expertise • Might be “lock in”

  10. Adoption of new standard • Advocacy to increase community adoption: • Carrots: • Access to new features • Robust, can be validated objectively • Interoperability • Stick: • Submission requirement for services, databases, journals

  11. Adoption of abstraction architecture • Adoption is hampered by catch-22: • Abstraction architecture only encourages contributions of mappings from 3rd party authors if that “adds value” to their application, • But main added value of abstraction architecture is the number of contributed mappings

  12. Do the simplest thing that could possibly work

More Related