1 / 13

Supporting Heterogeneous Data Access in Genomics

Supporting Heterogeneous Data Access in Genomics. Terence Critchlow Center for Applied Scientific Computing Lawrence Livermore National Laboratory March 2002. Outline. Motivation Approach Specific use cases Introduction to others.

darcie
Download Presentation

Supporting Heterogeneous Data Access in Genomics

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Supporting Heterogeneous Data Access in Genomics Terence Critchlow Center for Applied Scientific Computing Lawrence Livermore National Laboratory March 2002

  2. Outline • Motivation • Approach • Specific use cases • Introduction to others

  3. Motivated by current state of the art in genomics data access. The user is required to perform all data management tasks. Source Specific Schema PDB Different users end up doing the same thing. SWISS-PROT SCoP dbEST User applications Transform Map data format similar concepts Parse Access input/ the data output

  4. What is the ideal environment? A single location that provides effective access to a consistent view of data and tools from many sources through an intuitiveanduseful interface. :: Parse Access input/ the data output User applications Transform Map data format similar concepts

  5. a realistic What is the ideal environment? A single location that provides effective access to a consistent view of data and tools from many sources through an intuitiveanduseful interface. :: Parse Access input/ the data output User applications Transform Map data format similar concepts

  6. PDB Semantic Wrapper XPath Wrapper Model-Based Mediator Semantic Wrapper XPath Wrapper DF Matt Semantic Wrapper XPath Wrapper VIPAR Wrapper Medline XPath Wrapper : XPath Wrapper Metadata Registry XWrap SDM Center Data Integration Infrastructure Query Dispatch and Collection (QDaC) GUI External Tools

  7. Blast Matt ::: ::: Provide access to many more sources than Matt currently has Unfortunately, Matt cannot query all of the relevant data sources. Use case 1: Find everything related to a sequence MILLAFSSGRRLDFVHRSGVFFFQTLLWILCATVCGTEQYFN The more sources queried, the more valuable the results

  8. Matt ::: Use case 1: Find everything related to a sequence Blast • Additional Desired Capabilities • Handle hundreds of sequences • Search using other tools • Preprocess sequence(s) • Use results as input to other tools and queries

  9. Gene name /accession # Clusfavor Genbank Model sequence Matt Sequence Blast against HTGS Modelbuilder Homologs Filter Sequence Accession # Transfac Sequence Subseq to 2000bp Use case 2: Identifying model sequences Hundreds of sequences MILLAFSSGRRLDFVHRSGVFFFQTLLWILCATVCGTEQYFN

  10. Summary • Matt’s current research objectives focus on Use Case 2 • That is our current target • Details of current status in following talks • Context-sensitive Service Composition for Support of Scientific Workflows • Mladen A. Vouk •  XWRAPComposer: A wrapper generation system for Integrating Bioinformatics Data Sources • Ling Liu • Constructing Workflows by Integrating Interactive Information Sources • Amarnath Gupta

  11. Questions?

  12. LLNL Terence Critchlow (lead) Georgia Tech Calton Pu Ling Liu David Buttler Dan Rocco Henrique Paques Wei Han SDSC Bertram Ludaescher Amarnath Gupta Ilkay Altintas Agent Technology Tom Potok (ORNL) Mladen Vouk (NCSU) Target Users Matt Coleman (LLNL) Allen Christian (LLNL) Phil Bourne (PDB) People

  13. This work was performed under the auspices of the U.S. Department of Energy by University of California Lawrence Livermore National Laboratory under contract No. W-7405-ENG-48.

More Related