1 / 22

A Virtual Distributed Database Model for Creating a Database Federation

A Virtual Distributed Database Model for Creating a Database Federation. Jaspreet Singh Pahwa PhD Year 2 School of Computer Science Cardiff University. Overview. Motivation Hypothesis Product Class Product Class Database Grid Enabled Multiple Database Search Service (MDSS)

anika
Download Presentation

A Virtual Distributed Database Model for Creating a Database Federation

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. A Virtual Distributed Database Model for Creating a Database Federation Jaspreet Singh Pahwa PhD Year 2 School of Computer Science Cardiff University

  2. Overview • Motivation • Hypothesis • Product Class • Product Class Database • Grid Enabled Multiple Database Search Service (MDSS) • MDSS Federation • Conclusions and further work

  3. Motivation • Information held by large number of data sources in a Wide Area Network. • Access to such information held by large number of organisations. • Information processing with aid of independent mechanisms, and • Sharing of information with those who need it.

  4. Hypothesis • Enabling product definition infrastructure. • Creating standard product definition using the infrastructure. • Subscription of product definitions and product creation by product suppliers. • Building a virtual distributed database using the Grid infrastructure. • Participation of product suppliers in the virtual distributed database as part of a federation. • Advertisement of products in the virtual distributed database by Product Suppliers. • Product search using product definition and the Grid.

  5. The Product Class • A standard product definition for providing product information. • Acts as a template and provides meta information for creating actual product information. • Is made up of a number of specifications conforming to different specification types. • Can be used by Product Suppliers for product advertisement in the federation.

  6. List Specification Value Value … The Product Class and its various specification types Product Class Table Specification Specification Group Sub Product Class Specification Specification Specification List of Column Specifications where each column specification has list of values relative to other column specification values. Value Table Spec List Specification Sub-Specification Group Product Class Specification

  7. Product Class Versioning • Versioning of Product Class/Specifications. • New products emerge. • Existing products evolve and are assigned more attributes. • New versions created and assigned. • Product suppliers notified. • New classes downloaded.

  8. Transport Layer Product Class Database Product Supplier Database Table Table <xml> … … … </xml> Xml Parse Service Table Table … … Transport Layer Product Class Subscription

  9. The Product Class Database • A tool for creating Product Classes and Product Class Specifications. • Enables Product Class versioning. • Releases Product Class for subscription by Product Suppliers. • Provides meta information for building product search criteria.

  10. Product Class Category Product Category Product Class Version Category Hierarchy Product Class Definition Specification List Version Specification Group Version Table Version Sub Product Class Version List Specification Table Specification Specification Group Sub Product Class List Definition Specification Group Definition Table Definition Sub Product Class Definition Product Class Database Structure

  11. The MDSS • Multiple Database Search Service. • A database centric Grid service for accessing and processing information from large number of data sources. • Searches supplier data sources based on a search criteria. • A channel for product suppliers for advertising products to a large user base. • Built using Globus Toolkit 3.0.2 (Core).

  12. The Search • What to search? - a search criteria submitted by the user. • Where to search? - data sources that match user’s request. • What are the available resources to propagate the search? - the available Database Search Services in the MDSS

  13. Search Criteria Where to Search? <NewDataSet> <Supplier> <IDSupplier>1</IDSupplier> <SupplierWS>http://131.251.42.40/test/categories.asmx</SupplierWS> </Supplier> … </NewDataSet> Available GSHs <NewDataSet> <GridServiceHandle> <IDGsh>1</IDGsh> <GSH>http://131.251.42.40:8080/ogsa/services/DatabaseSearchService</GSH> </GridServiceHandle> <GridServiceHandle> <IDGsh>2</IDGsh> <GSH>http://131.251.47.110:18080/ogsa/services/DatabaseSearchService</GSH> </GridServiceHandle> … </NewDataSet>

  14. The Architecture Two major components: • Master Grid Service: identify resources available, register new resources, job allocation, data parsing and data aggregation. • Database Search Service: data search, data aggregation, data parsing,collaborative working.

  15. Single user Metadata Product Classes Portal (Search Criteria Specified here) VO Master Grid Service * Metadata Query *Job Allocation Grid Service * Collation of datasets Grid Engine (based on the OGSA Model) Grid DB Search Service Grid DB Search Service Grid DB Search Service Apache Axis Soap Server <<soap messages >> <<soap messages >> <<soap messages >> WS Layer WS Layer WS Layer WS Layer WS Layer WS Layer Supplier databases - MDSS Architecture -

  16. Design Features • Ability to register new data sources. • Ability to enhance MDSS capability by adding more machines in the Grid cluster. • Supplier databases free of Grid Complexities. • Ability to identify data sources to be searched in real time. • Distribute search equally among available GSHs. • New instance of Database Search Service for each operation.

  17. Job Execution Grid Service Manager GSH Document Parser Master Grid Service Database Search Service Supplier Document Parser Data Aggregation Xml Converter OGSA Container Master Grid Service Modules

  18. MDSS Federation • Information retrieval model for retrieving product information without the need of creating federated schema. • Encompasses the MDSS System and all participating SD Systems. • Cooperation to the centralised MDSS System for advertisement of products to VOs. • Provides an environment for data models to compete with each other. • Database access mechanism Grid enabled and not data sources. • Component data models participate in the federation by providing web service interface.

  19. Single user / VO The MDSS System Database Search Job Allocation Collation of Datasets Web Service Layer Web Service Layer Web Service Layer Component Schema (Product Class) Component Schema (Product Class) Component Schema (Product Class) Component DBS Component DBS Component DBS Supplier 1 Supplier 2 Supplier n The MDSS Federation

  20. Design Features • Component DBS retain local autonomy. • Schema integration or creation of federated or external schemas not required. • Uniform data model addresses the issue of semantic heterogeneity. • Supports different kinds of autonomies.

  21. Conclusions & Further Work • MDSS Federation – a Grid enabled virtual distributed database model for product advertisement. • Creation of downloadable product classes. • Product class subscription. • Multiple instances of Master Grid Services. • Data Streaming. • Memory Resident databases.

  22. Questions?

More Related