Physical Design Patterns in Information Systems
450 likes | 656 Views
Karim Ali & Sarah Nadi CS848 – Spring 2010 July 14 th , 2010. Physical Design Patterns in Information Systems. Outline. Stages of Design Elements of Physical Design in Information Systems Different Physical Designs Disk Based Relational Database Systems (DRDB)
Physical Design Patterns in Information Systems
E N D
Presentation Transcript
Karim Ali & Sarah Nadi CS848 – Spring 2010 July 14th, 2010 Physical Design Patterns in Information Systems
Outline • Stages of Design • Elements of Physical Design in Information Systems • Different Physical Designs • Disk Based Relational Database Systems (DRDB) • Memory Based Relational Database Systems (MMDB) • XML Databases • Data Warehouses • Future Work • Open Problems • Summary & Conclusions Karim Ali & Sarah Nadi
Stages of Design • Describes the intended behavior Karim Ali & Sarah Nadi
Elements of Physical Design Karim Ali & Sarah Nadi
Indexes • Data needs to be organized for quick searching • I/O operations are expensive --> need to minimize Karim Ali & Sarah Nadi
Materialized Views • Repeated complicated queries should not have to be executed every time • Save execution time, and I/O reads by pre-computing the results & storing them • Materialized views are store on disk Karim Ali & Sarah Nadi
Paritioning • Divides the data into related partitions • Horizontal Partitioning: divides tables into sets of rows according to a specific attribute (E.g. Date ranges) • Vertical Partitioning: divides table into the sets of attributes that are usually accessed together • Reduces table scan time • Improves performance Karim Ali & Sarah Nadi
Clustering • Records that are accessed together are physically located together • Reduces the number of pages to be queried • Can have multi-dimensional clustering based on more than one criteria Karim Ali & Sarah Nadi
Data Compression Karim Ali & Sarah Nadi
Sriping, Mirroring, Denormalization Karim Ali & Sarah Nadi
Physical Design of Different Information Systems Karim Ali & Sarah Nadi
Disk Based Relational Database Systems (DRDB) Karim Ali & Sarah Nadi
DRDB: Indexes Karim Ali & Sarah Nadi
DRDB: Materialized Views Karim Ali & Sarah Nadi
DRDB: Paritioning Karim Ali & Sarah Nadi
DRDB: Clustering Karim Ali & Sarah Nadi
DRDB: Summary • Summary table/figure Karim Ali & Sarah Nadi
Main Memory Database Systems (MMDB) • Data resides in main memory • Cheaper to access main memory Karim Ali & Sarah Nadi
MMDB: Indexes • Factors to consider: • I/O operations are cheaper • Should be cache conscious • Types of indexes used: • B+trees • T Trees • Cache Sensitive Search Trees • Cache Sensitive B+ Trees • Prefetching B+ Trees • J+ Trees and pJ+ trees Karim Ali & Sarah Nadi
MMDB: Materialized Views Karim Ali & Sarah Nadi
MMDB: Partioning Karim Ali & Sarah Nadi
MMDB: Clustering Karim Ali & Sarah Nadi
MMDB: Summary • Summary table/figure Karim Ali & Sarah Nadi
Data Warehouses • Collection of data and decision support technologies • Used in: • Retail: user profiling • Finance: claims analysis, risk analysis, credit card analysis, and fraud detection • Healthcare: outcomes analysis Karim Ali & Sarah Nadi
DW: Challenges • Data is usually • Extremely large • Multi-dimensional • Priority for aggregated and summarized data • Ad-hoc and complex queries • Expensive operations: aggregation, and joins • the fact table participates in every join • Figure ?? Karim Ali & Sarah Nadi
DW: Design • ROLAP • Relational implementation of DW • Multidimensional view of data is achieved through star scheme Karim Ali & Sarah Nadi
DW: Indexes Karim Ali & Sarah Nadi
DW: Materialized Views Karim Ali & Sarah Nadi
DW: Partitioning Karim Ali & Sarah Nadi
DW: Clustering Karim Ali & Sarah Nadi
DW: Summary Karim Ali & Sarah Nadi
XML Databases • XML-enabled DBs: • Maps XML documents to relational tables • Native XML DBs: • Data structures store actual XML Karim Ali & Sarah Nadi
XML DBs: Indexes • Same index structures can be used • Need adjustments • Need a numbering schema for the XML nodes Karim Ali & Sarah Nadi
XML DBs: Materialized Views Karim Ali & Sarah Nadi
XML DBs: Paritioning Karim Ali & Sarah Nadi
XML DBs: Clustering Karim Ali & Sarah Nadi
XML DBs: Summary Karim Ali & Sarah Nadi
Future Work & Open Problems Karim Ali & Sarah Nadi
Future Work • Looking at automating physical design (put some examples of work here and say its time permitting) Karim Ali & Sarah Nadi
Open Problems in Physical Design Karim Ali & Sarah Nadi
Summary & Conclusions Karim Ali & Sarah Nadi
Big summary table(s) Karim Ali & Sarah Nadi
Conclusions Karim Ali & Sarah Nadi
Thank you Karim Ali & Sarah Nadi
References Karim Ali & Sarah Nadi