1 / 11

XML et. al vs Tables

XML et. al vs Tables. Michael Stonebraker Adjunct Professor Massachusetts Institute of Technology (stonebraker@lcs.mit.edu). Tables. Good for structured data remember Codd’s laws Schema first have to think about your data up front Data integrity key not loosey-goosey semantics. XML.

lavada
Download Presentation

XML et. al vs Tables

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. XML et. al vs Tables Michael Stonebraker Adjunct Professor Massachusetts Institute of Technology (stonebraker@lcs.mit.edu)

  2. Tables • Good for structured data • remember Codd’s laws • Schema first • have to think about your data up front • Data integrity key • not loosey-goosey semantics

  3. XML • Good for documents • Bad for data (Codd’s laws) • Schema later ok • Loosey-goosey ok

  4. Applicability lots Tables data volume XML little lots little structure

  5. Why? • Low end content uses HTML -> XML • High end content uses DBMS • invariably structured • High end business data processing uses DBMS • Low end may well use XML • because it is easier

  6. Which is More Important? • Internet has generated lots of HTML content (on the left) • As well as lots of click stream data (on the right) • Which is more important depends on metric • bytes represented? • bytes * monetary value of a byte? • market value?

  7. Nothing is New • Document systems have never used data bases • IR community versus DBMS community • XML will allow better IR • Outer fringes have never used data bases • in the 1970’s it was airline reservation systems • in the 1990’s it was web crawlers

  8. But DBMS is “Good” • Makes you think about you data up front • good discipline (used by business data processing folks) • registers meta data for downstream data sharability • Transactions, recovery, etc. are good

  9. Battleground Between The Two Approaches • Corporate media asset management • Scientific data

  10. How Can DBMS Do Better? • Become easier to use • Better tools

  11. My Main Fear • Two fiefdoms are incompatible • shades of Corba and Com • cross fiefdom joins may not work well • DBMS will export XML • no guarantee that client won’t update it and hand it back • horrible view update problem….

More Related