1 / 16

Indexing of XML Data

Indexing of XML Data . Raghuraman Rangarajan KReSIT, IIT Bombay. Plan of Talk. Why is indexing needed? Queries and Indexes in Traditional DBMS Querying in XML Indexes: Path, Value Conclusion. Why is Indexing Needed?.

stash
Download Presentation

Indexing of XML Data

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Indexing of XML Data Raghuraman Rangarajan KReSIT, IIT Bombay. XML Workshop, IIT Bombay

  2. Plan of Talk • Why is indexing needed? • Queries and Indexes in Traditional DBMS • Querying in XML • Indexes: Path, Value • Conclusion XML Workshop, IIT Bombay

  3. Why is Indexing Needed? • Allows fast access to data by replicating portions of the data in special purpose structures. • Despite the additional cost (storage, maintenance and complexity) they have shown to be useful in evaluating queries. XML Workshop, IIT Bombay

  4. Queries and Indexes in Traditional DBMS XML Workshop, IIT Bombay

  5. An XML Fragment part subpart part name supplier supplier name name subpart supplier name name address name supplier address address name address name (with leaf values omitted) XML Workshop, IIT Bombay

  6. Queries in XML • SELECT X • FROM part._*.supplier.name X 2. Select X From part._*.supplier: {name X, address: “Mumbai”} XML Workshop, IIT Bombay

  7. Indexes for XML • Path indexes: regular path expressions • Value Indexes: locating atomic objects XML Workshop, IIT Bombay

  8. Building A Path Index part subpart part name supplier supplier name name subpart name name address name supplier address address name address name h1 part subpart h2 name supplier subpart name supplier h6 h3 name address name supplier h4 name address h7 name address h5

  9. Path Index h1 part subpart h2 name supplier subpart name supplier h6 h3 name address name supplier h4 name address h7 name address h5 • Index summarises path information • Each entry: list of pointers to data nodes XML Workshop, IIT Bombay

  10. Using Path Index for Regular Path Expressions h1 part subpart h2 name supplier subpart name supplier h6 h3 name address name supplier h4 name address h7 name address h5 (R1) part.name (R2) part.supplier.name (R3) _*.supplier.name (R4) part._*.subpart.name XML Workshop, IIT Bombay

  11. Path Indexes • XSet project (Berkeley) • Dataguides (Lore, Stanford) XML Workshop, IIT Bombay

  12. Value Index • Useful for comparisons (=, <, etc.) • Example: Find supplier whose name is “XYZ”? part subpart VIndex(name) supplier supplier name address name address “XYZ” “ABC” XML Workshop, IIT Bombay

  13. Other Indexes • Text Indexes: Information retrieval style keyword search. Example: Find the suppliers in Mumbai(“address”) Also supports search features like AND, OR, NEAR, etc. XML Workshop, IIT Bombay

  14. Conclusion • Performance improves significantly when indexing is used for query processing (Lore). • Performance of the path indexes depends on the type of queries. XML Workshop, IIT Bombay

  15. References • The Lore Project (www-db.stanford.edu/lore) • Work done by Dan Suciu (www.research.att.com/~suciu/) • Data on the Web: Serge Abiteboul, et al. XML Workshop, IIT Bombay

  16. Indexing of XML Data Raghuraman Rangarajan KReSIT, IIT Bombay. XML Workshop, IIT Bombay

More Related