1 / 19

Introduction to HDF5: Mathematical Concepts and Fundamental Objects

This session provides an overview of HDF5, focusing on the mathematical concepts behind its structure and the fundamental objects it uses. Topics include groups, datasets, and the concept of fiber bundles.

chargis
Download Presentation

Introduction to HDF5: Mathematical Concepts and Fundamental Objects

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Introduction to HDF5Session ?HDF5 Mathematical Concepts

  2. Fundamental HDF5 Objects • Groups • Containers of links • Allow creating arbitrary directed graphs, including non-treelike and cyclic structures • Datasets • Multi-dimensional arrays (currently) • Based on mathematical concept of “fiber bundle” – representing the values of a field over a space

  3. Groups - Overview • Groups are container objects in a file that follow a “set” data structure semantic: • Groups contain links • No two links in a group can have the same name • Links have two components: • Name • Destination • Three types of links currently: • Hard – Destination is object in same file • Soft – Destination is path to object in same file • External – Destination is path to object in another file

  4. Groups • Tree, with hard links

  5. Groups • Non-Tree, with hard links

  6. Groups • Cyclic, with hard links

  7. Groups • Tree, with soft links

  8. Groups • Tree, with external links

  9. Groups - Discussion • What would happen if links didn’t have names, but objects had names? • What other types of links are useful?

  10. Datasets - Overview • Datasets are objects in an HDF5 that represent “real” application data • Array-like currently • Datasets have three components: • Dataspace describes current and maximum dimensions of array • Datatype describes type of elements in array • Elements are the values stored in the array

  11. Datasets – Measurement Example • Think of algebraic concept of independent and dependent variables • X-Y Plot:

  12. Dataset – Measurement Example, 2 • X-Y Plot data in Database:

  13. Dataset – Measurement Example, 3 • X-Y Plot data in HDF5 Dataset:

  14. Dataset – Measurement Example, 4 • In HDF5, independent variables are implicit and not stored (they are the coordinates of elements in array) • In Database, independent variables are explicitly stored in each record • A “packed” HDF5 dataset of N dimensions is up to N times smaller than database table storing the same data.

  15. Datasets - Discussion • When would storing data in a database table be better than storing the same data in an HDF5 dataset? • If you were measuring two dependent values at each coordinate, what are the trade-offs between storing them as a pair for each element in a single dataset and storing each one in a separate dataset?

  16. Review • Fundamental HDF5 Objects are: • Groups • Containers of links to objects • Create arbitrary directed graph structures • Datasets • Multi-dimensional arrays of elements • Based on mathematical concept of fiber bundles, but can be thought of in terms of independent and dependent variables

  17. Stretch Break

  18. Dataset – Fiber Bundles • HDF5 Datasets actually based on mathematical concept of “fiber bundles” A fiber bundle consists of the data (E, B, π, F), where E, B, and F are topological spaces and π : E → B is a continuous surjection satisfying a local triviality condition outlined below. The space B is called the base space of the bundle, E the total space, and F the fiber.

  19. Dataset – Fiber Bundles, 2

More Related