1 / 10

Large Instance Points

Large Instance Points. 16th Eurofiling Workshop Wednesday 12 December Herm Fischer Mark V Systems Limited and Arelle open source XBRL processor. Study results. RAM consumption vs. instance size Instance XML to DOM (XML only) 1 : 3 to 1 : 10 Instance XBRL to formula processor 1 : 60

tia
Download Presentation

Large Instance Points

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Large Instance Points 16th Eurofiling Workshop Wednesday 12 December Herm Fischer Mark V Systems Limited and Arelle open source XBRL processor

  2. Study results • RAM consumption vs. instance size • Instance XML to DOM (XML only) 1 : 3 to 1 : 10 • Instance XBRL to formula processor 1 : 60 • Instance XBRL to SAX object model 1 : 15 • Conclusion • Constant-memory streaming approach suggested • Non-XML technologies eventually required

  3. US SEC form “SD” • Mining and oil exploration “payout” details • Sample size: • Instance: 21.7 MB • Every filer uses extension taxonomy • 150,006 facts • 21,001 contexts • 5 units • 0 footnotes

  4. Streaming XBRL in XML syntax • Base spec 2.1 working group note (WGN) & task force • Compatible organization of instance for streaming • SAX vs DOM improves speed, no XML persistence • Constant memory usage • Use for 2.1 & XDT validation • Challenges for • Financial validation (~GFM) • Formula processing • XPath (XML node access)

  5. XBRL streamability issues • Order • Freedom to order facts, contexts, units, footnotes • XML syntax detail • Formula & Table Xpath to nodes and XML structure • Validation and formula strategies • Designed for complete instance in memory • Complex fallback and existence strategies

  6. XBRL streaming approach • Constant memory • Backwards compatibility • Order constrained within instances • Contexts/units located as needed in instance

  7. Financial Validation EFM/GFM • Full object models analysis in memory for • Context, unit, fact duplications (could use hashes) • Fact cross-dimension analysis (only some concepts) • History of concepts used • Roll ups, roll forwards, aggregations • Full DTS model in memory for • Concept issues, label/definition issues • Missing / improper calculations, roll-ups, roll-fwds • Can be re-architected for streaming environment

  8. Global Ledger Architecture • Multiple content models required in parallel • Transactions • Company data • Account data • Reformulate for independent streams or persistent company/account models.

  9. Formula Response • Define subset of XPath for streamed processor • No node-axis features (“/” or “[“ operators) • More functions for context, typed dimensions, etc • Should allow use of non-XML implementations • Define subset of formula processing • Consider SQL infrastructures • Consider OLAP features • Reconsider use of features like • Fallbacks • Multiple instances of large data sets

  10. Abstract Model Response • Abstract model is based on OMG MOF & CWM • Abstracted XBRL semantics from syntax • Implementation will layer on XML as well as • CWM (OLAP supportive technologies) • Next step is a pilot project • Implement abstract model demonstration • Evolve and tweak specs • Provide prototype implementation

More Related