
Apache AsterixDB

This presentation gives an overview of the Apache AsterixDB project. It explains the AsterixDB database in terms of its functionality and capabilities.

Links for further information and connecting:

http://www.amazon.com/Michael-Frampton/e/B00NIQDOOM/
https://nz.linkedin.com/pub/mike-frampton/20/630/385
https://open-source-systems.blogspot.com/

semtechs

Presentation Transcript


  1. What Is Apache AsterixDB? ● A Big Data Management System (BDMS) ● Open source / Apache 2.0 license ● Manages semi-structured data ● Has a NoSQL style data model (ADM) ● Has an expressive and declarative query language (AQL) ● Uses a runtime query execution engine, Apache Hyracks ● Support for querying and indexing external data (e.g. HDFS)

  2. What Is Apache AsterixDB? ● Has two query languages (SQL++ and AQL) ● Scale-tested on up to 1000+ cores and 500+ disks ● Basic transactional (concurrency and recovery) capabilities ● Partitioned LSM-based data storage and indexing ● Supports efficient data ingestion ● Exploits internal data partitioning and indexes to avoid scanning whole data sets when processing queries

  3. Asterix Data Model (ADM) ● Unusual extensions in red

  4. Asterix Built In Functions ● Numeric Functions ● Object Functions ● String Functions ● Aggregate Functions ● Binary Functions ● Comparison Functions ● Spatial Functions ● Type Functions ● Similarity Functions ● Conditional Functions ● Tokenizing Functions ● Miscellaneous Functions ● Temporal Functions

  5. AsterixDB HTTP API ● Examples of HTTP API queries using curl
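
The slide's curl examples are not reproduced in this transcript. As a rough sketch of the same idea, the Python below builds the kind of form-encoded POST that `curl --data-urlencode "statement=..."` would send. It assumes a local AsterixDB instance exposing its query service at `http://localhost:19002/query/service`; the host, port and the helper names `build_query_request` and `run_query` are illustrative assumptions, not taken from the slides.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen

# Assumption: a local instance with the query service on port 19002.
QUERY_SERVICE_URL = "http://localhost:19002/query/service"


def build_query_request(statement, url=QUERY_SERVICE_URL):
    """Build a POST request carrying the statement as a form-encoded field."""
    body = urlencode({"statement": statement}).encode("utf-8")
    headers = {"Content-Type": "application/x-www-form-urlencoded"}
    return Request(url, data=body, headers=headers, method="POST")


def run_query(statement, url=QUERY_SERVICE_URL):
    """Send the statement and return the decoded JSON response (needs a running instance)."""
    with urlopen(build_query_request(statement, url)) as response:
        return json.loads(response.read().decode("utf-8"))


# Example call, only meaningful against a running instance:
#   run_query("SELECT VALUE 1;")
```

Keeping request construction separate from the network call makes the encoding easy to inspect without a server.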

  6. AsterixDB CSV Load Example ● Create a dataverse / type and dataset
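
The statements shown on this slide are not part of the transcript. The sketch below assembles one plausible SQL++ script for the steps the slide names (create a dataverse, a type and a dataset, then load a CSV file through the `localfs` adapter). The names `csv`, `csv_type`, `csv_set`, the field list, the file path and the `127.0.0.1://` node prefix are all assumptions for illustration; the exact syntax may differ between AsterixDB versions.

```python
def csv_load_script(csv_path, fields, primary_key,
                    dataverse="csv", type_name="csv_type", dataset="csv_set"):
    """Return a SQL++ script: create dataverse, type and dataset, then load a CSV file.

    All identifiers are hypothetical defaults; `fields` is a list of
    (field_name, adm_type) pairs.
    """
    field_list = ", ".join(f"{name}: {adm_type}" for name, adm_type in fields)
    statements = [
        f"CREATE DATAVERSE {dataverse} IF NOT EXISTS;",
        f"USE {dataverse};",
        # A closed type only admits the listed fields, which suits fixed CSV columns.
        f"CREATE TYPE {type_name} AS CLOSED {{ {field_list} }};",
        f"CREATE DATASET {dataset}({type_name}) PRIMARY KEY {primary_key};",
        # Assumption: the file is readable by the node reachable at 127.0.0.1.
        f"LOAD DATASET {dataset} USING localfs "
        f'(("path"="127.0.0.1://{csv_path}"),("format"="delimited-text"));',
    ]
    return "\n".join(statements)


script = csv_load_script("/tmp/sample.csv",
                         [("id", "int64"), ("name", "string")],
                         primary_key="id")
print(script)
```

The resulting script would be submitted as a single request so that the `USE` statement applies to the statements that follow it.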

  7. AsterixDB Full Text Queries ● Searching for words in text rather than sub strings
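
To make the word versus substring distinction concrete, here is a small Python illustration. It is not AsterixDB's tokenizer, only a simplified stand-in that splits on non-word characters. The final constant shows how such a search might be phrased with the `ftcontains` function; the dataset name `Messages` and field `message` are hypothetical.

```python
import re


def contains_substring(text, fragment):
    """Substring match: true if the fragment appears anywhere, even inside a longer word."""
    return fragment.lower() in text.lower()


def contains_word(text, word):
    """Word match: true only if the word appears as a whole token."""
    tokens = re.findall(r"\w+", text.lower())
    return word.lower() in tokens


# Hypothetical SQL++ query using ftcontains on an assumed dataset and field.
FULL_TEXT_QUERY = (
    'SELECT VALUE m FROM Messages m WHERE ftcontains(m.message, "sound");'
)

sample = "The soundtrack is great"
print(contains_substring(sample, "sound"))  # matches inside "soundtrack"
print(contains_word(sample, "sound"))       # no whole-word match
```

A substring search finds "sound" inside "soundtrack", while the word-level search does not, which is the behaviour the slide describes for full text queries.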

  8. AsterixDB External Data ● Built in adapters for external data sets – localfs – hdfs – socket – socket_client – twitter_push – twitter_pull – rss

  9. AsterixDB User Defined Functions ● UDFs are written in Java and stored in libraries ● Use the managix command to – Stop the AsterixDB instance – Install the UDF library – Start the AsterixDB instance ● The UDFs in the library can now be executed ● See the simplified example on the next slide ● The example applies the testlib library to a tweet feed

  10. AsterixDB User Defined Functions

      use dataverse feeds;
      drop feed ProcessedTwitterFeed if exists;
      create secondary feed ProcessedTwitterFeed from feed TwitterFeed apply function testlib#addHashTags;
      connect feed ProcessedTwitterFeed to dataset ProcessedTweets;

      use dataverse feeds;
      for $i in dataset ProcessedTweets limit 10 return $i;

  11. Available Books ● See “Big Data Made Easy” – Apress Jan 2015 ● See “Mastering Apache Spark” – Packt Oct 2015 ● See “Complete Guide to Open Source Big Data Stack” – Apress Jan 2018 ● Find the author on Amazon – www.amazon.com/Michael-Frampton/e/B00NIQDOOM/ ● Connect on LinkedIn – www.linkedin.com/in/mike-frampton-38563020

  12. Connect ● Feel free to connect on LinkedIn – www.linkedin.com/in/mike-frampton-38563020 ● See my open source blog at – open-source-systems.blogspot.com/ ● I am always interested in – New technology – Opportunities – Technology based issues – Big data integration
