N oSQL for the SQL Server Pro
DESCRIPTION
N oSQL for the SQL Server Pro. Lynn Langit. Feb 2013 – SDC, Sweden. Is NoSQL just Hadoop? . HUGE Hype factor over last few years. Apache Hadoop is a software framework that supports data-intensive distributed applications under a free license
1 / 0
Download Presentation
N oSQL for the SQL Server Pro
An Image/Link below is provided (as is) to download presentation
Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.
Content is provided to you AS IS for your information and personal use only.
Download presentation by click this link.
While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.
During download, if you can't get a presentation, the file might be deleted by the publisher.
E N D
Presentation Transcript
-
NoSQL for the SQL Server Pro
Lynn Langit Feb 2013 – SDC, Sweden - Is NoSQL just Hadoop? HUGE Hype factor over last few years Apache Hadoop is a software framework that supports data-intensive distributed applications under a free license enables applications to work with thousands of nodes and petabytes of data was inspired by Google's MapReduce and Google File System (GFS) papers
- Hadoop in the Enterprise
- Working with Hadoop Common Tools / Languages Java (JDK) / Eclipse MapReduce Map (query/format) Reduce (aggregate) plug-in for Eclipse (Java) Pig (ETL -- Java) Hive (HQL Query) HBase tables Others Mahout (analyze) Karmasphere (analyze) R (analyze)
- Demo -HDInsight– Cluster Allocation
- What is the relationship?
- BigData = Exponentially More Data Retail Example -> ‘Feedback Economy’ Number of transactions Number of behaviors (collected every minute)
- BigData = ‘Next State’ Questions
- Demo - HDInsight - MapReduce
- Hitting (Relational) Walls CA Highly-available consistency CP Enforced consistency AP Eventual consistency
- So many NoSQL options More than just the Elephant in the room Over 120+ types of NoSQL databases
- Flavors of NoSQL
- Key / Value Database Schema-less State (Persistentor Volatile) Examples AWS Dynamo DB Riak
- Column Database Wide, sparse column sets Examples: Cassandra HBase BigTable GAE HR DS Azure Tables SQL 2012 Tabular Model
- More about Column Databases Type A Column-families Non-relational Sparse Examples: HBase, Cassandra, xVelocity (SQL 2012 Tabular) Type B Column-stores Relational Dense Example: SQL Server 2012 Columnstore index
- Demo - Document Database (Mongo DB) document-oriented (collection of JSON documents)w/semi structured data Encodings includeBSON, JSON, XML… binary forms PDF, Microsoft Office documents -- Word, Excel…)
- Demo - Graph Database (Neo4j) a lot of many-to-many relationships recursive self-joins when your primary objective is quickly finding connections, patterns and relationships between the objects within lots of data
- So which type of NoSQL? Back to CAP… CP = NoSQL/column Hadoop Big Table H-baseMemCacheDB CA = SQL/RDBMS SQL Sever / Oracle MySQL AP = NoSQL/document or key/value DynamoDB CouchDB CassandraVoldemort
- Which type of NoSQL for which type of data?
- Cloud-hosted NoSQL up to 50x CHEAPER
- The reality…two pivots
- NoSQL (Cloud) BLOB Storage Buckets Amazon – S3 or Glacier The gold standard Google – Cloud Storage Free for developers Microsoft Azure BLOBS DropBox, Box…
- Cloud-hosted RDBMS AWS RDS – SQL Server, mySQL, Oracle Medium cost Solid feature set, i.e. backup, snapshot Use existing tooling Google – mySQL Lowest cost Most limited RDBMS functionality Microsoft – SQLAzure Highest cost
- Demo - AWS RDS SQL Server, MySQL or Oracle Essential to understand pricing models
- Cloud Offerings– RDBMS AND NoSQL
- Data Scientists…
- Comparing…
- Karmasphere Studio for AWS
- Hadoop Connector to Excel
- Google BigQuery Hadoop-like (Dremel) based service For massive amounts of data SQL-like query language
- Dremel Realized => Impala Interactive Hadoop?
- Other types of cloud data services
- NoSQL To-Do List
- The Changing Data Landscape
- recipes) www.TeachingKidsProgramming.org Free Courseware ( Do a Recipe Teach a Kid (Ages 10 ++) Java or Microsoft SmallBasic
- Toward Data Craftsmanship…
More Related