slide1 n.
Download
Skip this Video
Loading SlideShow in 5 Seconds..
Real-Time Big Data Meetup , March 2013 PowerPoint Presentation
Download Presentation
Real-Time Big Data Meetup , March 2013

Loading in 2 Seconds...

play fullscreen
1 / 14

Real-Time Big Data Meetup , March 2013 - PowerPoint PPT Presentation


  • 67 Views
  • Uploaded on

Apache Hive What to Expect in the Next Release Carl Steinbach. Real-Time Big Data Meetup , March 2013. Speaker Bio: Carl Steinbach. Currently: Engineer @ Citus Data PMC Chair, Committer -- Apache Hive Project Formerly: Cloudera, Informatica, NetApp, Oracle

loader
I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
capcha
Download Presentation

PowerPoint Slideshow about 'Real-Time Big Data Meetup , March 2013' - kermit-wooten


An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript
slide1

Apache Hive

What to Expect in the Next Release

Carl Steinbach

  • Real-Time Big Data Meetup, March 2013
slide2

Speaker Bio: Carl Steinbach

  • Currently:
  • Engineer @ Citus Data
  • PMC Chair, Committer -- Apache Hive Project
  • Formerly:
  • Cloudera, Informatica, NetApp, Oracle
  • Contact:
  • Twitter: @cwsteinbach
  • LinkedIn: carlsteinbach
slide3

What is Apache Hive?

  • SQL to MapReduce
  • (OLAP, not OLTP)
  • MetaStore
  • Format Handlers
slide4

What’s New?

HiveServer2

- Committed earlier today…

slide5

What’s New?

HCatalog

- Is Merging into Hive…

slide6

What’s New?

Columnar Formats

- Optimized Row Columnar Format (ORC)

- Parquet

slide7

What’s New?

  • Analytic SQL
  • Work in progress on feature branch
  • HIVE-896
slide8

What’s New?

Better Query Plans

HIVE-3784, HIVE-2340, HIVE-3952, HIVE-HIVE-3562, HIVE-3972, HIVE-3841, HIVE-948, HIVE-2340, HIVE-3891, …

slide9

What’s New?

Smarter Query Compiler

MapJoin hint inferred automatically in most cases (HIVE-3784, HIVE-3403)

slide10

What’s on the Horizon?

New Runtime Framework

Apache Tez…

slide11

What’s on the Horizon?

Vectorized Query Execution

slide12

Real-time SQL on Hadoop

CitusDB, Impala, Apache Drill, …

What matters:

Data Locality

Block aware query planner

slide13

Monthly Hive Meetups in the Bay Area

Hive User Group Meetup

Hive Contributors Group Meetup

slide14

We’re Hiring

  • citusdata.com/job