Loading…
This event has ended. Create your own event → Check it out
This event has ended. Create your own
Register Now or Visit the Website for more Information 
View analytic
Tuesday, May 10 • 10:00am - 10:50am
Apache Hive 2.0 SQL Speed Scale - Alan Gates, Hortonworks

Sign up or log in to save this to your schedule and see who's attending!

Apache Hive is the most commonly used SQL interface for Hadoop. To meet users data warehousing needs it must scale to petabytes of data,
provide the necessary SQL, and perform in interactive time. The Hive community is working towards a 2.0 release of Hive that includes significant improvements. These include:
* LLAP, a daemon layer that enables sub-second response time.
* HBase to store Hive’s metadata, resulting in significantly reduced planning time.
* Expanding Hive’s support for managing changing data in a transactionally consistent way with SQL MERGE.
* Using Apache Calcite to enable Hive to use multiple storage engines (e.g. HBase)
This talk will cover the use cases these changes enable, the architectural changes being made in Hive as part of building these features, and share performance test results on how these improvements are speeding up Hive.

Speakers
avatar for Alan Gates

Alan Gates

Co-founder & Executive Chairman, Hortonworks
Alan is a founder of Hortonworks and an original member of the engineering team that took Pig from a Yahoo! Labs research project to a successful Apache open source project. Alan has done extensive work in Hive, including adding ACID transactions. Alan has a BS in Mathematics from Oregon State University and a MA in Theology from Fuller Theological Seminary. He is also the author of Programming Pig, a book from O’Reilly Press.


Tuesday May 10, 2016 10:00am - 10:50am
Georgia A

Attendees (31)