Loading…
Apache: Big Data 2016 has ended
Register Now or Visit the Website for more Information 

Sign up or log in to bookmark your favorites and sync them to your phone or calendar.

SQL Interaction [clear filter]
Tuesday, May 10
 

9:00am

SQL on Hadoop/Big Data - Architecture, Technology and Roadmap - Sumit Pal, Big Data Consultant
Talk Topic - "SQL on Hadoop - Architecture, Technology and Road Ahead"

This talk - will give an exhaustive overview of how SQL is done on Hadoop more foccused
on low latency SQL on Hadoop.
The various open source and commercial tools to perform SQL on Hadoop and their
internal architectures. The tools cover - Hive, Hive on Tez, Spark SQL, Impala, Apache
Drill, Presto, Tachyon based architecture etc.

The talk also covers how SQL can be used for Structured, UnStructured and Streaming
Data the concepts behind them and shows demo of using SQL - for JSON, Structured and
Streaming Data.

The talk also covers the changes coming in this field - with products like OLAP
on Hadoop, BlinkDB, NuoDB and HTAP based solutions.

Speakers
SP

Sumit Pal

Big Data Consultant
Sumit has more than 22 years of experience in the Software Industry in various roles spanning companies from startups to enterprises. He is a big data, visualisation and data science consultant and a software architect and big data enthusiast and builds end-to-end data-driven analytic... Read More →



Tuesday May 10, 2016 9:00am - 9:50am
Georgia A

2:00pm

Hive on ACID - Alan Gates, Hortonworks
Apache Hive provides SQL access for data in Hadoop. Traditionally data in Hadoop is write once read many. But with traditional data
warehousing use cases moving to Hadoop there is a need to support transactional update and delete of records. Hive has recently implemented
ACID compliant row level insert, update, and delete as well as very low latency ingestion of streaming data from tools like Storm and Flume. This is done with snapshot isolation between queries. This talk will cover the intended use cases, architectural challenges of implementing updates and deletes in a write-once file system, and details of changes to the file storage formats and transaction management system.

Speakers
avatar for Alan Gates

Alan Gates

Co-founder and Architect, Hortonworks
Alan is a founder of Hortonworks and an original member of the engineering team that took Pig from a Yahoo! Labs research project to a successful Apache open source project. Alan has done extensive work in Hive, including adding ACID transactions. Alan has a BS in Mathematics from... Read More →


Tuesday May 10, 2016 2:00pm - 2:50pm
Georgia A