Tuesday, May 10 • 3:00pm - 3:50pm
Using Kafka and Kudu for Fast, Low-latency SQL Analytics on Streaming Data - Mike Percy & Ashish Singh, Cloudera

Apache Kudu (incubating) is a fast new columnar data store for the Hadoop ecosystem, designed to enable high-performing, flexible analytic pipelines. In this talk, Mike Percy and Ashish Singh will demonstrate how Apache Kafka can be combined with Kudu to achieve low-latency, high-throughput analytics on streaming data. We will compare various approaches to building such a solution and demonstrate a working system for analyzing tweets in real time by combining Kafka, Kudu, and Apache Impala (incubating).

Speakers
Mike Percy

Software Engineer, Cloudera
Mike Percy is a software engineer at Cloudera and a PMC member of Apache Kudu, an open source distributed column store for the Hadoop ecosystem. He is also a PMC member of Apache Flume. Prior to joining Cloudera, Mike worked at Yahoo! building machine learning infrastructure for Big Data. Mike holds a BSCS from UC Santa Cruz and an MSCS from Stanford.
Ashish Singh

Software Engineer, Cloudera
Ashish Singh is a Software Engineer, working with Cloudera to empower the Hadoop ecosystem to answer bigger questions. Ashish studied Computer Science and Engineering at Ohio State University. Before working in the Big Data space, he worked on optimizing MPI collective communications on High Performance Computing clusters. As part of the ingest team at Cloudera, he is interested in making data movement easier in large-scale data architectures...



Tuesday May 10, 2016 3:00pm - 3:50pm
Georgia A