Loading…
Apache: Big Data 2016 has ended
Register Now or Visit the Website for more Information 
Tuesday, May 10 • 11:20am - 12:10pm
Streaming Data Integration at Scale with Kafka - Ewen Cheslack-Postava, Confluent

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

The last decade as seen a dramatic shift in the complexity of data pipelines. Data is stored in more systems, queried in more ways, and comes from more sources. Complex data pipelines combined with the need for applications that can analyze and respond to that data in real-time leave traditional approach to data integration struggling to keep up.

This talk will describe how data integration is shifting to a streaming model and how Kafka supports this new model. Specifically, it will focus on a new tool included with Kafka, Kafka Connect, that handles streaming "E" and "L". It will describe Kafka Connect’s data and execution models, which provide scalable fault-tolerant import and export between Kafka and other data systems. Finally, it will show how this can be combined with other tools such as stream processing frameworks to create a complete streaming data integration solution.

Speakers
EC

Ewen Cheslack-Postava

Confluent
Ewen Cheslack-Postava is a Kafka committer and engineer at Confluent building a stream data platform based on Apache Kafka to help organizations reliably and robustly capture and leverage all their real-time data. He received his PhD from Stanford University where he developed Sirikata... Read More →



Tuesday May 10, 2016 11:20am - 12:10pm PDT
Regency A