Loading…
Apache: Big Data 2016 has ended
Register Now or Visit the Website for more Information 
Spark [clear filter]
Tuesday, May 10
 

10:00am PDT

Clickstream Analysis with Apache Spark - Andreas Zitzelsberger, QAware GmbH
On large-scale web sites, users leave thousands of traces every second. Businesses need to process and interpret these traces in real-time to be able to react on the behavior of their users.
In this talk, Andreas will show a real world example of the power of a modern open-source stack.
He will walk you through the design of a real-time clickstream analysis PAAS solution based on Apache Spark, Kafka, Parquet and HDFS, explain our decision making and present our lessons learned.

Speakers
avatar for Andreas Zitzelsberger

Andreas Zitzelsberger

Principal Software Architect, QAware GmbH
Andreas is Principal Software Architect at QAware, an independent cloud native software manufacturer that has been repeatedly awarded Best IT Workplace in Germany. His focus is cloud native computing in all its glory. He is responsible for the heavy lifting at a large-scale cloud... Read More →



Tuesday May 10, 2016 10:00am - 10:50am PDT
Plaza C

3:00pm PDT

Real Time BOM Explosions with Apache Solr and Spark - Andreas Zitzelsberger, QAware GmbH
Bill of materials (BOMs) are at the heart of every manufacturing process. Especially large BOMs can be found in the automotive industry, where a complex and highly variable product meets high production volumes.
Drawing from the experiences made in an ongoing real world project for a major car manufacturer, Andreas will provide an in-depth view how Apache Solr and Apache Spark were used to power an innovative architecture that provides lightning-fast BOM explosions, demand forecasts and scenario-based planning on 20 billion records per scenario.

Speakers
avatar for Andreas Zitzelsberger

Andreas Zitzelsberger

Principal Software Architect, QAware GmbH
Andreas is Principal Software Architect at QAware, an independent cloud native software manufacturer that has been repeatedly awarded Best IT Workplace in Germany. His focus is cloud native computing in all its glory. He is responsible for the heavy lifting at a large-scale cloud... Read More →



Tuesday May 10, 2016 3:00pm - 3:50pm PDT
Plaza C
 
Wednesday, May 11
 

10:50am PDT

Spark After Dark 2.0: Complete End-to-End, Real-time Advanced Analytics, Big Data Reference Pipeline Including Machine Learning, Graph Processing, and Text/NLP Analytics, and Streaming Approximations Using Kafka, Spark Streaming, Spark ML, Spark SQL - Chr
The audience will participate in a live, interactive demo that generates personalized, real-time recommendations using the latest open source streaming and big data processing tools available. We’ll dive deep into not only the architecture and application code, but also the Spark, Cassandra, and ElasticSearch internal codebases that power this awesome combination of technologies. All code and demos are available on Github and DockerHub. Follow the links @ advancedspark.com.

Speakers
avatar for Chris Fregly

Chris Fregly

Solution Architect, AI and machine learning, AWS


Wednesday May 11, 2016 10:50am - 11:40am PDT
Plaza C
 
Filter sessions
Apply filters to sessions.