Apache: Big Data 2016 has ended
Register Now or Visit the Website for more Information 

Sign up or log in to bookmark your favorites and sync them to your phone or calendar.

Streams [clear filter]
Monday, May 9

3:00pm PDT

Generating Many Resources from One Set of Schemas with Apache Streams - Steve Blackmon, People Pattern
Apache has many good programming languages, databases, and analytics libraries. Most have some unique competency or value that justifies their application in certain situations. Use the right tool for the right job. However, mastering the data definition file formats of multiple platforms and keeping representations of your data (and partner data) current can be challenging and tedious.

Apache Streams (incubating) contains libraries and patterns for specifying, publishing, and inter-linking data schemas, and can convert data between the representation, format, and encoding preferred by supported platforms. The talk will cover using Streams to specify your object schemas, bind them across languages (Java, Scala), serializations (JSON, XML), databases (Cassandra, Elasticsearch, Mongo, HBase), and analytics tools (Spark, Pig, Hive), as well as re-use object definitions created by others.

avatar for Steve Blackmon

Steve Blackmon

VP Technology, People Pattern, Inc.
VP Technology at People Pattern, previously Director of Data Science at W2O Group, co-founder of Ravel, stints at Boeing, Lockheed Martin, and Accenture. Committer and PMC for Apache Streams (incubating). Experienced user of Spark, Storm, Hadoop, Pig, Hive, Nutch, Cassandra, Tinkerpop... Read More →

Monday May 9, 2016 3:00pm - 3:50pm PDT
Plaza A

4:10pm PDT

Designing Workflows with OODT - Tom Barber, Meteroite Consulting
When building a data management platform, flexible and effective workflows are key to the scalability and effectiveness of the platform.

OODT (originally developed by NASA JPL) has a very flexible and powerful workflow engine and is at the core of pretty much any data processing you will do within the platform but understanding it can sometimes be a challenge.

In this talk we’ll take a deep dive into guts of workflows inside OODT using CAS PGE to help lower the barrier for entry. We’ll run through a number of real world examples. How you build them, how you deploy and trigger them.

We’ll also look at monitoring and feedback. Lastly we’ll tackle resource management and how you make sure your workflows run in the correct server pool, without swamping your resources.

avatar for Tom Barber

Tom Barber

Technical Director, Spicule LTD
Tom Barber is the director of Meteorite BI and Spicule BI. A member of the Apache Software Foundation and regular speaker at ApacheCon, Tom has a passion for simplifying technology. The creator of Saiku Analytics and open source stalwart, when not working for NASA, Tom currently deals... Read More →

Monday May 9, 2016 4:10pm - 5:00pm PDT
Plaza A