Apache: Big Data 2016: Full Schedule

Register Now or Visit the Website for more Information

9:00am PDT

Getting Started with Apache OODT - Tom Barber, Meteroite Consulting (Additional Fee)

With data becoming more and more prevalent along with a requirement to store it managing it becomes a ever greater problem. How can Apache OODT fill that void?

Apache OODT is a distributed data processing and management platform. In this talk we’ll go through installation and configuration. How to start a project, deploy and test a project. We’ll run through the various components you’re likely to use, how to customise them and make your users embrace data management. We’ll also take a look at workflows, resources and how to build simple workflows. During this presentation we’ll also connect Apache OODT to a number of different data sources to demonstrate data ingestion and metadata capture. Finally, of course it’s all well and good capturing data, but how do you get data out to your end users? We’ll go through the options for data extraction and dissemination to end users.

Speakers

Tom Barber

Technical Director, Spicule LTD

Tom Barber is the director of Meteorite BI and Spicule BI. A member of the Apache Software Foundation and regular speaker at ApacheCon, Tom has a passion for simplifying technology. The creator of Saiku Analytics and open source stalwart, when not working for NASA, Tom currently deals... Read More →

Thursday May 12, 2016 9:00am - 12:00pm PDT
Plaza A

Tutorial, Beginner

9:00am PDT

Interactive Data Science from Scratch with Apache Zeppelin and Apache Spark - Felix Cheung (Additional Fee)

How do you find the needle in the haystack?

With Big Data, finding insight is a big problem. Visualization and exploratory analysis help convert on insights and Apache Zeppelin (incubating) is an essential tool for that.

In this tutorial, Felix Cheung will introduce you to Apache Zeppelin, and provide step-by-step guides to get you up-and-running with Apache Zeppelin to run Big Data analysis with Apache Spark.

This is going to be a heavily hands-on session, no previous experience with Zeppelin, Data Science, or Statistics necessary. Bring your laptop - attendees are expected to be able to handle some software installation steps.

You can view the materials here:
http://www.slideshare.net/felixcss/interactive-data-science-from-scratch-with-apache-zeppelin-and-apache-spark

Speakers

Felix Cheung

Engineering Manager, Uber

Felix started in the big data space about 5 years ago with the then state-of-the-art MapReduce. Since then, he (re-)built Hadoop cluster from metal more times than he would like, created a Hadoop “distro” from two dozens or so projects into .rpm/.deb, and kicked off clusters in... Read More →

Thursday May 12, 2016 9:00am - 12:00pm PDT
Lord Byron

Tutorial, Beginner