Apache: Big Data 2016 has ended
Register Now or Visit the Website for more Information 

Sign up or log in to bookmark your favorites and sync them to your phone or calendar.

Keynote [clear filter]
Monday, May 9

9:00am PDT

Welcome & Opening Remarks - Rich Bowen, Executive Vice President, Apache Software Foundation
avatar for Rich Bowen

Rich Bowen

Community Architect, Red Hat
Rich has been involved in Open Source since before it had that name. He's a director at the Apache Software Foundation, and a community architect at Red Hat.

Monday May 9, 2016 9:00am - 9:05am PDT
Regency CD

9:05am PDT

Keynote: How Netflix Leverages Big Data - Brian Sullivan, Director of Streaming Analytics, Netflix
Netflix is the world's leading internet television network. That didn't happen by accident or simple fortune - we are data-driven as part of our culture, and have built the tools needed to navigate the unchartered waters of delivering internet video at scale and becoming the first truly global storyteller in movies and television.

avatar for Brian Sullivan

Brian Sullivan

Director of Streaming Analytics, Netflix
Brian Sullivan is the Director of the Streaming Data Science and Engineering team at Netflix, the world’s leading Internet television network. His team builds analytic systems and delivers insight into the streaming activity across hundreds of client devices, world-class server... Read More →

Monday May 9, 2016 9:05am - 9:25am PDT
Regency CD

9:30am PDT

Keynote: How Enterprises are Leveraging Open Source Analytics Platforms for Making Game Changing Decisions - Luciano Resende, Architect, Spark Technology Center, IBM
In this Keynote, Luciano Resende, Architect, Spark Technology Center at IBM, will showcase Open source Analytic platforms.  Luciano will also discuss how they are being leveraged by different organizations to upend their competition, as well as enable new use cases.

avatar for Luciano Resende

Luciano Resende

Architect, Spark Technology Center, IBM
Luciano Resende is an Architect in IBM Analytics. He has been contributing to open source at The ASF for over 10 years, he is a member of ASF and is currently contributing to various big data related Apache projects including Spark, Zeppelin, Bahir. Luciano is the project chair for... Read More →

Monday May 9, 2016 9:30am - 9:45am PDT
Regency CD

9:50am PDT

Keynote: It Takes a Village: Making Data Projects Work - Amy Gaskins, Big Data Project Director
We've all seen an innovative data project that fails and all too often the reason isn't a lack of technical skill among the team members. In order to succeed in complex organizations, data project teams require both diversity and versatility (and their sources aren't always where you might expect). From the battlefield to the boardroom, Amy's experience demonstrates that incongruous teams can achieve remarkable results.

avatar for Amy Gaskins

Amy Gaskins

Amy Gaskins has previously worked for NOAA and MetLife.  Amy was an Assistant Vice President in MetLife’s Global Technology & Operations, managing data science projects in Europe, the Middle East, and North Africa. In her previous government service, Amy spent over 10 years as... Read More →

Monday May 9, 2016 9:50am - 10:10am PDT
Regency CD
Tuesday, May 10

4:15pm PDT

Keynote: ODPi 101: Who We Are, What We Do and Don't Do - Alan Gates, Co-founder, Hortonworks
It's no surprise that application developers find it difficult to keep up with the breathtakingly large ecosystem of new and emerging Hadoop-related technologies. Hadoop, its components, and Hadoop Distros, are innovating very quickly and in different ways.

What's needed to push Hadoop even further in the enterprise is standardization and simplification. That's the mission behind the new Open Data Platform initiative (ODPi) that launched last year and warrants extra explanation.

In this session, Alan Gates, Co-Founder of Hortonworks will outline why close to 30 companies came together to be part of the the nonprofit ODPi. Organized to support the ASF, ODPi promotes innovation and development of upstream projects like Hadoop and Ambari. While not a distribution, ODPi Core is a stable base against which big data solutions providers can qualify solutions over multiple Apache Hadoop® distributions. ODPi Core is a set of software components, a detailed certification and a set of open source tests to make it easier to create big data solutions and data-driven applications.

 The well-defined ODPi Core and ODPi Certification Program are designed to drive interoperability, a broad set of use cases and major growth for the big data ecosystem, not to mention a new level of choice for enterprises and end users. The reference implementation frees up developers and SIs to focus on building business-driven applications for things like fraud detection, customer behavior and data warehouse optimization

avatar for Alan Gates

Alan Gates

Co-founder and Architect, Hortonworks
Alan is a founder of Hortonworks and an original member of the engineering team that took Pig from a Yahoo! Labs research project to a successful Apache open source project. Alan has done extensive work in Hive, including adding ACID transactions. Alan has a BS in Mathematics from... Read More →

Tuesday May 10, 2016 4:15pm - 4:25pm PDT
Regency CD

4:30pm PDT

Keynote: More Fun, Less Friction: How Open Source Operations Will Take Big Data to the Next Level - Mark Shuttleworth, Founder, Canonical
avatar for Mark Shuttleworth

Mark Shuttleworth

Mark is founder of Ubuntu and leads product design at Canonical. Mark founded Thawte, an internet commerce security company in 1996 while studying finance and IT at the University of Cape Town. In 2000 he founded HBD, an investment company, and created the Shuttleworth Found... Read More →

Tuesday May 10, 2016 4:30pm - 4:40pm PDT
Regency CD

4:45pm PDT

Keynote: A Look Ahead at Spark 2.0 - Ion Stoica, Co-founder & Executive Chairman, Databricks
During this keynote talk, Ion will discuss the key features of the upcoming Apache Spark 2.0 release, and the longer term development directions.

avatar for Ion Stoica

Ion Stoica

Co-founder & Executive Chairman, Databricks
Ion Stoica is a Professor in the EECS Department at University of California at Berkeley. He does research on cloud computing and networked computer systems. Ion's past work includes Apache Spark, Apache Mesos, Tachyon, Chord DHT, and Dynamic Packet State (DPS). He is an ACM Fellow... Read More →

Tuesday May 10, 2016 4:45pm - 5:05pm PDT
Regency CD

5:10pm PDT

Lightning Talks aka Big Data Shark Tank
This year lightning talks have been overrun by sharks. Which means, at this point, you may be wondering: is it a panel? Is it a talk? It is a Big Data Shark Tank! Back by popular demand with even sharkier judges! What is it, you ask? Well, this is just like Shark Tank TV show (think speed dating between entrepreneurs and investors) but instead of “Squirrel Boss” and “Man Candle” you'll be hearing pitches for Apache Incubator  Big Data projects. Also instead of Mark Cuban and Kevin O'Leary you'll be pitching to the panel of ASF grey beards and money men (trying to convince them that your project  is worthy of their esteemed attention and endorsement). The will be snark, there may be prizes, there will be reciting of Apache Way creed. But most of all there will be fun. We guaranteed that!

avatar for Roman Shaposhnik

Roman Shaposhnik

Director of Open Source, Linux Foundation
Apache Software Foundation and Data, oh but also unikernels

avatar for Milind Bhandarkar

Milind Bhandarkar

Founder, Ampool
Milind Bhandarkar was the founding member of the team at Yahoo! that took Apache Hadoop from 20-node prototype to datacenter-scale production system. Parallel programming languages and paradigms has been his area of focus for over 20 years. He worked at several HPC companies, Yahoo... Read More →
avatar for Shane Curcuru

Shane Curcuru

Founder, Punderthings Consulting
Shane serves as V.P. of Brand Management for the ASF, setting trademark and brand policy for all 250+ Apache projects, and has served as five-time Director, and member and mentor for Conferences and the Incubator. Shane's Punderthings consultancy is here to help both companies and... Read More →
avatar for Jim Jagielski

Jim Jagielski

Developer, Uber
Jim Jagielski is a well-known and acknowledged expert and visionary in open source, an accomplished coder, and frequent engaging presenter on all things open, web, blockchain, and cloud related. As a developer, he’s made substantial code contributions to just about every core technology... Read More →
avatar for Mark Shuttleworth

Mark Shuttleworth

Mark is founder of Ubuntu and leads product design at Canonical. Mark founded Thawte, an internet commerce security company in 1996 while studying finance and IT at the University of Cape Town. In 2000 he founded HBD, an investment company, and created the Shuttleworth Found... Read More →

Tuesday May 10, 2016 5:10pm - 5:50pm PDT
Regency CD
Wednesday, May 11

9:00am PDT

Keynote: Apache Hadoop at 10 - Doug Cutting, Chief Architect, Cloudera
2016 marks the 10th Anniversary of Apache Hadoop. This birthday provides us an opportunity to celebrate, and also to reflect on how we got here and where we are going. Ten years ago, digital business was mostly limited to a few sectors, like e-commerce and media. Since then, we have seen digital technology become central to nearly every industry. Hadoop did not create this digital transformation, but it is a critical character in this larger story. Thus by exploring Hadoop’s tale we can better understand the century we are now in."

avatar for Doug Cutting

Doug Cutting

Doug (@cutting) is the founder of several successful open source projects, including Lucene, Nutch, Avro, and Hadoop. Doug joined Cloudera in 2009, after previously working at Yahoo!, Excite, Apple, and Xerox PARC. Doug holds a Bachelor’s degree from Stanford University and is the... Read More →

Wednesday May 11, 2016 9:00am - 9:20am PDT
Regency CD

9:25am PDT

Keynote: Role of Apache in Transforming eBay’s Data Platform - Seshu Adunuthula, Sr. Director of Analytics Infrastructure, eBay
eBay has one of the most sophisticated Data Platform’s in the industry with over 200PBs of data stored in our Hadoop and Teradata Warehouses. On average 30 TB of transactional and behavioral data is extracted on a daily basis and thousands of metrics are computed, analyzed and monitored for decision making and detecting anomalies. eBay has embarked on an ambitious project to transform the batch oriented ETL processes which could take 24 to 48 hour to near real time infrastructure. Apache Big Data Projects continue to play a critical role in this transformation process.

avatar for Seshu Adunuthula

Seshu Adunuthula

Sr. Director of Analytics Infrastructure, eBay
Seshu Adunuthula is Sr Director of Analytics Infrastructure at eBay responsible for managing some of the world’s largest deployments of Hadoop, Teradata and ETL Ingest platforms. He is an industry veteran with over 20 years of Distributed Computing and Analytics Experience. Prior... Read More →

Wednesday May 11, 2016 9:25am - 9:45am PDT
Regency CD

9:50am PDT

Keynote: Making Data Accessible - Ashish Thusoo, Co-founder & CEO, Qubole
Every organization is handling data in one way or another, but today’s data tools and infrastructure continue to hinder an organization’s ability to make data accessible to less technical users. In this keynote, Ashish Thusoo, CEO and co-founder of Qubole, will discuss the gaps in organizations’ data ambitions and ability to execute.

He will cover the gap between an organization’s ability to operationalize the infrastructure needed to support ubiquitous access to data, specifically regarding administrative expertise of data systems, the ability to predict capacity and to centrally monitor and govern usage. In order to address this, Ashish will discuss how cloud platforms can offer the elasticity, automation and access planes to alleviate these issues and provide a more accessible data platform.

Additionally, despite a new class of user-friendly tools, there is still a gap between a company’s ability to make data accessible throughout throughout the organization. To truly bridge this gap, Ashish will offer strategies on how developers can take a verticalized approach to building applications on top of data so that users can benefit from easy-to-use visualizations and other tools.

avatar for Ashish Thusoo

Ashish Thusoo

Co-founder, Qubole
Before co-founding Qubole, Ashish ran Facebook’s Data Infrastructure team; under his leadership the team built one of the largest data processing and analytics platforms in the world. This platform achieved not just the bold aim of making data accessible to analysts, engineers... Read More →

Wednesday May 11, 2016 9:50am - 10:10am PDT
Regency CD

10:15am PDT

Keynote: ODPi and ASF: Building a Stronger Hadoop Ecosystem - John Mertic, Director of Program Management, ODPi

ODPi Director of Program Management, John Mertic, will explain how the work of the ODPi complements and supports that of the ASF. Since ODPi’s launch in 2015, there has been some confusion around how its work may overlap, or potentially compete, with that of the ASF. Mr. Mertic will detail how the ODPi’s specifications and by-laws reinforce the role of the ASF as the singular place where Hadoop development occurs. He will also explain how the ODPi’s focus on the downstream Hadoop ecosystem oxygenates the Big Data market and stimulates growth.

avatar for John Mertic

John Mertic

Director of Program Management, The Linux Foundation
John Mertic is the Director of Program Management for The Linux Foundation. Under his leadership, he has helped ASWF, ODPi, Open Mainframe Project, and R Consortium accelerate open source innovation and transform industries. John has an open source career spanning two decades, both... Read More →

Wednesday May 11, 2016 10:15am - 10:25am PDT
Regency CD