Name: Getting Started with Machine Learning & Spark - Holden Karau, IBM (Additional Fee)
Start: 2016-05-12T09:00:00-0700
End: 2016-05-12T12:00:00-0700

Apache Spark is a fast, general-purpose engine for distributed computing and big data processing, with APIs in Scala, Java, Python, and R. It ships with built-in libraries for a variety of purposes, including SQL, streaming, graph analysis, and machine learning. This talk will focus on how to use Spark for machine learning.

Apache Spark has two APIs for machine learning, the newer of which is focused on creating machine learning pipelines. This talk will explore a simple classification problem in both APIs, followed by a tour of some of the different machine learning models. We will then talk about loading and saving models and the challenges faced when attempting to construct a real-time serving solution from Spark ML’s models. From there we will explore some of the performance work being done inside Spark to improve machine learning.

Speakers

Holden Karau

Developer Advocate, Google

Holden Karau is a transgender Canadian open source developer advocate at Google focusing on Apache Spark, Beam, and related big data tools. Previously, she worked at IBM, Alpine, Databricks, Google (yes, this is her second time), Foursquare, and Amazon. Holden is the coauthor of Learning...

Thursday May 12, 2016 9:00am - 12:00pm PDT
Constable

Tutorial, Intermediate

Apache: Big Data 2016
