Apache: Big Data 2016 has ended
Register Now or Visit the Website for more Information 
Back To Schedule
Wednesday, May 11 • 5:10pm - 6:00pm
Mining Public Datasets Using Apache Zeppelin (incubating) and Spark - Alexander Bezzubov, NFLabs

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

There are a lot of public datasets available in the wild and the number is growing. In meantime, ASF provides a plethora of free tools for any practitioner to build up on. In this talk Alexander will show how to levirage 2 of them, Zeppelin and Spark, for exploratory data anaytics and building a data product over two real datasets CommonCrawl http://commoncrawl.org and GithubArchive https://www.githubarchive.org


Alexander Bezzubov

Software Engineer, NFLabs
Alexander Bezzubov is Apache Zeppelin contributor, PMC member and software engineer at NFLabs. Previous speaking experience includes Apache BigData NA 2016 in Vancouver, FOSSASIA 2016 in Singapore, Apache BigData EU 2015 in Budapest.

Wednesday May 11, 2016 5:10pm - 6:00pm PDT
Plaza C