Apache: Big Data 2016 has ended
Wednesday, May 11 • 4:10pm - 5:00pm
Secure Spark Shuffle: A Fast and Convenient Approach Using Chimera - Cheng Xu, Intel

Shuffle is a key process in the Spark computing model, and it is highly performance-sensitive. With security breaches and incidents increasingly common, data encryption matters more and more for an enterprise-ready product. In this talk, we will describe how we use Chimera to secure shuffle data. Chimera is a cryptography library optimized with AES-NI (Advanced Encryption Standard New Instructions). It provides Java APIs at both the cipher level and the Java stream level, and it originates from Intel Diceros and Hadoop encryption at rest. It limits the performance impact of encryption through hardware acceleration and helps users avoid the issues that native code can cause. We will also show performance results measured after enabling shuffle encryption in Spark.
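To make the "stream level" idea concrete, here is a minimal sketch of encrypting and decrypting a block of bytes through Java streams. It is an illustration only: it uses the standard JDK `javax.crypto` classes rather than Chimera's own API, which the abstract does not show. The class name `ShuffleStreamCryptoSketch`, its method names, the choice of AES/CTR/NoPadding, and the sample payload are all assumptions made for the example.

```java
import javax.crypto.Cipher;
import javax.crypto.CipherInputStream;
import javax.crypto.CipherOutputStream;
import javax.crypto.spec.IvParameterSpec;
import javax.crypto.spec.SecretKeySpec;
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.nio.charset.StandardCharsets;
import java.security.GeneralSecurityException;
import java.security.SecureRandom;
import java.util.Arrays;

// Hypothetical sketch: stream-level encryption with the standard JCE,
// standing in for the kind of stream API the abstract attributes to Chimera.
public class ShuffleStreamCryptoSketch {
    // Assumed mode: CTR is a stream-friendly mode, so no padding is needed.
    private static final String TRANSFORMATION = "AES/CTR/NoPadding";

    // Cipher-level step: build an initialized AES cipher for the given mode.
    static Cipher newCipher(int mode, byte[] key, byte[] iv)
            throws GeneralSecurityException {
        Cipher cipher = Cipher.getInstance(TRANSFORMATION);
        cipher.init(mode, new SecretKeySpec(key, "AES"), new IvParameterSpec(iv));
        return cipher;
    }

    // Stream-level step: everything written to the wrapper is encrypted
    // before it reaches the underlying sink.
    static byte[] encrypt(byte[] plain, byte[] key, byte[] iv)
            throws GeneralSecurityException, IOException {
        ByteArrayOutputStream sink = new ByteArrayOutputStream();
        try (OutputStream out = new CipherOutputStream(
                sink, newCipher(Cipher.ENCRYPT_MODE, key, iv))) {
            out.write(plain);
        }
        return sink.toByteArray();
    }

    // Reads the encrypted bytes back through a decrypting input stream.
    static byte[] decrypt(byte[] encrypted, byte[] key, byte[] iv)
            throws GeneralSecurityException, IOException {
        ByteArrayOutputStream plain = new ByteArrayOutputStream();
        try (InputStream in = new CipherInputStream(
                new ByteArrayInputStream(encrypted),
                newCipher(Cipher.DECRYPT_MODE, key, iv))) {
            byte[] buffer = new byte[4096];
            int n;
            while ((n = in.read(buffer)) != -1) {
                plain.write(buffer, 0, n);
            }
        }
        return plain.toByteArray();
    }

    public static void main(String[] args) throws Exception {
        SecureRandom random = new SecureRandom();
        byte[] key = new byte[16]; // 128-bit AES key
        byte[] iv = new byte[16];  // one AES block; must not repeat for a given key
        random.nextBytes(key);
        random.nextBytes(iv);

        // Made-up payload standing in for a shuffle block.
        byte[] block = "example shuffle block".getBytes(StandardCharsets.UTF_8);
        byte[] encrypted = encrypt(block, key, iv);
        byte[] roundTrip = decrypt(encrypted, key, iv);
        System.out.println("round trip ok: " + Arrays.equals(block, roundTrip));
    }
}
```

In CTR mode the ciphertext has the same length as the plaintext, so wrapping a stream this way does not change the size of the data written. This sketch says nothing about the speed of either approach; the performance results belong to the talk itself.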

Cheng Xu

Senior Software Engineer, Intel
I am a software engineer at Intel. I currently work on the Apache Hive, Apache Parquet, and Apache Spark projects, and I am a committer on the Apache Hive project. My current focus is Spark authorization, especially in the Spark SQL component, and performance improvements in Apache Parquet...

Wednesday May 11, 2016 4:10pm - 5:00pm PDT
Plaza C