Security is a crucial component of the big data ecosystem. The need to protect data from exploits and vulnerabilities is evident in the strong push for cybersecurity and secure clusters across businesses and industries. Spark itself has been a major analytic backbone of that infrastructure. As with Hadoop, the security infrastructure for Spark is evolving and expanding.
So how do you ensure security with Spark without much hassle? Neelesh Srinivas Salian outlines the steps needed to set up security with Spark and discusses potential issues with Spark Core, Streaming, and other components. Detailed knowledge of the setup and an awareness of the problems to watch for can help an organization move forward in the right way.
Neelesh Srinivas Salian is a software engineer on the data platform team at Stitch Fix, where he works on the compute infrastructure used by the company’s data scientists, focusing particularly on the Apache Spark ecosystem. Previously, he was at Cloudera, where he worked with Apache projects like YARN, Spark, and Kafka. Neelesh holds a master’s degree in computer science with a focus on cloud computing from North Carolina State University and a bachelor’s degree in computer engineering from the University of Mumbai, India.
©2017, O’Reilly UK Ltd • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • email@example.com