Security is a crucial component of the big data ecosystem. The need to protect data from exploits and vulnerability is evident in the strong push for cybersecurity and secure clusters across businesses and industries alike. Spark itself has been a major analytic backbone of that infrastructure. As with Hadoop, the security infrastructure for Spark is evolving and expanding.
So how do you ensure security with Spark without much hassle? Neelesh Srinivas Salian outlines the steps that need to be taken to set up security with Spark and discusses the potential issues with Spark Core, Streaming, and other components. Detailed knowledge of the setup and an awareness of what to be looking out for in terms of problems and issues can help an organization move forward in the right way.
Neelesh Srinivas Salian is a software engineer on the data platform team at Stitch Fix, where he works on the compute infrastructure used by the company’s data scientists. Previously, he worked at Cloudera, where he worked with Apache projects like YARN, Spark, and Kafka.
©2017, O’Reilly UK Ltd • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • firstname.lastname@example.org