Spark is now the de facto engine for big data processing. Vincent Van Steenbergen walks you through two real-world applications that use Spark to build functional machine-learning pipelines (wine price prediction and malware analysis), discussing the architecture and implementation and sharing the good, the bad, and the ugly experiences he had along the way.
Vincent Van Steenbergen is a certified Spark consultant and trainer at w00t data, where he helps companies scale big data and machine-learning solutions into production-ready applications and provides Spark training and consulting to a broad range of companies across Europe and the US. Vincent is a coorganizer of the PAPIs.io international conference.
Comments on this page are now closed.
©2017, O’Reilly UK Ltd • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • email@example.com