Presto is gaining popularity for interactive and reporting use cases owing to its performance and ease of use. Presto is performant because it executes pipelined in-memory query plans, but its default architecture isn’t amenable to handling Spot node loss. However, many users still use Spot nodes for their cost benefits and tolerate query failures as a necessary evil.
With Spot termination notification listeners and its query retry system, Presto improves the user experience. The changes also allow users who cannot tolerate failures to use Spot nodes and enjoy their cost benefits.
Shubham Tagra explores the gaps in Presto architecture, explains how to use Spot nodes, covers enhancements like Spot termination notification listeners and the query retry system, and showcases the improvements in terms of reliability and TCO achieved through them.
Shubham Tagra is a senior staff engineer at Qubole working on Presto and Hive development and making these solutions cloud ready. Previously, Shubham worked on the storage area network at NetApp. Shubham holds a bachelor’s degree in computer engineering from the National Institute of Technology, Karnataka, India.
©2019, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • email@example.com