Enterprise data is moving into Hadoop, but some data has to stay in operational systems. Optiq (the technology behind Hive’s new cost-based optimizer) is a query-optimization and data federation technology that allows you to combine data in Hadoop with data in NoSQL systems such as MongoDB and Splunk, and access it all via SQL. Hyde shows how to quickly build a SQL interface to a NoSQL system using Optiq. He shows how to add rules and operators to Optiq to push down processing to the source system, and how to automatically build materialized data sets in memory for blazing-fast interactive analysis.
Julian Hyde is an expert in query optimization and in-memory analytics. He is the lead developer of Optiq, the new cost-based optimizer for Apache Hive, an Apache Drill committer, and lead developer of the Mondrian OLAP engine. He is an architect at Hortonworks.