When building big data applications, it’s easy to get fixated on the many design decisions that depend on where you plan to deploy. Issues arise that are specific to cloud-based deployments, such as geographical location, storage tiers, service tiers, and pricing commitments, but these types of decisions should not be the primary driving factors.
The distinction between on-premises and cloud deployments is narrowing—the current trend is to include the cloud as a key component of any big data deployment, especially those that cross geographical boundaries. Thus, one should not be thinking in terms of one model versus another but rather how the models can complement each other. While many organizations begin their big data journeys in the cloud as a means for a quick on-ramp, the cloud today is more about the broad distribution and access of global data.
Dale Kim shares insights on architecting big data applications for the cloud, using an example reference application his team built and published as context for describing several key requirements for cloud-based environments.
Dale Kim is the senior director of products/solutions at Arcadia Data. His background includes a variety of technical and management roles at information technology companies. While Dale’s experience includes work with relational databases, much of his career pertains to nonrelational data in the areas of search, content management, NoSQL, and Hadoop/Spark, and includes senior roles in technical marketing, sales engineering, and support engineering. Dale holds an MBA from Santa Clara University, and a BA in computer science from the University of California, Berkeley.
©2017, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • email@example.com
Apache Hadoop, Hadoop, Apache Spark, Spark, and Apache are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries, and are used with permission. The Apache Software Foundation has no affiliation with and does not endorse, or review the materials provided at this event, which is managed by O'Reilly Media and/or Cloudera.