Presented By O'Reilly and Cloudera
Make Data Work
September 26–27, 2016: Training
September 27–29, 2016: Tutorials & Conference
New York, NY

Big data architectural patterns and best practices on AWS

Siva Raghupathy (Amazon Web Services)
4:35pm–5:15pm Wednesday, 09/28/2016
Data innovations
Location: 3D 12
Level: Intermediate
Average rating: 4.85 (13 ratings)

Prerequisite knowledge

  • An understanding of the Hadoop ecosystem and tools (Spark, Presto, Hive, Elasticsearch, NoSQL, Redis, etc.) and AWS tools (Amazon Kinesis, Amazon S3, Amazon Redshift, Amazon EMR, Amazon DynamoDB, etc.)
  • A desire to build scalable big data systems leveraging AWS

What you'll learn

  • Learn how to build a decoupled data bus using Hadoop and AWS
  • Understand how to choose the right tool for the job based on data structure, latency, throughput, and access patterns
  • Get Lambda Architecture ideas and tips for leveraging AWS managed services

Description

    The world is producing an ever-increasing volume, velocity, and variety of big data. Consumers and businesses are demanding up-to-the-second (or even millisecond) analytics on their fast-moving data, in addition to classic batch processing. The Hadoop ecosystem and AWS provide a plethora of tools for solving big data problems. But which tools should you use, why, and how?

    Siva Raghupathy demonstrates how to combine innovations from the Hadoop ecosystem with those from Amazon Web Services, showing how to simplify big data processing as a data bus comprising four stages: collect, store, process/analyze, and consume. Siva then discusses how to choose the right technology in each stage based on criteria such as data structure, query latency, cost, request rate, item size, data volume, and durability, before providing a reference architecture, design patterns, and best practices for assembling these technologies to solve your big data problems at the right cost.
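
    As a rough illustration of the staged data bus described above (not taken from the talk), the sketch below models the collect, store, process/analyze, and consume stages as independent Python functions. The function names, the event fields `sensor` and `value`, and the in-memory deque standing in for a durable store are all assumptions made for this example; in practice each stage would be backed by a service chosen against the criteria listed above.

```python
from collections import deque
from statistics import mean


def collect(raw_events):
    """Collect stage: accept raw events and drop malformed ones."""
    return [e for e in raw_events if "sensor" in e and "value" in e]


def store(events, buffer):
    """Store stage: append events to a buffer (a deque stands in for durable storage)."""
    buffer.extend(events)
    return buffer


def process(buffer):
    """Process/analyze stage: compute the average value per sensor."""
    grouped = {}
    for event in buffer:
        grouped.setdefault(event["sensor"], []).append(event["value"])
    return {sensor: mean(values) for sensor, values in grouped.items()}


def consume(results):
    """Consume stage: format the aggregates for a downstream reader."""
    return [f"{sensor}: {avg:.1f}" for sensor, avg in sorted(results.items())]


# Hypothetical input: three well-formed events and one malformed record.
raw = [
    {"sensor": "a", "value": 1.0},
    {"sensor": "a", "value": 3.0},
    {"sensor": "b", "value": 10.0},
    {"bad": True},
]

# Each stage only depends on the output of the previous one, so any stage
# can be swapped for a different implementation without touching the others.
report = consume(process(store(collect(raw), deque())))
print(report)
```

    Because the stages share only their inputs and outputs, replacing one of them (for example, a different store) leaves the rest of the pipeline unchanged, which is the decoupling the abstract refers to.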

    Siva Raghupathy

    Amazon Web Services

    Siva Raghupathy leads the Americas Big Data Solutions Architecture team at AWS, where he guides developers and architects in building successful big data solutions on AWS. Previously, as a principal technical program manager for AWS Database Service, Siva gathered emerging NoSQL requirements and wrote the first version of the DynamoDB product specification. Later, as a development manager for Amazon Relational Database Service (RDS), he drove several enhancements. Prior to AWS, Siva spent several years at Microsoft.

    Comments on this page are now closed.

    Siva Raghupathy
    09/28/2016 8:36pm EDT

    Hi Anagha, Thai, I uploaded the presentation to the speaker portal. The speaker manager told me that they will have to manually push it and it should be available Thursday (9/29). Thanks for coming to my session. Regards, Siva

    Anagha Khanolkar
    09/28/2016 1:31pm EDT

    Siva, great presentation!!
    Can we get a copy of the slides, please?

    Thai Truong
    09/28/2016 1:13pm EDT

    Could you share the presentation?