Architecting Virtualized Infrastructure for Big Data

Data Science, Ballroom CD
Average rating: ***..
(3.00, 1 rating)

This session will teach participants how to architect big data systems that leverage virtualization and platform as a service.

We will walk through a layered approach to building a unified analytics platform using virtualization, provisioning tools and platform as a service. We will show how virtualization can be used to simplify deployment and provisioning of Hadoop, SQL and NoSQL databases. We will describe the workload patterns of Hadoop and the infrastructure design implications. We will discuss the current and future role of PaaS to make it easy to deploy Java, SQL, R, and Python jobs against big-data sets.


  • A layered strategy for building a unified analytic stack:
    • Options and architecture for building virtual and cloud infrastructure
    • Values and problems solved using virtualization and cloud infrastructure
    • Picking the right storage and network architecture in light of Hadoop and big-SQL
      *Data layer
    • Using cloud provisioning tools to simplify deployment and management of databases
  • Databases
    • Taxonomy and feature comparison of datastores and databases for big-data
    • Quantitive comparison of popular NoSQL stores
  • PaaS and the runtime layer
    • Runtime and language services for big-data analysis
    • Using PaaS to provide simple/agile access to R, SQL, Python without the headaches
  • Analytics Tools and services for data scientists and data modelers
    • Tools for collaboration and sharing of big-data sources


Following this approach it is possible to build a unified analytics platform that extends into future needs. Attendees will learn a layered approach for how to build a virtualization-based architecture for multiple layers of a big data system.

Photo of Richard McDougall

Richard McDougall


Richard McDougall is the Application Infrastructure CTO and Principal Engineer in the Office of the CTO at VMware. He is responsible for driving advanced development and strategy for VMware’s application platform architecture – including the performance and integration of applications, runtimes, middleware, and application encapsulation technologies.

Richard’s is known as an expert in the areas of performance measurement and optimization, and in application deployment architectures.

Before the CTO office, as the Chief Performance architect Richard drove the performance strategy and initiatives to enable virtualization of high-end mission critical applications on VMware products.

Prior to joining VMware, Richard was a Distinguished Engineer at Sun Microsystems. During his 14 years at Sun, he was responsible for driving high performance and scalability initiatives for Solaris and key applications on the Sun platform. He served on the central software platform architecture review committee, and also drove the early resource management initiatives for Solaris. Recognized as an operating system and performance expert, he developed several technologies for the Solaris operating system and co-authored several books — including “Solaris Resource Management”, “Solaris Internals” and “Solaris Performance and Tools”.

Richard holds several patents in the area of performance instrumentation, algorithms and distributed file system technologies.


  • EMC
  • Microsoft
  • HPCC Systems™ from LexisNexis® Risk Solutions
  • MarkLogic
  • Shared Learning Collaborative
  • Cloudera
  • Digital Reasoning Systems
  • Pentaho
  • Rackspace Hosting
  • Teradata Aster
  • VMware
  • IBM
  • NetApp
  • Oracle
  • 1010data
  • 10gen
  • Acxiom
  • Amazon Web Services
  • Calpont
  • Cisco
  • Couchbase
  • Cray
  • Datameer
  • DataSift
  • DataStax
  • Esri
  • Facebook
  • Feedzai
  • Hadapt
  • Hortonworks
  • Impetus
  • Jaspersoft
  • Karmasphere
  • Lucid Imagination
  • MapR Technologies
  • Pervasive
  • Platform Computing
  • Revolution Analytics
  • Scaleout Software
  • Skytree, Inc.
  • Splunk
  • Tableau Software
  • Talend

For information on exhibition and sponsorship opportunities at the conference, contact Susan Stewart at

For information on trade opportunities with O'Reilly conferences contact Kathy Yu at mediapartners

For media-related inquiries, contact Maureen Jennings at

View a complete list of Strata contacts