Sep 23–26, 2019
Please log in

Cloudera Edge Management in the IoT

Purnima Reddy Kuchikulla (Cloudera), Timothy Spann (Cloudera), Abdelkrim Hadjidj (Cloudera), Andre Araujo (Cloudera), Hemanth Yamijala (Cloudera)
9:00am12:30pm Tuesday, September 24, 2019
Location: 1E 11
Average rating: *****
(5.00, 3 ratings)

Who is this presentation for?

  • Solution architects, application architects, and big data engineers




CEM is an edge-management solution for IoT and streaming use cases. CEM is made of two components—MiNiFi and Edge Flow Manager. MiNiFi is an edge agent and can be deployed into thousands of edge devices to collect data. It’s a lightweight version of NiFi and acts as a runtime at the edge to execute flow files. MiNiFi also supports TensorFlow, which allows for machine learning (ML) models to be executed at the edge.

Purnima Reddy Kuchikulla, Timothy Spann, Abdelkrim Hadjidj, and Andre Araujo explain Edge Flow Manager, a new component within Cloudera Edge Management (CEM). It’s an edge-management hub that manages, controls, and monitors MiNiFi agents. It allows you to develop, deploy, run, and monitor edge-flow apps and ML models at the edge. Edge Flow Manager offers an easy-to-use NiFi-like user interface that allows users to leverage NiFi processors to design data flows that can be pushed out to the edge. These flow files can instruct the edge agent to collect specific data points from the edge device as well as process it at the edge and stream it into the enterprise. These flow files can also be changed from the same user interface and can be deployed to the edge to any specific class of devices. This allows the user to change the behavior of a specific set of agents in the field based on specific criteria.

Prerequisite knowledge

  • General knowledge of programming
  • Experience programming hands on (useful but not required)

Materials or downloads needed in advance


What you'll learn

  • Get hands-on experience handling edge device sand agents, no-code flow programming, and the ability to control and monitor tens of thousands of these agents
Photo of Purnima Reddy Kuchikulla

Purnima Reddy Kuchikulla


Purnima Kuchikulla is a solution engineer at Cloudera, where she works with customers on their cloud and big data strategies, and a big data evangelist with 15 years of experience in the industry. Previously, she was at IBM and ADP.

Photo of Timothy Spann

Timothy Spann


Tim Spann is a field engineer for the data in motion team at Cloudera. Previously, he was a senior solutions architect at airisDATA working with Apache Spark and machine learning; a senior software engineer at SecurityScorecard, helping to build a reactive platform for monitoring real-time third-party vendor security risk in Java and Scala; and a senior field engineer for Pivotal focusing on Cloud Foundry, HAWQ and big data. He’s an avid blogger and the big data zone leader for Dzone. He runs the the very successful Future of Data: Princeton meetup with over 1192. You can find all the source and material behind his talks at his GitHub and Community blog: and

Photo of Abdelkrim Hadjidj

Abdelkrim Hadjidj


Abdelkrim Hadjidj is a senior data streaming specialist at Cloudera with 10 years experience on several distributed systems (big data, IoT, peer to peer and cloud). Previously, he held several positions including big data lead, CTO, and software engineer at several companies. He was a speaker at various international conferences and published several scientific papers at well-known IEEE and ACM journals. Abdelkrim holds a PhD, MSc, and MSe degrees in computer science.

Photo of Andre Araujo

Andre Araujo


André Araujo is a principal solutions architect at Cloudera. An experienced consultant with a deep understanding of the Hadoop stack and its components and a methodical and keen troubleshooter who loves making things run faster, André is skilled across the entire Hadoop ecosystem and specializes in building high-performance, secure, robust, and scalable architectures to fit customers’ needs.

Hemanth Yamijala


  • Cloudera
  • O'Reilly
  • Google Cloud
  • IBM
  • Cisco
  • Dataiku
  • Intel
  • Io-Tahoe
  • MemSQL
  • Microsoft Azure
  • Oracle Cloud Infrastructure
  • SAS
  • Arcadia Data
  • BMC Software
  • Hazelcast
  • SAP
  • Amazon Web Services
  • Anaconda
  • Esri
  •, Inc.
  • Kyligence
  • Pitney Bowes
  • Talend
  • Google Cloud
  • Confluent
  • DataStax
  • Dremio
  • Immuta
  • Impetus Technologies Inc.
  • Keyence
  • Kyvos Insights
  • StreamSets
  • Striim
  • Syncsort
  • SK holdings C&C

    Contact us

    For conference registration information and customer service

    For more information on community discounts and trade opportunities with O’Reilly conferences

    For information on exhibiting or sponsoring a conference

    For media/analyst press inquires