14–17 Oct 2019

Speakers

Hear from innovative researchers, talented CxOs, and senior developers who are doing amazing things with artificial intelligence. More speakers will be announced; please check back for updates.

Grid view List view

Sridhar Alla is cofounder and CTO at BlueWhale, which brings together the worlds of big data and artificial intelligence to provide comprehensive solutions to meet the business needs of organizations of all sizes. He and his team are cloud and tool agnostic and strive to embed themselves into the workstream to provide strategic and technical assistance, with solutions such as predictive modeling and analytics, capacity planning, forecasting, anomaly detection, advanced NLP, chatbot development, SAS to Python migration, and deep learning-based model building and operationalization. Sridhar is also the author of three books and an avid presenter at conferences including Strata, Hadoop World, Spark Summit and others.

Presentations

Anomaly detection using deep learning to measure the quality of large datasets Session

Any business, big or small, depends on analytics, whether the goal is revenue generation, churn reduction, or sales or marketing purposes. No matter the algorithm and the techniques used, the result depends on the accuracy and consistency of the data being processed. Sridhar Alla examines some techniques used to evaluate the quality of data and the means to detect the anomalies in the data.

Alasdair Allan is a director at Babilim Light Industries and a scientist, author, hacker, maker, and journalist. An expert on the internet of things and sensor systems, he’s famous for hacking hotel radios, deploying mesh networked sensors through the Moscone Center during Google I/O, and for being behind one of the first big mobile privacy scandals when, back in 2011, he revealed that Apple’s iPhone was tracking user location constantly. He’s written eight books and writes regularly for Hackster.io, Hackaday, and other outlets. A former astronomer, he also built a peer-to-peer autonomous telescope network that detected what was, at the time, the most distant object ever discovered.

Presentations

Measuring embedded machine learning Session

The future of machine learning is on the edge and on small, embedded devices that can run for a year or more on a single coin-cell battery. Alasdair Allan dives deep into how using deep learning can be very energy efficient and allows you to make sense of sensor data in real time.

Zahra Ashktorab is a research staff member at IBM Thomas J. Watson Center. At IBM Research, she studies social technologies, AI systems, and their influence on user behavior and interaction. Her interests and prior work lie at the intersection of machine learning, human-computer interaction (HCI), and design. She uses a mix of quantitative and qualitative methods in her research to address HCI-related questions and interaction design. She has published her work at the ACM Conference on Human Factors and Computing Systems (CHI), the ACM Conference on Computer Supported Cooperative Work and Social Computing (CSCW), and other reputable HCI and information systems conferences. She received her PhD on human-computer interaction at the University of Maryland, College Park.

Presentations

The intersection of AI and HCI: Gamifying the latest artificial intelligence research Session

Casey Dugan and Zahra Ashktorab challenge you to guess the backdoor of a hacked classifier. Join them to learn more about novel AI technologies through the design and development of engaging games. Take a look at their latest research around improving the interactions between humans and AI systems from empathy building to feedback design.

Bahman Bahmani is the vice president of data science and engineering at Rakuten (the seventh-largest internet company in the world), managing an AI organization with engineering and data science managers, data scientists, machine learning engineers, and data engineers globally distributed across three continents, and he’s in charge of the end-to-end AI systems behind the Rakuten Intelligence suite of products. Previously, Bahman built and managed engineering and data science teams across industry, academia, and the public sector in areas including digital advertising, consumer web, cybersecurity, and nonprofit fundraising, where he consistently delivered substantial business value. He also designed and taught courses, led an interdisciplinary research lab, and advised theses in the Computer Science Department at Stanford University, where he also did his own PhD focused on large-scale algorithms and machine learning, topics on which he’s a published author.

Presentations

Executive Briefing: Business at the speed of AI Session

Amid fears of sentient killing robots and a freezing AI winter, AI has a true potential to transform the enterprise. Actualizing this potential requires a well-informed organizational strategy and consistent execution of best practices regarding people, processes, and platforms. Bahman Bahmani examines these strategies and best practices and provides insights into their successful execution.

Antje Barth is a senior developer advocate for AI and machine learning at AWS. Besides AI and ML, Antje is passionate about helping developers leverage big data, container, and Kubernetes platforms in the context of AI and machine learning. Previously, Antje was in technical evangelist and solutions engineering roles at MapR and Cisco. She frequently speaks at AI and machine learning conferences and meetups around the world. Antje is a cofounder of the Düsseldorf chapter of Women in Big Data.

Presentations

Containerized architectures for deep learning Session

Container and cloud native technologies around Kubernetes have become the de facto standard in modern ML and AI application development. Antje Barth examines common architecture blueprints and popular technologies used to integrate AI into existing infrastructures and explains how you can build a production-ready containerized platform for deep learning.

Karim Beguir helps companies get a grip on the latest AI advances and deploy them in practice. A graduate of France’s École Polytechnique and former program fellow at NYU’s Courant Institute, Karim has a passion for teaching and using applied mathematics. This encouraged him to cofound InstaDeep (named one of the 20 global tech startups to watch in 2017 by PCMag). Karim is one of fewer than 100 Google Certified Machine Learning Developer Experts worldwide and serves on the steering committee of Indaba, Africa’s most important AI event series. Karim is now on a mission to democratize AI and make it accessible to a wider audience.

Presentations

Deep RL for bin packing Session

Karim Beguir discusses a system in which an agent that learns to pack boxes efficiently in containers while respecting multiple physical constraints. The agent is trained using reinforcement learning to minimize the wasted space. Without any human knowledge, the agent achieves superhuman performance and outperforms commercial optimization software.

Martin Benson is the head of AI consulting at Jaywing, where he invented the algorithm that underpins the company’s new patent-pending AI product, Archetype, which enables lenders to produces interpretable AI credit-scoring models and is recognized for driving business value from data and developing relevant technology that directly grows business opportunities. He’s led a number of game-changing data science products to discover new business opportunities for Jaywing and its clients, spearheading commercial applications of machine learning, statistical analysis, data mining, and modeling for clients like Avios and Swinton. Martin has a passion for turning data into products, actionable insights, and meaningful stories and is recognized both internally and externally as an expert to solve challenging data problems. He thrives on sharing his knowledge with others and has recently been invited to be an ambassador for the leading deep learning MOOC by Coursera, assisting people in learning about AI. Martin was also a DataIQ Talent Awards finalist for the data science leader category. A trained mathematician with a master’s degree and PhD in mathematics and a leading expert in driving business value from data, Martin is at the forefront of AI and data science.

Presentations

Fairness in AI: Applying deep learning to credit scoring Session

Machine learning has been used in credit scoring for three decades. Martin Benson discusses the history of machine learning in credit scoring and the need for explainable and justified decisions made by machine learning systems. Come find out if it's possible to overcome the black box problem and learn how machine learning systems are evolving and how to bypass the challenges to adoption.

Rajib Biswas is a lead data scientist at Ericsson’s Global AI Accelerator. He has 10 years of industry experience in AI- and ML-based product development and research and has applied AI and ML to solve problems related to domains like finance, telecom, and consumer electronics. He holds a master’s in computer science from BITS-Pilani.

Presentations

Adversarial network for natural language synthesis Session

Rajib Biswas outlines the application of AI algorithms like generative adversarial networks (GANs) to solve natural language synthesis tasks. Join in to learn how AI can accomplish complex tasks like machine translation, write poetry with style, read a novel, and answer your questions.

Cam Buscaron is a principal open source technologist and strategist at AWS, where he works with the robotics developer community and ecosystem to foster cloud innovation and widespread adoption of open source tools. He previously contributed to the design and development of a large-scale hardware-in-the-loop simulation system for self-driving cars and warehouse robots, built on ROS.

Presentations

The future of open source frameworks and cloud simulation for robots and AI systems development Session

As robots and AI systems become more prevalent in enterprise, industrial, and home settings, there's an increasing need for well-maintained, reliable, and secure development tools and frameworks for the next-generation production-grade robots and systems. Cam Buscaron explains how to leverage large-scale cloud simulation and the Robot Operating System (ROS) to build such systems.

Paris Buttfield-Addison is a cofounder of Secret Lab, a game development studio based in beautiful Hobart, Australia. Secret Lab builds games and game development tools, including the multi-award-winning ABC Play School iPad games, the BAFTA- and IGF-winning Night in the Woods, the Qantas airlines Joey Playbox games, and the Yarn Spinner narrative game framework. Previously, Paris was a mobile product manager for Meebo (acquired by Google). Paris particularly enjoys game design, statistics, blockchain, machine learning, and human-centered technology. He researches and writes technical books on mobile and game development (more than 20 so far) for O’Reilly; he recently finished writing Practical AI with Swift and is currently working on Head First Swift. He holds a degree in medieval history and a PhD in computing. Paris loves to bring machine learning into the world of practical and useful. You can find him on Twitter as @parisba.

Presentations

Building, teaching, and training simulations for machine learning with a game engine Session

You're building a high-volume, expensive, robot-driven warehouse. Your robots need to get to the right place quickly, find the right item, and sort it to the right place without colliding with each other, the shelves, or people. But you don't have any robots, and you need to start writing the logic and training them. Paris Buttfield-Addison and Tim Nugent outline how to use a simulation to do it.

Practical on-device AI and ML using Swift Session

On-device ML and AI is the future for privacy-conscious, cloud-averse users of modern smartphones. Paris Buttfield-Addison and Tim Nugent explore what's possible using CoreML, Swift, and associated frameworks in tandem with the powerful ML-tuned silicon in modern Apple iOS hardware. They demonstrate and create ML and AI features with Swift to show how much you can do without touching the cloud.

Umit Mert Cakmak is a manager and senior data scientist on the data science elite team at IBM. Umit excels at helping clients solve complex data science problems from inception to the delivery of deployable machine learning and AI pipelines. His research spans across multiple disciplines, and he enjoys sharing his insights at conferences, universities, and meetups.

Presentations

Executive Briefing: Why your AI initiative will fail Session

In every AI initiative, there’s a demand from businesses to protect or increase market share or decrease operational costs. Your competitors are a growing threat, seemingly adopting new technologies better than you. Umit Cakmak examines critical steps from countless client engagements on how to consistently deliver successful AI projects.

Douglas Calegari is a director of architecture and strategic development at a Fortune 500 insurance company. In a career that has spanned several decades, Doug has worked for multiple startup companies, insurance, and financial services institutions.

Presentations

Service center automation using the state-of-the-art NLP Session

Douglas Calegari details a solution that classifies and routes emails coming into a busy insurance service center. Join in to discover how his team evaluated NLP models, leveraged various techniques to increase classification and entity recognition accuracy, designed a scalable end-to-end machine learning data pipeline, and integrated them into an existing transactional system.

Roger Chen is cofounder and CEO of Computable and program chair for the O’Reilly Artificial Intelligence Conference. Previously, he was a principal at O’Reilly AlphaTech Ventures (OATV), where he invested in and worked with early-stage startups primarily in the realm of data, machine learning, and robotics. Roger has a deep and hands-on history with technology. Before startups and venture capital, he was an engineer at Oracle, EMC, and Vicor. He also developed novel nanoscale and quantum optics technology as a PhD researcher at UC Berkeley. Roger holds a BS from Boston University and a PhD from UC Berkeley, both in electrical engineering.

Presentations

Building and deploying AI applications and systems at scale Keynote

Details to come.

Thursday opening welcome Keynote

Program chairs Ben Lorica, Roger Chen, and Alexis Helzer open the second day of keynotes.

Wednesday opening welcome Keynote

Program chairs Ben Lorica, Roger Chen, and Alexis Helzer open the first day of keynotes.

Ira Cohen is a cofounder and chief data scientist at Anodot, where he’s responsible for developing and inventing the company’s real-time multivariate anomaly detection algorithms that work with millions of time series signals. He holds a PhD in machine learning from the University of Illinois at Urbana-Champaign and has over 12 years of industry experience.

Presentations

Herding cats: Product management in the machine learning era Tutorial

While the role of the manager doesn't require deep knowledge of ML algorithms, it does require understanding how ML-based products should be developed. Ira Cohen explores the cycle of developing ML-based capabilities (or entire products) and the role of the (product) manager in each step of the cycle.

Sequence to sequence (S2S) modeling for time series forecasting Session

Sequence to sequence (S2S) modeling using neural networks has become increasingly mainstream in recent years. In particular, it's been used for applications such as speech recognition, language translation, and question answering. Arun Kejariwal and Ira Cohen walk you through how S2S modeling can be leveraged for these use cases, visualization, real-time anomaly detection, and forecasting.

Robert Crowe is a data scientist and TensorFlow Developer Advocate at Google with a passion for helping developers quickly learn what they need to be productive. He’s used TensorFlow since the very early days and is excited about how it’s evolving quickly to become even better than it already is. Previously, Robert deployed production ML applications and led software engineering teams for large and small companies, always focusing on clean, elegant solutions to well-defined needs. In his spare time, Robert sails, surfs occasionally, and raises a family.

Presentations

TFX: Production ML pipelines with TensorFlow Tutorial

Putting together an ML production pipeline for training, deploying, and maintaining ML and deep learning applications is much more than just training a model. Robert Crowe and Pedram Pejman explore Google's TFX, an open source version of the tools and libraries that Google uses internally, made using its years of experience in developing production ML pipelines.

Alexis Crowell Helzer is senior director of artificial intelligence product marketing at Intel, where she and her team are responsible for technical positioning and messaging as well as outbound content and campaigns for Intel AI products. Alexis and her team partner with AI adopters across the industry from small device implementations to HPC clusters to launch products, showcase innovative use cases, and help other companies find their own AI path. She has an unyielding passion to deliver technology solutions that help businesses thrive. Over her rich career, she has run a cloud software engineering team focused on distributed computing and microservices integration, led the open source marketing efforts from Intel, and worked with many of the Fortune 100 companies to help incubate service offerings and deliver innovative products.

Presentations

The power of knowledge at scale Keynote

The AI revolution is poised to scale both machine and human knowledge. To generate that knowledge, companies must think differently about AI and how to deploy it. Alexis will cover the three “Be’s”, and how to approach AI systematically to truly harness knowledge at scale.

Thursday opening welcome Keynote

Program chairs Ben Lorica, Roger Chen, and Alexis Helzer open the second day of keynotes.

Trends to watch: How shifts in data structure and volume demand new approaches to AI compute Session

Demand for AI compute is doubling every three months. Alexis Crowell Helzer explains why the way we compute AI has to be completely rethought so it can evolve to enable the promise of global business transformation.

Wednesday opening welcome Keynote

Program chairs Ben Lorica, Roger Chen, and Alexis Helzer open the first day of keynotes.

Michael Cullan is a data scientist in residence at Pragmatic Institute, where he teaches hands-on courses in data science and business-oriented topics in managing data science initiatives at the organizational level. He also leads internal data science projects in support of marketing and operations teams. He earned a master’s degree in statistics and a bachelor’s degree in mathematics. His academic research areas ranged from computational paleobiology, where he developed software for measuring evidence for disparate evolutionary models based on fossil data, to music and AI, where he assisted in modeling musical data for a jazz improvisation robot. In his free time, he applies his math and programming skills toward creating code-based visual art and design projects.

Presentations

Deep learning with TensorFlow 2-Day Training

The TensorFlow library provides computational graphs with automatic parallelization across resources—ideal architecture for implementing neural networks. Michael Cullan walks you through TensorFlow's capabilities in Python, from building machine learning algorithms piece by piece to using the Keras API provided by TensorFlow with several hands-on applications.

Tim Daines is a principal designer with QuantumBlack. He’s also a proactive designer, telling the stories of human-centered explainable AI, ML, and IoT experiences across a variety of industries, including digital health, energy, elite sports, and learning. He has over ten years’ experience working closely with people to design and bring to market digital products and services, and he’s an innovative person who enjoys developing products and services across the entire end-to-end human experience, creating lasting experiences, and determining the best solutions to problems through insight discovery and journey mapping. His passion for creating better experiences for people drives all aspects of his designs and naturally aligns with companies who want to understand how their customers and employees interact with their product and services. Tim enjoys turning research gained through stakeholder workshops and meetings into designs that deliver significant value to humans to develop trust and loyalty. He holds master’s degrees in user experience design, and social science and research practices, and has worked with a range of companies from startups to global blue-chip companies across the US, UK, Europe, and Asia.

Presentations

Executive Briefing: Fusing data and design Session

Data scientists feel naturally comfortable with the language of mathematics, while designers think in the language of human empathy. Creating a bridge between the two was essential to the success of a recent project at an energy company. Tim Daines and Philip Pilgerstorfer detail what they learned while creating these bridges, showcasing techniques through a series of “aha” moments.

Danielle Dean is the technical director of machine learning at iRobot. Previously, she was a principal data science lead at Microsoft. She holds a PhD in quantitative psychology from the University of North Carolina at Chapel Hill.

Presentations

Azure AI reference architectures Session

Dive into the the newly released GitHub repository for recommended ways to train and deploy models on Azure with Danielle Dean, Wee Hyong Tok, and Mathew Salvaris. The repository ranges from running massively parallel hyperparameter tuning using Hyperdrive to deploying deep learning models on Kubernetes.

Training and deploying Python models on Azure Tutorial

Danielle Dean, Mathew Salvaris, and Wee Hyong Tok outline the recommended ways to train and deploy Python models on Azure, ranging from running massively parallel hyperparameter tuning using Hyperdrive to deploying deep learning models on Kubernetes.

Danielle Deibler is the cofounder and CEO of MarvelousAI, an early stage startup focused on building natural language technology to discover and expose propaganda, disinformation, and bias to enable advocates and policymakers to devise countermeasures and immunities. She has over 25 years’ experience in the internet infrastructure, security, networking, interactive technology, machine learning, and AI technologies. Her primary area of focus in the last 20 years has been building scalable real-time interactive platforms. Previously, she was CEO and cofounder of leading-edge regulatory technology startup Compliance.ai, founder of Apps54 and Ignited Artists, and an entrepreneur in residence at Trinity Ventures and held senior leadership positions in software development, engineering, business development, and product management for KIXEYE, Adobe, DIGEX, and UltraDNS.

Presentations

To arms: The battle against misinformation Session

Danielle Deibler examines an approach to detecting bias, fine-grained emotional sentiment, and misinformation through the detection of political narratives in online media. As building blocks, the methodology uses human-in-the-loop, alongside other natural language processing and computational linguistics techniques, with examples focused on the 2020 US presidential election.

Jim Dowling is the CEO of Logical Clocks, an associate professor at KTH Royal Institute of Technology in Stockholm, and lead architect of Hopsworks, an open source data and AI platform. He’s a regular speaker at big data industry conferences. He holds a PhD in distributed systems from Trinity College Dublin.

Presentations

ROCm and Hopsworks for end-to-end deep learning pipelines Session

The Radeon open ecosystem (ROCm) is an open source software foundation for GPU computing on Linux. ROCm supports TensorFlow and PyTorch using MIOpen, a library of highly optimized GPU routines for deep learning. Jim Dowling and Ajit Mathews outline how the open source Hopsworks framework enables the construction of horizontally scalable end-to-end machine learning pipelines on ROCm-enabled GPUs.

Casey Dugan is the manager of the AI Experience Lab at IBM Research in Cambridge. Her group is an interdiscipinary team made up of designers, engineers, and human-computer interaction (HCI) researchers. They design, build, and study systems at the intersection of HCI and AI, especially human-AI interaction. She has worked in the research areas of social media, analytics and visualization dashboards, human computation and crowdsourcing, and recommender systems since joining IBM. Her projects have ranged from designing meeting rooms of the future to studying #selfiestations, or kiosks for taking selfies at IBM labs around the world. She earned a couple of degrees from MIT and spent two summers interning with the IBM lab. Outside of work, she’s taught chocolate sculpture to teenagers, drinks a lot of Starbucks, and has a big fluffy dog named Lincoln.

Presentations

The intersection of AI and HCI: Gamifying the latest artificial intelligence research Session

Ty Dunn is a product manager at Berlin-based startup Rasa, where he focuses on empowering developers to build the best possible conversational experiences and deploy them to production. Previously, Ty was a software engineer, most recently on a research team at another conversational AI startup. He’s interested in using this technology to improve how we process and respond to the increasing amount of information and complexity we face every day. Ty holds a BS in cognitive science with a focus in computation from the University of Michigan.

Presentations

Building contextual AI assistants with machine learning and open source tools Session

AI assistants are getting a great deal of attention from the industry and research. However, the majority of assistants built to this day are still developed using a state machine and a set of rules. That doesn’t scale in production. Tyler Dunn explores how to build AI assistants that go beyond FAQ interactions using machine learning and open source tools.

Ted Dunning is the chief technology officer at MapR, an HPE company. He’s also a board member for the Apache Software Foundation, a PMC member, and committer on a number of projects. Ted has years of experience with machine learning and other big data solutions across a range of sectors. He’s contributed to clustering, classification, and matrix decomposition algorithms in Mahout and to the new Mahout Math library and designed the t-digest algorithm used in several open source projects and by a variety of companies. Previously, Ted was chief architect behind the MusicMatch (now Yahoo Music) and Veoh recommendation systems and built fraud-detection systems for ID Analytics (LifeLock). Ted has coauthored a number of books on big data topics, including several published by O’Reilly related to machine learning, and has 24 issued patents to date plus a dozen pending. He holds a PhD in computing science from the University of Sheffield. When he’s not doing data science, he plays guitar and mandolin. He also bought the beer at the first Hadoop user group meeting.

Presentations

Online evaluation of machine learning models Session

Evaluating machine learning models is surprisingly hard, but it gets even harder because these systems interact in very subtle ways. Ted Dunning breaks the problem into operational and functional concerns and shows you how each can be done without unnecessary pain and suffering. You'll also get to see some exciting visualization techniques to help make the differences strikingly apparent.

Raffaello D’Andrea is founder, CEO, and chairman of the board of Verity, the world’s leading autonomous indoor drone system provider; cofounder of ROBO Global, creator of the world’s first robotics exchange traded fund; and a professor of dynamic systems and control at ETH Zurich (on leave). He cofounded Kiva Systems—now Amazon Robotics—in 2003. He was the system architect and faculty advisor of the four-time world champion Cornell RoboCup team from 1999 to 2003. In addition, he’s a new media artist with exhibitions at various international venues, including the Venice Biennale, the FRAC Centre, and the National Gallery of Canada. Other creations and projects include the Flying Machine Arena, the Distributed Flight Array, the Balancing Cube, Cubli, Flight Assembled Architecture, the Blind Juggler, the Robotic Chair, and RoboEarth. His two TED talks, with almost 20 million views, have inspired a generation to pursue engineering, robotics, and computer science.

Presentations

When flying is cheaper than standing still Keynote

It's hard ignore the attention given to autonomy and robotics. The impact is significant and the reach is extensive, hitting transportation with self-driving cars, logistics and supply with mobile robots, and remote sensing applications with aerial vehicles or drones. Raffaello D'Andrea explores how autonomous indoor drones will drive the next wave of autonomous robotics development and growth.

Sergey Ermolin is a principal solutions architect (ML/DL/AI) for Amazon Web Services. Previously, he was a software solutions architect for deep learning, Spark analytics, and big data technologies at Intel. A Silicon Valley veteran with a passion for machine learning and artificial intelligence, Sergey has been interested in neural networks since 1996, when he used them to predict aging behavior of quartz crystals and cesium atomic clocks made by Hewlett-Packard. Sergey holds an MSEE and a certificate in mining massive datasets from Stanford and BS degrees in both physics and mechanical engineering from California State University, Sacramento.

Presentations

Build, train, and deploy predictive maintenance models at industrial scale (sponsored by AWS) Session

Sunil Mallya walks you through building complex ML-enabled products using reinforcement learning (RL), explores hardware design challenges and trade-offs, and details real-life examples of how any developer can up-level their RL skills through autonomous driving.

Using reinforcement learning to build recommendation systems with AWS SageMaker RL Tutorial

Sergey Ermolin and Vineet Khare provide a step-by-step overview on how to implement, train, and deploy a reinforcement learning (RL)-based recommender system with real-time multivariate optimization. They show you how leverage RL to implement a recommender system that optimizes an advertisement message that promotes adoption of merchant's services.

Carlos Escapa is the global AI and ML practice leader of the Consulting Partner Network at Amazon Web Services. Previously, he was the cofounder and CEO of VirtualSharp Software, where he led the company to a successful exit to Unitrends (Insight Venture Partners); the general manager of Southern Europe at VMware; vice president of channels at CA Technologies in Europe; and business development director at Sterling Software Japan. Carlos holds an MS in computer science from Virginia Tech and a BS from Illinois State University.

Presentations

Framing business problems as machine learning (sponsored by AWS) Session

Carlos Escapa takes a deep dive into how to identify use cases for ML, acquire cutting-edge best practices to frame problems in a way that key stakeholders and senior management can understand and support, and set the stage for delivering successful ML-based solutions for your business.

Mohamed Fawzy is senior manager and tech lead at Facebook. In his six years at the company, he’s worked on its distributed storage system and was part of the team that developed cold storage, Facebook’s exabyte archiver storage system that keeps your memories safe. More recently, he started the Distributed Training Group to build large-scale distributed training infrastructure for deep learning.

Presentations

Large-scale machine learning at Facebook: Implications of platform design on developer productivity Keynote

AI plays a key role in achieving Facebook's mission of connecting people and building communities. Nearly every visible product is powered by machine learning algorithms at its core, from delivering relevant content to making the platform safe. Kim Hazelwood and Mohamed Fawzy explain how applied ML has continued to change the landscape of the platforms and infrastructure at Facebook.

Ilya Feige is the director of AI at Faculty, where he leads the company’s research and development efforts and ensures that cutting-edge machine learning is used across all Faculty’s commercial data science projects. Previously, Ilya worked at McKinsey & Company, helping to deploy artificial intelligence for some of the world’s largest brands. He’s also an honorary senior research fellow in artificial intelligence at UCL. Ilya was awarded the Goldhaber prize for the best PhD in theoretical physics from Harvard University, and the Governor General’s award for the single highest ranked undergraduate student at McGill University.

Presentations

Concepts and tools for fairness, explainability, and robustness in machine learning Session

Ilya Feige explores AI safety concerns—explainability, fairness, and robustness—relevant for machine learning (ML) models in use today. With concepts and examples, he demonstrates tools developed at Faculty to ensure black box algorithms make interpretable decisions, do not discriminate unfairly, and are robust to perturbed data.

James Fletcher is a principal researcher with Grakn, investigating approaches to advance cognition and leveraging machine learning, automated reasoning, and a knowledge base.

Presentations

Why biotech needs knowledge graph convolutional networks for discovery Session

Statistical approaches alone are not sufficient to tackle the complexity of AI challenges today. Being smarter with the data we already have is critical to achieving machine understanding of any complex domain. James Fletcher explains how knowledge graph convolutional networks (KGCNs) demonstrate the usefulness of combining a connectionist deep learning approach with a symbolic approach.

Steve Flinter is the artificial intelligence practice lead at Mastercard Labs. He’s an IT professional with more than 20 years’ experience in industry, government, and academia. Previously, Steve was with the global Mastercard start path team, Mastercard’s startup engagement activity, where he supported fintech startup companies by connecting them to Mastercard and its global network of customers; managed an investment portfolio of approximately €120M in the software and computer science areas at Science Foundation Ireland (SFI), the Irish Government agency investing in academic research; and worked in various senior software development roles in a variety of industry verticals. Steve holds a BSc in computer applications from Dublin City University and a PhD in computer science specializing in artificial intelligence from Trinity College Dublin.

Presentations

Developing a modern, open source machine learning pipeline with Kubeflow Session

Steve Flinter and Ahmed Menshaw explore the work that Mastercard Labs undertook to build an end-to-end machine learning pipeline, suitable for both R&D and production, using Kubernetes and Kubeflow. They demonstrate how the pipeline can be defined, configured, connected to a data streaming service, and used to train and deploy a model, which can be exposed for inference via an API.

Ariadna Font Llitjós is the director of engineering of the Cortex Machine Learning (ML) Platform at Twitter, focusing on enabling teams and increasing their productivity, with the end goal to help facilitate a healthy public conversation and reduce miss information, and she’s the engineering site executive for the New York City office, responsible for growing the engineering and design teams, as well as making Twitter NYC the best place to work by instilling a culture of inclusiveness and growth. Ari is passionate about bringing innovation to market and making intelligent systems easy to use, always putting people at the center. Previously, Ari was director of product development and design principal at IBM Watson (data and AI), where she led a large global team of engineers focused on the Watson discovery portfolio. With her teams, she worked to infuse Watson solutions and applications with knowledge and natural language understanding, leveraging machine learning (ML), neural networks (NN), and natural language processing (NLP) techniques, turning unstructured data into knowledge in a way that improves both the offerings and the user experience. Leveraging Agile, Lean, design thinking, and Lean UX best practices, Ari has been leading teams of developers, designers, and researchers for the last eight years. She earned her PhD in language and information technologies in the School of Computer Science at Carnegie Mellon University. Her PhD research focused on improving machine translation quality and accuracy by developing a largely automated approach that used online postediting feedback to refine translation rules.

Presentations

Executive Briefing: Designing and building responsible AI Session

In the rapidly changing world of AI, adopting the right design principles is key. From data scientists and business users to client end users, IBM Watson always seeks to augment their capabilities. Ariadna Font Llitjós examines how IBM Watson applies ethical AI and user-centered design principles from the beginning and leverages them throughout the product development cycle.

Michael Friedrich is a Senior Computer Scientist for the Adobe Cloud Platform at Adobe. His team brings cloud operations to a new level, using machine learning to automate complex development and delivery processes, including by implementing automated canary analysis for deployments or researching new automated scaling solutions.

Prior to Adobe he was the chief software engineer for Hamburg Süd (now part of Maersk) and was working on their container routing software.

Presentations

About Space Invaders and automated scaling Session

Michael Friedrich and Stefanie Grunwald explore how an algorithm capable of playing Space Invaders can also improve your cloud service's automated scaling mechanism.

Siddha Ganju is a self-driving solutions architect at NVIDIA. Previously, she developed deep learning models for resource-constrained edge devices at Deep Vision. Her prior work ranges from visual question answering to generative adversarial networks to gathering insights from CERN’s petabyte-scale data. She was recently featured on Forbes‘s 30 under 30 list, and she’s been published at top-tier conferences including CVPR and NeurIPS. Serving as an AI domain expert, she’s also been guiding teams at NASA as well as featured as a jury member in several international tech competitions. She’s a graduate of Carnegie Mellon University.

Presentations

Deep learning on mobile Session

Over the last few years, convolutional neural networks (CNNs) have risen in popularity, especially in the area of computer vision. Many mobile applications running on smartphones and wearable devices would benefit from the new opportunities enabled by deep learning techniques. Siddha Ganju and Meher Kasam walk you through optimizing deep neural nets to run efficiently on mobile devices.

Arash Ghazanfari represents the office of the CTO in the UK and Ireland region for Dell Technologies. Arash serves as a field CTO, supporting the overall go-to-market strategy across the full breadth of the Dell Technologies ecosystem. Previously, he held senior roles at Intel Security, VMware, and other leading technology vendors in the high-tech sector.

Presentations

Unlocking data capital with AI (sponsored by Dell) Keynote

As we look toward more demanding applications of artificial intelligence to unlock value from data, it's increasingly essential to develop a sustainable big data strategy and to efficiently scale artificial intelligence initiatives. Arash Ghazanfari covers the fundamental principles that need to be considered in order to achieve this goal.

Biraja Ghoshal is a computer consultant with Tata Consultancy Service. He has 21 years of software development, architecture, and systems engineering expertise in information management and mining massive datasets technologies. Biraja assists clients to apply analytic capabilities using big data platforms to improve performance and optimize decision making with high-quality, actionable insights. Biraja is also interested in machine learning, cognitive computing, and artificial intelligence topics.

Presentations

Deep learning with TensorFlow Probability in cancer prediction with reporting confidence Session

Deep learning, which involves powerful black box predictors, has achieved state-of-the-art performance in medical imaging analysis, such as segmentation and classification for diagnosis, but knowing how much confidence there is in a prediction is essential for gaining clinicians' trust. Biraja Ghoshal explores probabilistic modeling with TensorFlow Probability in cancer prediction.

Martin Goodson is the chief scientist and CEO of Evolution AI, where he specializes in large-scale natural language processing. Martin has designed data science products that are in use at companies like Dun & Bradstreet, Time Inc., John Lewis, and Condé Nast. Previously, Martin was a statistician at the University of Oxford, where he conducted research on statistical matching problems for DNA sequences. He runs the largest community of machine learning practitioners in Europe, Machine Learning London, and convenes the CBI/Royal Statistical Society roundtable, AI in Financial Services. Martin’s work has been covered by publications such as the Economist, Quartz, Business Insider, TechCrunch, and others.

Presentations

The dangers of data leakage in production machine learning systems Session

Data leakage occurs when the model gains access to data that it shouldn't have. AI systems can fail catastrophically in production if leakage is not dealt with properly. Martin Goodson details the four main manifestations of data leakage and explains how to recognize the warning signs. By mastering several key scientific principles, you can mitigate the risk of failure.

Vignesh Gopakumar is a machine learning engineer specializing in fusion research with the United Kingdom Atomic Energy Authority. He spends his time building machine learning algorithms to model physics systems that help gain more understanding of the underlying phenomenons. He designs algorithms that help discover anomalies as well as predict malfunction of engineering systems. He’s working on building a model that can be augmented in real time when exposed to different physics principles.

Presentations

A data-driven approach to model the physics of superheated gas hitting a wall Session

Vignesh Gopakumar explores image mapping of the temporal evolution of physics parameters as plasma interacts with the reactor wall using a data-inferred approach. The model captures how parameters such as temperature and density evolve across space and time. By analyzing the patterns found in simulation data, the model learns the existing physics relations implicitly defined within the data.

Stefanie Grunwald is a senior data and platform engineer with Adobe Experience Cloud. As a trained software architect, she’s been working in the field of data science and data engineering since 2011, applying the best practices from software engineering to building intelligent data platforms. With her heart devoted to DataOps and the OSS community, she and her team help others at Adobe become data-driven through automation and real-time insights.

Presentations

About Space Invaders and automated scaling Session

Michael Friedrich and Stefanie Grunwald explore how an algorithm capable of playing Space Invaders can also improve your cloud service's automated scaling mechanism.

Adam Grzywaczewski is a deep learning solution architect at NVIDIA, where his primary responsibility is to support a wide range of customers in delivery of their deep learning solutions. Adam is an applied research scientist specializing in machine learning with a background in deep learning and system architecture. Previously, he was responsible for building up the UK government’s machine-learning capabilities while at Capgemini and worked in the Jaguar Land Rover Research Centre, where he was responsible for a variety of internal and external projects and contributed to the self-learning car portfolio.

Presentations

Developing perception algorithms for autonomous vehicles Session

Developing perception algorithms for autonomous vehicles is incredibly difficult, as they need to operate in thousands of driving conditions and locations. Adam Grzywaczewski explores the challenges involved in data collection, processing, and management, as well as model development and validation. He also provides an overview of the necessary hardware and software infrastructure.

Rebecca Gu is a senior project manager at Electron, where she’s building blockchain-based digital infrastructure for the energy industry. Previously, Rebecca was an economist at Baringa Partners, where she wrote a discussion paper on the topic of algorithmic pricing. On the human side, she’s brought forward economic evidence in the first UK High Court case to go to trial for damages against a cartelist. On the machine side, her interest lies in how machine learning and AI are changing what we might consider cartels. Machine learning is also broadly changing the way we shop, the way firms price, and how governments and industry should work together to face these challenges.

Presentations

Executive Briefing: A look at the future of online pricing and algorithm-led collusion Session

In a future of widespread algorithmic pricing, cooperation between algorithms is easier than ever, resulting in coordinated price rises. Rebecca Gu and Cris Lowery explore how a Q-learner algorithm can inadvertently reach a collusive outcome in a virtual marketplace, which industries are likely to be subject to greater restrictions or scrutiny, and what future digital regulation might look like.

Ritika Gunnar is the vice president of data and AI expert services and learning at IBM. She and her team work with clients on their transformation journey through a data- and AI-first methodology and help implement data and AI solutions using deep knowledge-based skills to accelerate the adoption of data and AI enterprise capabilities. Previously, Ritika was the vice president of offerings for IBM Watson, where she was responsible for all of Watson’s Data and AI portfolio, defining the portfolio strategy, execution of product offerings, and driving business results; she was vice president of IBM’s worldwide cloud and cognitive GTM organization, drove IBM’s mission to apply deep cloud, data, and AI expertise to its clients’ most-pressing needs; she was the vice president of IBM’s data and analytics business, where she was responsible for the setting the strategy and execution of IBM’s data platform, data science, and analytics practices; she led and managed IBM’s master data management and information integration and governance business and IBM’s data warehousing and analytics business; she joined IBM as a software engineer in 1999. Ritika holds a bachelor’s of science in computer science and an executive master’s in business from the University of Texas at Austin. She currently resides in New York.

Presentations

For AI to thrive, failure is necessary: A practical guide (sponsored by IBM Watson) Keynote

Ritika Gunnar explores why you need to focus on your organization’s culture and build a data-first approach to shape a strong, AI-ready organization.

For AI to thrive, failure is necessary: A practical guide (sponsored by IBM Watson) Session

Ritika Gunnar explores why you need to focus on your organization’s culture and build a data-first approach to shape a strong, AI-ready organization.

Charlotte Han processes data and computes brand and digital strategies for a living. Thanks to growing up in Asia, becoming American in Silicon Valley, and now living in Europe, she’s learned not take things for granted and to make connections where they may not seem apparent. She’s highly interested in all things tech, especially how technologies can advance human lives, and enjoys networking with the misfits, the rebels, and the troublemakers who aren’t afraid to shake things up and push the boundaries of what is possible. Connect with her on Twitter as @sunsiren or on LinkedIn.

Presentations

Executive Briefing: Will you learn Chinese to advance in AI? Session

According to research by AI2, China is poised to overtake the US in the most-cited 1% of AI research papers by 2025. The view that China is a copycat but not an innovator may no longer be true. Charlotte Han explores what the implications of China's government funding, culture, and access to massive data pools mean to AI development and how the world could benefit from such advancement.

Kristian Hartikainen is a visiting scholar in the Robotics and AI Lab (RAIL) at UC Berkeley, working with Sergey Levine and Tuomas Haarnoja, and will begin his PhD studies at the University of Oxford with Simon Whiteson in fall 2019. His research focus is on the development of model-free deep reinforcement learning algorithms for robotic control. He’s also working on Ray RLlib, a scalable reinforcement learning library, and Ray Tune, a distributed framework for model training. Kristian is the author and maintainer of Softlearning, the official soft actor-critic project. Previously, he spent several years as a software engineer working on statistical analysis and machine learning products at Statwing and Qualtrics.

Presentations

Scalable AI and reinforcement learning with Ray Tutorial

Edward Oakes, Peter Schafhalter, and Kristian Hartikainen take a deep dive into Ray, a new distributed execution framework for distributed AI applications developed by machine learning and systems researchers at RISELab, and explore Ray’s API and system architecture and sharing application examples, including several state-of-the-art distributed training, hyperparameter search, and RL algorithms.

Kim Hazelwood is a senior engineering manager leading the AI Infrastructure Foundation and AI Infrastructure Research efforts at Facebook, where the focus is designing and optimizing efficiency hardware and software systems for Facebook’s many applied machine learning-based products and services. Previously, Kim was a tenured associate professor at the University of Virginia, a software engineer at Google, and director of systems research at Yahoo Labs. She’s been recognized with an NSF CAREER Award, the Anita Borg Early Career Award, the MIT Technology Review Top 35 Innovators under 35 Award, and the ACM SIGPLAN 10-Year Test of Time Award. She serves on the board of directors of CRA and has authored over 50 conference papers and one book. She holds a PhD in computer science from Harvard University.

Presentations

Large-scale machine learning at Facebook: Implications of platform design on developer productivity Keynote

Thomas Henson is a data engineering advocate and senior systems engineer for the unstructured data solutions team at Dell EMC. Thomas has been involved in many different big data, analytics, and artificial intelligence projects throughout his career, with a focus on distributed systems. He’s a proud alumnus of the University of North Alabama, where he earned his undergraduate and graduate degree in computer information systems. Thomas is an accomplished speaker in the artificial intelligence and big data ecosystem at various conferences.

Presentations

AI growing pains: Platform considerations for moving from POC to large-scale deployments (sponsored by Dell Technologies) Session

As machine learning and deep learning techniques reach mainstream adoption, the architectural considerations for platforms that support large-scale production deployments of AI applications change significantly as you mature beyond small-scale sandbox and POC environments. Thomas Henson walks you through eliminating I/O bottlenecks to keep your GPU-powered AI rocket ship fueled with data.

Adithya Hrushikesh is an operational intelligence lead for data science at Vodafone, where he leads a team of data scientists and data engineers to build data products.

Presentations

Automating customer complaints classification in German Session

Every day, millions of Vodafone Germany customers reach out through various social media channels about issues related to mobile, internet, signal issues, etc. Adithya Hrushikesh details how to build and deploy an ensemble model to classify 26 (originally 56) complaint classes using machine learning over deep learning. He also touches on the business case, data product development, and GDPR.

Ihab Ilyas is a professor in the Cheriton School of Computer Science at the University of Waterloo, where his research focuses on the areas of big data and database systems, with special interest in data quality and integration, managing uncertain data, rank-aware query processing, and information extraction. Ihab is also a cofounder of Tamr, a startup focusing on large-scale data integration and cleaning. He’s a recipient of the Ontario Early Researcher Award (2009), a Cheriton faculty fellowship (2013), an NSERC Discovery Accelerator Award (2014), and a Google Faculty Award (2014), and he’s an ACM Distinguished Scientist. Ihab is an elected member of the VLDB Endowment board of trustees and an associate editor of ACM Transactions of Database Systems (TODS). He holds a PhD in computer science from Purdue University, West Lafayette.

Presentations

The quest for high-quality data Keynote

Ihab Ilyas highlights the data-quality problem and describes the HoloClean framework, a state-of-the-art prediction engine for structured data with direct applications in detecting and repairing data errors, as well as imputing missing labels and values.

Alex Ingerman is a product manager at Google AI, focusing on federated learning and other privacy-preserving technologies. His mission is to enable all ML practitioners to protect their users’ privacy by default. Previously, Alex worked on ML-as-a-service platforms for developers, web-scale search, content recommendation systems, and immersive data exploration and visualization. Alex lives in Seattle, where as a frequent bike and occasional kayak commuter, he has fully embraced the rain. Alex holds a BS in computer science and an MS in medical engineering.

Presentations

Federated learning introduction and examples with TensorFlow Federated Session

Federated learning is the approach of training ML models across many devices without collecting the data in a central location. Alex Ingerman explores learning concepts and the use cases for decentralized machine learning, drawing on Google's real-world deployments. You'll learn how to build your first federated models with the open source TensorFlow Federated.

Jewel James is a data scientist at Gojek.

Presentations

Using ML for personalizing food search at Gojek Session

GoFood, Gojek's food delivery product, is one of the largest of its kind in the world. Jewel James and Mudit Maheshwari explain how they prototyped the search framework that personalizes the restaurant search results by using ML to learn what constitutes a relevant restaurant given a user's purchasing history.

Katharine Jarmul is a cofounder of KIProtect and is a passionate and internationally recognized data scientist, programmer, and lecturer. Her work and research focuses on securing data for data science workflows. Previously, she held numerous roles at large companies and startups in the US and Germany, implementing data processing and machine learning systems with a focus on reliability, testability, and security. She’s an author for O‘Reilly and frequent keynote speaker at international software conferences.

Presentations

Executive Briefing: Advances in privacy for machine learning systems Session

Katharine Jarmul sates your curiosity about how far we've come in implementing privacy within machine learning systems. She dives into recent advances in privacy measurements and explains how this changed the approach of privacy in machine learning. You'll discover new techniques including differentially private data collection, federated learning, and homomorphic techniques.

Jeff Jonas is the founder and CEO of Senzing and is an acclaimed data scientist and the leading creator of entity resolution systems. He founded Senzing with the goal of making entity resolution technology available for everyone everywhere. For more than three decades, he’s been at the forefront of solving complex big data problems for companies and governments. He’s a three-time entrepreneur and sold his last company to IBM in 2005. Previously, he was an IBM fellow and chief scientist of context computing at IBM, where he led a team focused on creating next-generation AI for entity resolution technology.

Presentations

Real-time AI for entity resolution Keynote

Entity resolution—determining “who is who” and “who is related to whom”—is essential to almost every industry, including banking, insurance, healthcare, marketing, telecommunications, social services, and more. Jeff Jonas details how you can use a purpose-built real-time AI, created for general-purpose entity resolution, to gain new insights and make better decisions faster.

Andreas Kaltenbrunner is senior director of data analytics at NTENT, where he leads a team focused on user behavior analysis and improvements for ranking in mobile search. Andreas is also teaching a master course on data-driven social analytics at Universitat Pompeu Fabra and is involved in research activities centered on computational social science, social media and social network analysis, areas in which he has coauthored more than 70 publications. Previously, he led the Social Media Research Line at the Barcelona Media technology center and led the Digital Humanities Research Unit at the Eurecat technology center. He earned his PhD in computer science and digital communication from the Universitat Pompeu Fabra, with a thesis about stochastic effects in human and neural communication patterns.

Presentations

Executive Briefing: How the growth of voice-based AI stands to blur the lines of big data Session

Voiced-based AI continues to gain popularity among customers, businesses, and brands, but it’s important to understand that, while it presents a slew of new data at our disposal, the technology is still in its infancy. Andreas Kaltenbrunner examines three ways voice assistants will make big data analytics more complex and the various steps you can take to manage this in your company.

Ahmed Kamal is the machine learning platform lead at Careem, where he’s working on developing machine learning services and data infrastructure. Previously, he worked on building the data science platform at Seeloz. He’s passionate about building data products that change people’s lives.

Presentations

Scaling machine learning at Careem Session

Every day Careem’s platform relies on machine learning (ML) in production to enable the movement of millions of its users. Ahmed Kamal outlines the challenges Careem faced while productionizing ML on scale and explains how to build an in-house ML platform that facilitates development and fast deployment of scalable ML services and accelerates the impact of ML everywhere.

Manas Ranjan Kar is a Associate Vice President at US healthcare company Episource, where he leads the NLP and data science practice, works on semantic technologies and computational linguistics (NLP), builds algorithms and machine learning models, researches data science journals, and architects secure product backends in the cloud. He’s architected multiple commercial NLP solutions in the area of healthcare, food and beverages, finance, and retail. Manas is deeply involved in functionally architecting large-scale business process automation and deep insights from structured and unstructured data using NLP and ML. He’s contributed to NLP libraries including gensim and Conceptnet 5 and blogs regularly about NLP on forums like Data Science Central, LinkedIn, and his blog Unlock Text. Manas speaks regularly about NLP and text analytics at conferences and meetups, such as PyCon India and PyData, has taught hands-on sessions at IIM Lucknow and MDI Gurgaon, and has mentored students from schools including ISB Hyderabad, BITS Pilani, and the Madras School of Economics. When bored, he falls back on Asimov to lead him into an alternate reality.

Presentations

NLP for healthcare: Feature engineering and model diagnostics Session

Natural language processing (NLP) is hard, especially for clinical text. Manas Ranjan Kar explains the multiple challenges of NLP for clinical text and why it's so important that we invest a fair amount of time on domain-specific feature engineering. It’s also crucial to understand to diagnose an NLP model performance and identify possible gaps.

Meher Kasam is an iOS software engineer at Square and is a seasoned software developer with apps used by tens of millions of users every day. He’s shipped features for a range of apps from Square’s point of sale to the Bing app. Previously, he worked at Microsoft, where he was the mobile development lead for the Seeing AI app, which has received widespread recognition and awards from Mobile World Congress, CES, FCC, and the American Council of the Blind, to name a few. A hacker at heart with a flair for fast prototyping, he’s won close to two dozen hackathons and converted them to features shipped in widely used products. He also serves as a judge of international competitions including the Global Mobile Awards and the Edison Awards.

Presentations

Deep learning on mobile Session

Arun Kejariwal is an independent lead engineer. Previously, he was he was a statistical learning principal at Machine Zone (MZ), where he led a team of top-tier researchers and worked on research and development of novel techniques for install-and-click fraud detection and assessing the efficacy of TV campaigns and optimization of marketing campaigns, and his team built novel methods for bot detection, intrusion detection, and real-time anomaly detection; and he developed and open-sourced techniques for anomaly detection and breakout detection at Twitter. His research includes the development of practical and statistically rigorous techniques and methodologies to deliver high performance, availability, and scalability in large-scale distributed clusters. Some of the techniques he helped develop have been presented at international conferences and published in peer-reviewed journals.

Presentations

Herding cats: Product management in the machine learning era Tutorial

Sequence to sequence (S2S) modeling for time series forecasting Session

Ganes Kesari is a cofounder and head of analytics at Gramener, where he leads analytics and innovation in data science, advising enterprises on deriving value from data science initiatives and leading applied research in deep learning at Gramener AI Labs. He’s passionate about the confluence of machine learning, information design, and data-driven business leadership and strives to simplify and demystify data science.

Presentations

Predicting the quality of life from satellite imagery Session

In many countries, policy decisions are disconnected from data, and very few avenues exist to understand deeper demographic and socioeconomic insights. Ganes Kesari and Soumya Ranjan explain how satellite imagery can be a powerful aid when viewed through the lens of deep learning. When combined with conventional data, it can help answer important questions and show inconsistencies in survey data.

Vineet Khare is a manager of applied science at AWS, where he’s led the research and development efforts for multiple AWS products, including SageMaker built-in algorithms, SageMaker RL, AWS DeepRacer, and AWS Ground Truth. He’s presented his research at international conferences including PPSN, EMO, GECCO, and SEAL. He’s also conducted SageMaker workshops and tutorials at AWS events such as the annual conference, re:Invent.

Presentations

Using reinforcement learning to build recommendation systems with AWS SageMaker RL Tutorial

Anastasia Kouvela is a principal at A.T. Kearney with more than 10 years in the advisory space. She leads international large-scale operations transformations across industries and is well known for delivering high-impact transformation programs that address postmerger operations integrations, cost optimization, complexity reduction, and supply chain and logistics optimization. Anastasia is particularly passionate about analytics and AI.

Presentations

Executive Briefing: From laggard to leader—Winning the AI race Session

The Analytics Impact Index gives organizations an understanding of the value potential of analytics as well as the capabilities required to capture the most value. Anastasia Kouvela and Bharath Thota walk you through the 2019 results and the analytics journey of leading global organizations and empower companies to develop a case for change.

Akshay Kulkarni is a senior data scientist with SapientRazorfish’s core AI and data science team, where he’s part of strategy and transformation interventions through AI, manages high priority growth initiatives around data science and works on various machine learning, deep learning, natural language processing, and artificial intelligence engagements by applying state-of-the-art techniques, as well as a renowned AI and machine learning evangelist, an author, and a speaker. He was recently recognized as one of the “top 40 under 40 data scientists” in India by Analytics India Magazine. He’s consulted with several Fortune 500 and global enterprises in driving AI and data science-led strategic transformations. Akshay has a rich experience of building and scaling AI and machine learning businesses and creating significant client impact. He’s actively involved in next gen AI research and is also a part of next gen AI community. Previously, he was part of Gartner and Accenture, where he scaled the AI and data science business. He’s a regular speaker at major data science conferences recently gave a talk on “Sequence Embeddings for Prediction Using Deep Learning” at GIDS. He’s the author of a book on NLP with Apress and currently authoring couple more books with Packt on deep learning and next gen NLP. He is also a visiting faculty (industry expert) at few of the top universities in India. In his spare time, he likes to read, write, code, and help aspiring data scientists.

Presentations

Text analytics 101: Deep learning and attention networks all the way to production Tutorial

An estimated 80% of data generated is an unstructured format, such as text, an image, audio, or video. Vijay Srinivas Agneeswaran, Pramod Singh, and Akshay Kulkarni explore how to create a language model that generates natural language text by implementing and forming a recurrent neural network and attention networks built on top of TensorFlow 2.0.

Abhishek Kumar is a senior manager of data science in Publicis Sapient’s India office, where he looks after scaling up the data science practice by applying machine learning and deep learning techniques to domains such as retail, ecommerce, marketing, and operations. Abhishek is an experienced data science professional and technical team lead specializing in building and managing data products from conceptualization to the deployment phase and interested in solving challenging machine learning problems. Previously, he worked in the R&D center for the largest power-generation company in India on various machine learning projects involving predictive modeling, forecasting, optimization, and anomaly detection and led the center’s data science team in the development and deployment of data science-related projects in several thermal and solar power plant sites. Abhishek is a technical writer and blogger as well as a Pluralsight author and has created several data science courses. He’s also a regular speaker at various national and international conferences and universities. Abhishek holds a master’s degree in information and data science from the University of California, Berkeley. Abhishek has spoken at past O’Reilly conferences, including Strata 2019, Strata 2018, and AI 2019.

Presentations

Industrialized capsule networks for text analytics Session

Abhishek Kumar outlines how to industrialize capsule networks by detailing capsule networks and how capsule networks help handle spatial relationships between objects in an image and how to apply them to text analytics and tasks such as NLU or summarization. Join in to see a scalable, productionizable implementation of capsule networks over KubeFlow.

Marta Kwiatkowska is a professor of computing systems and fellow of Trinity College, University of Oxford. She’s known for fundamental contributions to the theory and practice of model checking for probabilistic systems. She led the development of the PRISM model checker, the leading software tool in the area. Probabilistic model checking has been adopted in diverse fields, including distributed computing, wireless networks, security, robotics, healthcare, systems biology, DNA computing, and nanotechnology, with genuine flaws found and corrected in real-world protocols. Marta was awarded two ERC Advanced Grants, VERIWARE and FUN2MODEL, and is a coinvestigator of the EPSRC Programme Grant on Mobile Autonomy. She was honored with the Royal Society Milner Award in 2018 and the Lovelace Medal in 2019 and is a Fellow of the Royal Society, ACM and BCS, and Member of Academia Europea.

Presentations

When to trust AI Keynote

Machine learning solutions are revolutionizing AI, but Marta Kwiatkowska explores their instability against adversarial examples—small perturbations to inputs that can catastrophically affect the output—which raises concerns about the readiness of this technology for widespread deployment.

Holger Kyas is Open Group Board Member for OpenCA Architecture Certifications, Enterprise Architect at Helvetia Insurances and Adjunct Professor at the University of Applied Sciences Bern in Switzerland. He has presented at international conferences like “IBM World of Watson” or “Insurance AI and Analytics Europe.”

Presentations

Implementing an AI multicloud broker Session

Holger Kyas details the AI multicloud broker, which is triggered by Amazon Alexa and mediates between AWS Comprehend (Amazon), Azure Text Analytics (Microsoft), GCP Natural Language (Google), and Watson Tone Analyzer (IBM) to compare and analyze sentiment. The extended AI part generates new sentences (e.g., marketing slogans) with a recurrent neural network (RNN).

Lyndon Leggate is the founder and chief technology officer at Deep, a data science and machine learning consultancy that assists a variety of clients with technology strategy and data solutions, a senior technology leader with extensive defining and delivering complex technical solutions on large, business critical projects for consumer facing brands, and a Machine Learning AWS Hero. He’s a founder of Nino City, an early stage ed tech startup trying to help parents make children’s screen time more productive (implemented as a 100% serverless solution on AWS). Alongside his day job, Lyndon is a keen participant in the AWS DeepRacer league. Racing as Etaggel, he’s regularly positioned in the top 10, is featured in DeepRacer TV, and in May 2019, established the AWS DeepRacer Community. This vibrant and rapidly growing community provides a space for new and experienced racers to seek advice and share tips. The community has gone on to expand the DeepRacer tool sets, making the platform more accessible and pushing the bounds of the technology. He also organizes the AWS DeepRacer London Meetup series.

Presentations

Making reinforcement learning practical for real-world developers (sponsored by AWS) Session

Lyndon Leggate walks you through a step-by-step demonstration of how you can up level your reinforcement learning (RL) skills through autonomous driving.

Chang Liu is an applied research scientist at Georgian Partners and a member of the Georgian impact team, where she draws on her in-depth knowledge of mathematical and combinatorial optimization to help Georgian’s portfolio companies. Previously, Chang was a risk analyst at Manulife Bank, where she built models to assess the bank’s risk exposure based on extensive market research, including evaluating and predicting the impact of the oil price drop to the mortgage lending risks in Alberta in 2014. Chang holds a master of applied science in operations research from the University of Toronto, where she specialized in combinatorial optimization, and a bachelor’s degree in mathematics from the University of Waterloo.

Presentations

Building differentially private machine learning models using TensorFlow Session

The world is increasingly data driven, and people have developed an awareness and concern for their data. Chang Liu and Ji Chao Zhang examine differential privacy—the component of the TensorFlow Privacy library that allows users to train differentially private logistic regression and support vector machines—along with real-world use cases and demonstrations for how to apply the tools.

Ben Lorica is the chief data scientist at O’Reilly. Ben has applied business intelligence, data mining, machine learning, and statistical analysis in a variety of settings, including direct marketing, consumer and market research, targeted advertising, text mining, and financial engineering. His background includes stints with an investment management company, internet startups, and financial services.

Presentations

Building and deploying AI applications and systems at scale Keynote

Details to come.

Thursday opening welcome Keynote

Program chairs Ben Lorica, Roger Chen, and Alexis Helzer open the second day of keynotes.

Wednesday opening welcome Keynote

Program chairs Ben Lorica, Roger Chen, and Alexis Helzer open the first day of keynotes.

Cristobal Lowery is a senior manager and team lead for Baringa Partners’s modeling and machine learning centre of excellence, where he led the creation of Baringa’s data science and analytics team and supported our clients in their journeys to become leaders in artificial intelligence. Previously, he was an independent data science consultant in an investment bank and for a leading Formula 1 team. Cristobal is a passionate advocate of artificial intelligence and its potential to transform businesses. He holds two first-class master’s degrees in quantitative subjects and has published and patented a machine learning system.

Presentations

Executive Briefing: A look at the future of online pricing and algorithm-led collusion Session

Angie Ma is a cofounder and chief operating officer of Faculty, a London-based AI technology company that provides products and services in strategy, software, and skills. Faculty has delivered more than 300 commercial data science projects across 23 sectors and 8 countries. Angie is passionate about real-world applications of machine learning that generate business value for companies and organizations and has experience delivering complex projects from prototyping to implementation. She supports senior leaders to build AI capability, advising on skills transformation. A physicist by training, previously, Angie was a researcher in nanotechnology working on developing optical detection for medical diagnostics.

Presentations

AI for executives 2-Day Training

Angie Ma and Richard Sargeant offer a condensed introduction to key AI and machine learning concepts and techniques, showing you what is (and isn't) possible with these exciting new tools and how they can benefit your organization.

Mark Madsen is a Fellow at Teradata, where he’s responsible for understanding, forecasting, and defining analytics ecosystems and architectures. Previously, he was CEO of Third Nature, where he advised companies on data strategy and technology planning, and vendors on product management. Mark has designed analysis, machine learning, data collection, and data management infrastructure for companies worldwide.

Presentations

Executive Briefing: The black box—Interpretability, reproducibility, and data management Session

The growing complexity of data science leads to black box solutions that few people in an organization understand. Mark Madsen explains why reproducibility—the ability to get the same results given the same information—is a key element to build trust and grow data science use. And one of the foundational elements of reproducibility (and successful ML projects) is data management.

Mudit Maheshwari is a product engineer at Gojek working with the GoFood search team focused on providing relevant results to the user. Previously, he’s worked on developing and designing scalable, reliable, and fault-tolerant systems for one of the biggest food delivery business.

Presentations

Using ML for personalizing food search at Gojek Session

Michael W. Mahoney is a professor in the Department of Statistics and the International Computer Science Institute (ICSI) at the University of California, Berkeley. He works on the algorithmic and statistical aspects of modern large-scale data analysis. He’s also the director of the NSF/TRIPODS-funded Foundations of Data Analysis (FODA) Institute at UC Berkeley. Much of his recent research has focused on large-scale machine learning, including randomized matrix algorithms and randomized numerical linear algebra, geometric network analysis tools for structure extraction in large informatics graphs, scalable implicit regularization methods, computational methods for neural network analysis, and applications in genetics, astronomy, medical imaging, social network analysis, and internet data analysis. Previously, he worked and taught in the Mathematics Department at Yale University, at Yahoo Research, and in the Mathematics Department at Stanford University. Among other things, he’s on the national advisory committee of the Statistical and Applied Mathematical Sciences Institute (SAMSI), he was on the National Research Council’s Committee on the Analysis of Massive Data, he co-organized the Simons Institute’s fall 2013 and 2018 programs on the foundations of data science, and he runs the biennial MMDS Workshops on Algorithms for Modern Massive Data Sets. He earned his PhD from Yale University with a dissertation in computational statistical mechanics. More information is available at https://www.stat.berkeley.edu/~mmahoney/.

Presentations

Principled tools for analyzing weight matrices of production-scale deep neural networks Session

Developing theoretically principled tools to guide the use of production-scale neural networks is an important practical challenge. Michael Mahoney explores recent work from scientific computing and statistical mechanics to develop such tools, covering basic ideas and their use for analyzing production-scale neural networks in computer vision, natural language processing, and related tasks.

Ted Malaska is a director of enterprise architecture at Capital One. Previously, he was the director of engineering in the Global Insight Department at Blizzard; principal solutions architect at Cloudera, helping clients find success with the Hadoop ecosystem; and a lead architect at the Financial Industry Regulatory Authority (FINRA). He has contributed code to Apache Flume, Apache Avro, Apache Yarn, Apache HDFS, Apache Spark, Apache Sqoop, and many more. Ted is a coauthor of Hadoop Application Architectures, a frequent speaker at many conferences, and a frequent blogger on data architectures.

Presentations

Executive Briefing: Optimizing for skill sets—Data engineers, data scientists, and analysts Session

While at a big tech conference on AI, it's important to reflect on the human components. Ted Malaska walks you through scenarios and strategies to help different groups work together and explains how to evaluate success and sniff out trouble areas. You'll look at every part of the pipeline to see who's involved and how to optimize the interaction points throughout the pipeline—and how to have fun.

Tobias is a technology sociologist and political scientist. He started whoelse.ai in 2013 with the idea to make Internet services explainable in a simplified, child-like, language. Since September 2018 the project operates as an R&D company based in Berlin.

Tobias studied Public Policy at the Humboldt-Viadrina School of Governance in Berlin, Technology Sociology at the University of Vienna, and European Studies at the City University Bremen and International University of Cairo. Since 2010 he works for public agencies, venture-financed startups, consulting companies, and in research and education. From 2011 to 2014 he was appointed as Young Advisor to EU Commissioner Neelie Kroes on the Digital Agenda for Europe.

Presentations

Make Alexa and Siri speak with each other: Toward a universal grammar in AI Session

More than 50% of all interactions between humans and machines are expected to be speech-based by 2022. The challenge: Every AI interprets human language slightly different. Tobias Martens details current issues in NLP interoperability and uses Chomsky's theory of universal hard-wired grammar to outline a framework to make the human voice in AI universal, accountable, and computable.

Ian Massingham is a developer evangelist and is part of the technical leadership team at Amazon Web Services, where he draws on over two decades of expertise in internet technologies, technology operations leadership, architecture, and software engineering to help customers bring their ideas to life through technology. In his time at AWS, Ian has helped developers and other technical end users in companies of all sizes, from startups to large enterprises, apply cloud computing technologies, solve business problems, and exploit market opportunities.

Presentations

Start your engines: Making deep reinforcement learning accessible to all developers (sponsored by AWS) Keynote

Reinforcement learning is an advanced machine learning technique that makes short-term decisions while optimizing for a longer-term goal through trial and error. Ian Massingham dives into state-of-the-art techniques in deep reinforcement learning for a variety of use cases.

Ajit Mathews is the vice president of machine learning software engineering at AMD, where he’s the engineering leader responsible for the design and development of Radeon Open Compute (ROCm) machine intelligence software spanning deep learning frameworks, compilers, language runtimes, libraries and Linux compute kernel. Ajit is also responsible for the machine learning software road map and strategy. He’s passionate about distributed machine learning and high-performance computing. Ajit holds a master’s degree in computer science and an MBA from Kellogg.

Presentations

ROCm and Hopsworks for end-to-end deep learning pipelines Session

Ahmed Menshawy is a machine learning engineer in the AI Practice within the R&D Group at Mastercard Labs, where he works on a wide range of problems related to the application of AI and machine learning to Mastercard’s products and services. Ahmed is interested in studying the overlap between knowledge, logic, language, and learning. In particular, his focus is in how machine learning can be used for distilling large amounts of unstructured, semistructured, and structured data with hidden patterns into new knowledge about the world by using methods ranging from deep learning to statistical relational learning. Ahmed has authored two books, Deep Learning with TensorFlow and Deep Learning by Example, which focus on advanced deep learning topics. Ahmed has a BSc in computer science and an MSc in machine learning from Helwan University, Cairo, Egypt.

Presentations

Developing a modern, open source machine learning pipeline with Kubeflow Session

Umberto Michelucci is a cofounder and the chief AI scientist at TOELT LLC, a company aiming to develop new and modern teaching, coaching, and research methods for AI to make AI technologies and research accessible to every company and everyone. He’s an expert in numerical simulation, statistics, data science, and machine learning. In addition to several years of research experience at the George Washington University (US) and the University of Augsburg (DE), he has 15 years of practical experience in the fields of data warehouse, data science, and machine learning. His last book, Applied Deep Learning—A Case-Based Approach to Understanding Deep Neural Networks, was published by Springer in 2018, and he’s working on a new book, Convolutional and Recurrent Neural Networks Theory and Applications. He’s very active in research in the field of artificial intelligence. He publishes his research results regularly in leading journals and gives regular talks at international conferences. Umberto studied physics and mathematics. Sharing is caring—for that, he is a lecturer at the ZHAW University of Applied Sciences for deep learning and neural networks theory and applications and at the HWZ University of Applied Science for big data analysis and statistics. At Helsana Versicherung AG, he’s also responsible for research and collaborations with universities in the area of AI.

Presentations

Convolutional neural networks for image recognition in Keras and TensorFlow 2-Day Training

Convolutional neural networks (CNNs) are the basis of many algorithms that deal with images, from image recognition and classification to object detection. Using practical examples, Umberto Michelucci walks you through developing convolutional neural networks, using pretrained networks, and even teaching a network to paint. TensorFlow or Keras will be used for all examples.

Laurence Moroney is a developer advocate on the Google Brain team at Google, working on TensorFlow and machine learning. He’s the author of dozens of programming books, including several best sellers, and a regular speaker on the Google circuit. When not Googling, he’s also a published novelist, comic book writer, and screenwriter.

Presentations

Zero to hero with TensorFlow 2.0 Session

Laurence Moroney explores how to go from wondering what machine learning (ML) is to building a convolutional neural network to recognize and categorize images. With this, you'll gain the foundation to understand how to use ML and AI in apps all the way from the enterprise cloud down to tiny microcontrollers using the same code.

Josh Muncke is a Principal in the Commercial team at Faculty. Prior to joining Faculty, Josh was the Director of Data Science for Red Bull where he was responsible for the deployment of a wide range of AI and machine-learning initiatives across the Sales, Marketing, Distribution, and Media business units. Josh has ten years of experience in building and leading Data Science teams and projects and previously worked as a Manager at Deloitte – specializing in developing AI roadmaps for clients in the FMCG and Retail sectors. Josh has a degree in Physics from the University of Manchester.

Presentations

AI for executives 2-Day Training

Paco Nathan is known as a “player/coach” with core expertise in data science, natural language processing, machine learning, and cloud computing. He has 35+ years of experience in the tech industry, at companies ranging from Bell Labs to early-stage startups. His recent roles include director of the Learning Group at O’Reilly and director of community evangelism at Databricks and Apache Spark. Paco is the cochair of Rev conference and an advisor for Amplify Partners, Deep Learning Analytics, Recognai, and Primer. He was named one of the "top 30 people in big data and analytics" in 2015 by Innovation Enterprise.

Presentations

Executive Briefing: Unpacking AutoML Session

Paco Nathan outlines the history and landscape for vendors, open source projects, and research efforts related to AutoML. Starting from the perspective of an AI expert practitioner who speaks business fluently, Paco unpacks the ground truth of AutoML—translating from the hype into business concerns and practices in a vendor-neutral way.

Tim Nugent pretends to be a mobile app developer, game designer, tools builder, researcher, and tech author. When he isn’t busy avoiding being found out as a fraud, Tim spends most of his time designing and creating little apps and games he won’t let anyone see. He also spent a disproportionately long time writing his tiny little bio, most of which was taken up trying to stick a witty sci-fi reference in…before he simply gave up. He’s writing Practical Artificial Intelligence with Swift for O’Reilly and building a game for a power transmission company about a naughty quoll. (A quoll is an Australian animal.)

Presentations

Building, teaching, and training simulations for machine learning with a game engine Session

Practical on-device AI and ML using Swift Session

Edward Oakes is a second-year PhD student at UC Berkeley and a contributor to the Ray project. Previously, he worked on isolation mechanisms for serverless computing and infrastructure for microservice deployments.

Presentations

Scalable AI and reinforcement learning with Ray Tutorial

Richard Ott obtained his PhD in particle physics from the Massachusetts Institute of Technology, followed by postdoctoral research at the University of California, Davis. He then decided to work in industry, taking a role as a data scientist and software engineer at Verizon for two years. When the opportunity to combine his interest in data with his love of teaching arose at The Data Incubator, he joined and has been teaching there ever since.

Presentations

Deep learning with PyTorch 2-Day Training

PyTorch is a machine learning library for Python that allows you to build deep neural networks with great flexibility. Its easy-to-use API and seamless use of GPUs make it a sought-after tool for deep learning. Join Rich Ott to get the knowledge you need to build deep learning models using real-world datasets and PyTorch.

Vanja Paunic is a data scientist in the Algorithms and Data Science Group at Microsoft London. She works on building machine learning solutions with external companies utilizing Microsoft’s AI Cloud Platform. She holds a PhD in computer science with a focus on data mining in the biomedical domain from the University of Minnesota.

Presentations

Using the Azure Cloud to Scale Up Hyperparameter Optimization for Machine Learning

Hyperparameter optimization for machine leaning is a complex task that requires advanced optimization techniques and can be implemented as a generic framework decoupled from the specific details of algorithms. We show how such a framework can be applied to learning unrelated tasks like object detection and text matching in a transparent, scalable, and easy to manage way in a cloud service.

Pedram Pejman is a technical program manager on the TensorFlow Extended team at Google Brain, on a mission to create the best inference experience on TensorFlow. Previously, he managed some of Google Cloud’s internal efforts in the machine intelligence space while getting to work on distributed systems and Kubernetes. Aside from building infrastructure, he enjoys writing music, playing soccer, and scrolling through memes.

Presentations

TFX: Production ML pipelines with TensorFlow Tutorial

Brett Phaneuf is the founder and chief executive of Submergence Group (US) and MSubs (UK), and through his office in the United Kingdom, he overseas the design and production of manned and unmanned, underwater vehicle systems. A serial entrepreneur, Brett recently turned his attention to machine learning and artificial intelligence; a new company (Marine Ai) has been spun out from MSubs with the goal of creating cognitive AI to enhance maritime capabilities by drawing on decades of experience in manned and unmanned marine vehicle design, manufacture and operations, coupled with vast experience in automation and autonomous systems software architecture, and computer vision expertise. Brett is also one of three founding board members of Promare, a nonprofit (501( c )(3)) public charity founded in 2001 to promote marine exploration throughout the world. Through the confluence of these varied and interrelated fields of endeavor, Brett leads the development of the Mayflower Autonomous Ship, which will sail from Plymouth, UK, to Plymouth, US, in commemoration of the 400th anniversary for the original Mayflower sailing in September 1620. The Mayflower Autonomous Ship is a Promare project but will draw on the expertise resident in Submergence Group, MSubs, Marine Ai, Promare, and many other private and corporate sponsors. Previously, Brett studied physics before switching to archaeology, and then worked as a classical archaeologist on ancient sites in North Africa. His love of physics, technology, and history lead him to marine archaeology and the founding of Promare, through which numerous underwater archaeological research programs have been carried out in the past two decades.

Presentations

Autonomous ship: The Mayflower project (sponsored by IBM Watson) Session

Brett Phaneuf outlines how similar types of AI can fit into your company solutions and how technologies like containers, deep learning, cloud, machine learning, and more all fit together to drive innovation for the "new world" of the future.

Thomas Phelan is cofounder and chief architect of BlueData. Previously, a member of the original team at Silicon Graphics that designed and implemented XFS, the first commercially availably 64-bit file system; and an early employee at VMware, a senior staff engineer and a key member of the ESX storage architecture team where he designed and developed the ESX storage I/O load-balancing subsystem and modular pluggable storage architecture as well as led teams working on many key storage initiatives such as the cloud storage gateway and vFlash.

Presentations

Deep learning with Horovod and Spark using GPUs and Docker containers Session

Today, organizations understand the need to keep pace with new technologies when it comes to performing data science with machine learning and deep learning, but these new technologies come with their own challenges. Thomas Phelan demonstrates the deployment of TensorFlow, Horovod, and Spark using the NVIDIA CUDA stack on Docker containers in a secure multitenant environment.

How to deploy large-scale distributed data analytics and machine learning on containers (sponsored by HPE) Session

Join Thomas Phelan to learn whether the combination of containers with large-scale distributed data analytics and machine learning applications is like combining oil and water or like peanut butter and chocolate.

Philip Pilgerstorfer is a data scientist at QuantumBlack, where he’s a contributor to QuantumBlack’s internal R&D efforts on causal inference. He’s delivered projects in manufacturing, oil and gas, motorsports, and pharmaceuticals. Philip’s academic background is in econometrics, statistics, and machine learning.

Presentations

Executive Briefing: Fusing data and design Session

Soumya Ranjan is a data scientist at Gramener AI Labs specializing in using deep learning and machine learning techniques to solve problems across verticals like healthcare, environment conservation, and safety. He’s passionate about data and adores narrating beautiful stories around it, thanks to his experience in building data visualization tools and libraries covering real-time election analysis and visualization. Soumya strongly believes in making quality education free and accessible. To this end, he teaches at universities, is involved in discussing AI/ML curriculum, and has worked as a code reviewer and mentor at Udacity and Thinkful.

Presentations

Predicting the quality of life from satellite imagery Session

Matt Armstrong-Barnes is a chief technologist at HPE with a passion for artificial intelligence, systems integration, and DevOps. A strategic leader, he works to drive innovative technical solutions across industries and technology trends. He has a diverse background in the IT industry, with over 25 years of experience, and has held numerous senior leadership positions, winning, architecting, and delivering sizable, complex transformation programs. Matt holds a BSc (Hons.) in computing science. He’s a fellow of the Institute of Engineering and Technology, a Chartered Fellow of the British Computer Society, and a Chartered Engineer.

Presentations

Artificial intelligence: Friend or foe? (sponsored by HPE) Session

Advances in artificial intelligence have meant that it's now more accessible than ever before—and this accessibility means that it can be both the hunter and the hunted. In the race to ensure cybersecurity, AI is an essential tool to protect your most sensitive assets. Join Matt Armstrong-Barnes to find out how this new dimension is changing the threat landscape and how to make AI your friend.

Walter Riviera is an AI technical solution specialist (TSS) covering EMEA at Intel, playing an active role on most of the AI project engagements within the data centers business in Europe. He’s responsible for increasing business awareness regarding the Intel AI offer, enabling and providing technical support to end user customers, independent software vendors (ISVs), original equipment manufacturers (OEMs), partners in implementing high-performance computing (HPC) and/or cloud solutions for AI based on Intel’s products and technologies. Previously, Walter collected research experience working on adopting ML techniques to enhance image-retrieval algorithms for robotic applications, conducting sensitive data analysis in a startup environment, and developing software for text-to-speech applications.

Presentations

Accelerate with purpose Keynote

Walter Riviera details three key shifts in the AI landscape—incredibly large models with billions of hyperparameters, massive clusters of compute nodes supporting AI, and the exploding volume of data meeting ever-stricter latency requirements—how to navigate them, and when to explore hardware acceleration.

AI beyond the buzzword: Do it well or do it twice! Session

What are the essentials steps to take in order to develop an AI solution? How long would this process would take? As machine learning is teaching us, the answers can be learned from previous experience. Walter Riviera walks you through a collection of real-life stories, looking for successful and misleading behavioral patterns.

Carlos Rodrigues is a lead cloud engineer and data scientist at Siemens Cyber Defense Department. Previously, Carlos worked at a financial institution that manages more than £20 billion of assets, helping them to design a data-driven strategy, among other things. During his spare time, Carlos teaches postgraduates in data science at Rumos.

Presentations

Fighting cybercrime with AI Session

An evolving landscape of cyber threats demands innovation. It's time to bring AI to the fight. Carlos Rodrigues explains why it's mandatory to use bleeding-edge AI in production to improve threat detection in a worldwide company such as Siemens. The corporate network has more than 500,000 endpoint and more than 370,000 employees. The attack vectors are endless; thus, legacy approaches don't scale.

Tom Sabo is a principal solutions architect at SAS. He’s been immersed in the field of text analytics as it applies to federal government challenges since 2005. Tom presents work internationally on diverse topics including modeling applied to government procurement, best practices in social media analysis, and using analytics to leverage and predict research trends. He also served on a panel for the Institute of Medicine’s Standing Committee on Health Threats Resilience to inform DHS and OHA on social media strategies. He holds a bachelor’s degree in cognitive science and a master’s in computer science, both from the University of Virginia.

Presentations

An artificial intelligence framework to counter international human trafficking Session

Efforts to counter human trafficking internationally must assess data from a variety of sources to determine where best to devote limited resources. Tom Sabo explores text-based machine learning, rule-based text extraction to generate training data for modeling efforts, and interactive visualization to improve international trafficking response.

Mathew Salvaris is a data scientist at Microsoft. Previously, Mathew was a data scientist for a small startup that provided analytics for fund managers; a postdoctoral researcher at UCL’s Institute of Cognitive Neuroscience, where he worked with Patrick Haggard in the area of volition and free will, devising models to decode human decisions in real time from the motor cortex using electroencephalography (EEG); and a postdoc in the University of Essex’s Brain Computer Interface Group, where he worked on BCIs for computer mouse control. Mathew holds a PhD in brain-computer interfaces and an MSc in distributed artificial intelligence.

Presentations

Azure AI reference architectures Session

Deploying machine learning models on the edge Session

When IoT meets AI, a new round of innovations begins. Yan Zhang and Mathew Salvaris examine the methodology, practice, and tools around deploying machine learning models on the edge. They offer a step-by-step guide to creating an ML model using Python, packaging it in a Docker container, and deploying it as a local service on an edge device as well as deployment on GPU-enabled edge devices.

Training and deploying Python models on Azure Tutorial

Richard Sargeant is the chief commercial officer at Faculty. Richard supports senior leaders across a variety of sectors to transform their businesses to use AI effectively. Previously, he was director of transformation at the Home Office, where he oversaw the creation of the second most advanced in-house machine learning capability in government; he was one of the founding directors of the UK’s Government Digital Service; and he was at Google. He has also worked at the Prime Minister’s Strategy Unit and HM Treasury. He is a nonexec on the Board of Exeter University, and the Government’s Centre for Data Ethics and Innovation. He has a degree in political philosophy, economics, and social psychology from Cambridge University.

Presentations

AI for executives 2-Day Training

Alejandro Saucedo is chairman at the Institute for Ethical AI & Machine Learning. In his more than 10 years of software development experience, Alejandro has held technical leadership positions across hypergrowth scale-ups and tech giants including Eigen Technologies, Bloomberg LP, and Hack Partners. Alejandro has a strong track record of building multiple departments of machine learning engineers from scratch and leading the delivery of numerous large-scale machine learning systems across the financial, insurance, legal, transport, manufacturing, and construction sectors in Europe, the US, and Latin America.

Presentations

A practical guide toward algorithmic bias and explainability in machine learning Session

Alejandro Saucedo demystifies AI explainability through a hands-on case study, where the objective is to automate a loan-approval process by building and evaluating a deep learning model. He introduces motivations through the practical risks that arise with undesired bias and black box models and shows you how to tackle these challenges using tools from the latest research and domain knowledge.

Peter Schafhalter is a first-year PhD student at UC Berkeley’s RISELab. His focus is AI systems, which involves writing software that makes AI run quickly, securely, explainably, and in a way that’s resilient to failures. He’s building an operating system for self-driving cars based on Ray.

Presentations

Scalable AI and reinforcement learning with Ray Tutorial

Tuhin Sharma is a cofounder and CTO of Binaize, an AI-based firm. Previously, he was a data scientist at IBM Watson and Red Hat, where he mainly worked on social media analytics, demand forecasting, retail analytics, and customer analytics, and he worked at multiple startups, where he built personalized recommendation systems to maximize customer engagement with the help of ML and DL techniques across multiple domains like fintech, ed tech, media, and ecommerce. He’s filed five patents and published four research papers in the field of natural language processing and machine learning. He holds a postgraduate degree in computer science and engineering, specializing in data mining, from the Indian Institute of Technology Roorkee. He loves to play table tennis and guitar in his leisure time. His favorite quote is, “Life is beautiful.”

Presentations

Anomaly detection in smart buildings using federated learning Session

There's an exponential growth in the number of internet-enabled devices on modern smart buildings. IoT sensors measure temperature, lighting, IP camera, and more. Tuhin Sharma and Bargava Subramanian explain how they built anomaly-detection models using federated learning—which is privacy preserving and doesn't require data to be moved to the cloud—for data quality and cybersecurity.

Julien Simon is a technical evangelist at AWS. Previously, Julien spent 10 years as a CTO and vice president of engineering at a number of top-tier web startups. He’s particularly interested in all things architecture, deployment, performance, scalability, and data. Julien frequently speaks at conferences and technical workshops, where he helps developers and enterprises bring their ideas to life thanks to the Amazon Web Services infrastructure.

Presentations

A pragmatic introduction to building NLP models Session

Many natural language processing (NLP) tasks require each word in the input text to be mapped to a vector of real numbers. Julien Simon explores word vectors, why they’re so important, and which are the most popular algorithms to compute them (Word2Vec, GloVe, BERT). You'll get to see how to solve typical NLP problems through several demos by either computing embeddings or reusing pretrained ones.

Pramod Singh is a senior machine learning engineer at Walmart Labs. He has extensive hands-on experience in machine learning, deep learning, AI, data engineering, designing algorithms, and application development. He has spent more than 10 years working on multiple data projects at different organizations. He’s the author of three books Machine Learning with PySpark, Learn PySpark, and Learn TensorFlow 2.0. He’s also a regular speaker at major conferences such as the O’Reilly Strata Data and AI Conferences. Pramod holds a BTech in electrical engineering from BATU, and an MBA from Symbiosis University. He’s also done data science certification from IIM–Calcutta. He lives in Bangalore with his wife and three-year-old son. In his spare time, he enjoys playing guitar, coding, reading, and watching football.

Presentations

Text analytics 101: Deep learning and attention networks all the way to production Tutorial

Gianmario Spacagna is the chief scientist and head of AI at Helixa. His team’s mission is building the next generation of behavior algorithms and models of human decision making with careful attention to their potential and effects on society. His experience covers a diverse portfolio of machine learning algorithms and data products across different industries. Previously, he worked as a data scientist in IoT automotive (Pirelli Cyber Technology), retail and business banking (Barclays Analytics Centre of Excellence), threat intelligence (Cisco Talos), predictive marketing (AgilOne), plus some occasional freelancing. He’s a coauthor of the book Python Deep Learning, contributor to the “Professional Manifesto for Data Science,” and founder of the Data Science Milan community. Gianmario holds a master’s degree in telematics (Polytechnic of Turin) and software engineering of distributed systems (KTH of Stockholm). After having spent half of his career abroad, he now lives in Milan. His favorite hobbies include home cooking, hiking, and exploring the surrounding nature on his motorcycle.

Presentations

Audience projection of target consumers over multiple domains: A NER and Bayesian approach Session

AI-powered market research is performed by indirect approaches based on sparse and implicit consumer feedback (e.g., social network interactions, web browsing, or online purchases). These approaches are more scalable, authentic, and suitable for real-time consumer insights. Gianmario Spacagna proposes a novel algorithm of audience projection able to provide consumer insights over multiple domains.

Bargava Subramanian is a cofounder and deep learning engineer at Binaize in Bangalore, India. He has 15 years’ experience delivering business analytics and machine learning solutions to B2B companies. He mentors organizations in their data science journey. He holds a master’s degree from the University of Maryland, College Park. He’s an ardent NBA fan.

Presentations

Anomaly detection in smart buildings using federated learning Session

Zaid Tashman is a R&D data scientist at Accenture Labs exploring new research problems in the areas of probabilistic programming, casual inference, and stochastic optimization. Zaid has a progressive experience in recommendation systems, customer behavior analysis, survival modeling, failure time prediction, hierarchical Bayesian networks, and anomaly detection. Previously, Zaid was a senior data scientist at ABB where he led the analytics efforts within ABB’s IoT platform serving all of their business units and a senior data scientist at Spacetime Insights, a Silicon Valley IoT startup where he successfully led and completed many machine learning projects in areas of predictive maintenance, anomaly detection, fraud detection, and optimization. Zaid holds a MSc in electrical engineering from Washington State University.

Presentations

Rethinking predictive maintenance Session

Today traditional approaches to predictive maintenance fall short. Zaid Tashman dives into a novel approach to predict rare events using a probabilistic model, the mixed membership hidden Markov model, highlighting the model's interpretability, its ability to incorporate expert knowledge, and how the model was used to predict the failure of water pumps in developing countries.

Bharath Thota is a vice president with A.T. Kearney’s analytics practice with over 14 years of deep expertise in the application of data science, advanced analytics, and technology to help clients with analytics transformation, improve business performance, drive operational excellence, and become more insight driven. He’s contributed to research and written on the topic of big data.

Presentations

Executive Briefing: From laggard to leader—Winning the AI race Session

Wee Hyong Tok is a principal data science manager with the AI CTO Office at Microsoft, where he leads the engineering and data science team for the AI for Earth program. Wee Hyong has worn many hats in his career, including developer, program and product manager, data scientist, researcher, and strategist, and his track record of leading successful engineering and data science teams has given him unique superpowers to be a trusted AI advisor to customers. Wee Hyong coauthored several books on artificial intelligence, including Predictive Analytics Using Azure Machine Learning and Doing Data Science with SQL Server. Wee Hyong holds a PhD in computer science from the National University of Singapore.

Presentations

Azure AI reference architectures Session

Training and deploying Python models on Azure Tutorial

Ciprian Tomoiaga is a computer vision engineer at AXA, where he builds models to assist the company’s 105 million clients. Previously, he spent one year at CERN, where he developed software for designing its future accelerators; then he pursued a master’s degree at École Polytechnique Fédérale de Lausanne, Switzerland, where he aided the time machine project to uncover data from the past and studied how languages evolve; this led him into handwriting recognition, which he applied to AXA’s challenging documents. He contributed to his team’s new approach for handwritten field extraction and recognition and their work was published in ICDAR’19 conference. As an undergraduate at the University of Manchester, Ciprian chaired student programming societies and contributed to the creation of now-renowned hackathons. He is still a hacker at heart, and this motivates him to be part of REV, AXA’s innovation department that puts data and AI to good use for AXA’s clients.

Presentations

More info from your documents: AI handwriting recognition and automatic parsing (sponsored by AXA) Session

Your company has a large amount of data locked into thousands or millions of scanned paper documents. You'd like to extract and analyze it, but you first have to prove that your algorithm works and brings business value. Ciprian Tomoiaga explains how to start.

Jameson Toole is the cofounder and CEO of Fritz AI, a company building tools to help developers optimize, deploy, and manage machine learning models on mobile devices. Previously, he built analytics pipelines for Google X’s Project Wing and ran the data science team at Boston technology startup Jana Mobile. He holds undergraduate degrees in physics, economics, and applied mathematics from the University of Michigan and both an MS and PhD in engineering systems from MIT, where he worked on applications of big data and machine learning to urban and transportation planning at the Human Mobility and Networks Lab.

Presentations

Creating smaller, faster, production-worthy mobile machine learning models Session

Getting machine learning models ready for use on device is a major challenge. Drag-and-drop training tools can get you started, but the models they produce aren’t small enough or fast enough to ship. Jameson Toole walks you through optimization, pruning, and compression techniques to keep app sizes small and inference speeds high.

Arun Verma is the head of the quantitative research solutions team at Bloomberg. He also serves on the board of a nonprofit that helps with humanitarian projects in India serving impoverished children and women in the areas of education and vocational training. Since he joined the Bloomberg Quantitative Research Group, Arun has worked on stochastic volatility models for derivatives and exotics pricing and hedging (e.g., variance swaps and VIX Futures fair pricing and cross-currency volatility surface construction) and at the intersection of diverse areas such as data science, innovative quantitative finance models across asset classes, and using machine learning methods to help reveal embedded signals in traditional and alternative data that can be used to construct quantitative trading strategies. He holds a PhD in computer science and applied mathematics from Cornell University and a bachelor of technology in computer science from IIT Delhi. Arun lives in central New Jersey with his lovely wife and two children.

Presentations

Extracting trading signals from alternative data using machine learning Session

To gain an edge in the markets, quantitative hedge fund managers require automated processing to quickly extract actionable information from unstructured and increasingly nontraditional sources of data. Arun Verma shares NLP, AI, and ML techniques that help extract derived signals that have significant trading alpha or risk premium and lead to profitable trading strategies.

Bruno Wassermann is a research staff member at IBM Research – Haifa, where he’s worked on parts of the distributed systems infrastructure of Watson Developer Cloud, is trying to help SREs make better sense of monitoring and log data, and, more recently, has begun working on some of the issues that arise from the productionization of machine learning applications.

Presentations

Clue: Evaluate the impact of your new training pipeline on existing models in production Session

Imagine there's a new version of your complex machine learning pipeline, but you need to make sure it doesn't negatively impact the performance of large numbers of existing customer models in production. Bruno Wassermann explains how IBM Research tackled the challenge for the natural language understanding layer of the IBM Watson Assistant service and demonstrates a new tool called Clue.

SVP of 7bulls.com – a software house that provides AI technology and competences. 7bulls technically supports AI Investments, the startup which aims to build a complete, AI-based investment solution. Konrad is a co-creator and a substantive leader of the team developing machine learning solutions for leading global companies. He has extensive experience in the finance and investments field. He participated in Polish and international research projects in the field of computer systems with high processing power and AI technology. He was responsible for creating and implementing transaction software used by large international financial institutions (Standard Bank, Rabobank, Credit Agricole, mBank, First Data) including Algo Trading solutions.

Presentations

AI for financial time series forecasting and dynamic assets portfolio optimization Session

Real business usage of most advanced methods for financial time series forecasting (based on winning methods from M4 competition) and assets portfolio optimization (based on Monte Carlo Tree Search with neural networks - Alpha Zero approach). Complete investments platform with the AI workflow and real time integration with the brokers. Real usage demo.

Emily Webber is a machine learning specialist solutions architect at Amazon Web Services (AWS). She guides customers from project ideation to full deployment, focusing on Amazon SageMaker, where her customers are household names across the world, such as T-Mobile. She’s been leading data science projects for many years, piloting the application of machine learning into such diverse areas as social media violence detection, economic policy evaluation, computer vision, reinforcement learning, the IoT, drones, and robotic design. Previously, she was a data scientist at the Federal Reserve Bank of Chicago and a solutions architect for an explainable AI startup in Chicago. Her master’s degree is from the University of Chicago, where she developed new applications of machine learning for public policy research with the Data Science for Social Good Fellowship.

Presentations

Public policy and deep reinforcement learning on AWS Keynote

If you've ever wondered if you could use AI to inform public policy, join Emily Webber as she combines classic economic methods with AI techniques to train a reinforcement learning agent on decades of randomized control trials. You'll learn about classic philosophical foundations for public policy decision making and how these can be applied to solve the problems that impact the many.

Qun Ying is a senior product manager on the AI platform team within the Cloud and AI Division at Microsoft.

Presentations

Introducing a new anomaly-detection algorithm (SR-CNN) inspired by computer vision Session

Anomaly detection may sound old fashioned, yet it's super important in many industry applications. Tony Xing, Bixiong Xu, Congrui Huang, and Qun Ying detail a novel anomaly-detection algorithm based on spectral residual (SR) and convolutional neural network (CNN) and explain how this method was applied in the monitoring system supporting Microsoft AIOps and business incident prevention.

Ji Chao Zhang is the director of software engineering and a member of the Georgian impact team. In that role, he leads its internal software engineering efforts and supports portfolio engagements.

Presentations

Building differentially private machine learning models using TensorFlow Session

Yan Zhang is a senior data scientist with the algorithm and data science team of the Data Group within Cloud and Enterprise at Microsoft. She builds predictive analytics models and generalizes machine learning solutions on the cloud machine learning platform. Her recent research includes cost prediction and fraud claim detection in the healthcare domain, predictive maintenance in IoT applications, customer segmentation, and text mining. Previously, she was a research faculty member at Syracuse University. Yan earned her PhD in data mining from the Computer Science Department at the University of Vermont. She’s the author of 23 publications, including journal articles, conference papers, and blog posts. Her first paper won the best paper award at the 17th IEEE International Conference on tools with artificial intelligence. She’s one of the reviewers for the book Predictive Analytics with Microsoft Azure Machine Learning, second edition, published in September 2015.

Presentations

Deploying machine learning models on the edge Session

Zhe Zhang is a senior manager of core big data infrastructure at LinkedIn, where he leads an excellent engineering team to provide big data services (Hadoop distributed file system (HDFS), YARN, Spark, TensorFlow, and beyond) to power LinkedIn’s business intelligence and relevance applications. Zhe’s an Apache Hadoop PMC member; he led the design and development of HDFS Erasure Coding (HDFS-EC).

Presentations

Improve the speed of ML innovations at LinkedIn Session

Machine learning (ML) engineering differs fundamentally from traditional software engineering in the level of uncertainty and unpredictability of an idea until fully verified in production. Join Zhe Zhang to explore the deciding factor in ML-based products (e.g., recommendation, ranking)—the speed of the trial-and-error loop.

Machine learning challenges at LinkedIn: Spark, TensorFlow, and beyond Keynote

From people you may know (PYMK) to economic graph research, machine learning is the oxygen that powers how LinkedIn serves its 630M+ members. Zhe Zhang provides you with an architectural overview of LinkedIn’s typical machine learning pipelines complemented with key types of ML use cases.

Weifeng Zhong is a senior research fellow at the Mercatus Center at George Mason University. His work focuses on bridging the field of natural language processing and machine learning to economic policy studies. His other research interests include the political economy, US-China economic relations, and China’s economic issues. Weifeng is a core maintainer of the open source Policy Change Index (PCI) project, a framework that uses machine learning to “read” large volumes of text and detect subtle, structural changes embedded in it. As a first use case, the PCI for China is an algorithm that can predict China’s policy changes using the information in the government’s official newspaper. The PCI framework has received significant academic interest and media coverage. The resources of this project are freely available at Policychangeindex.org. Weifeng has been published in a variety of scholarly journals, including the Journal of Institutional and Theoretical Economics. His research and writings have been featured in the Financial Times, Foreign Affairs, The National Interest, Real Clear Markets, Real Clear Politics, the South China Morning Post, and the Wall Street Journal, among others.

Presentations

Learning structural changes from text data Session

Weifeng Zhong explores a novel method to learn structural changes embedded in unstructured texts based on the Policy Change Index (PCI) framework developed by economists Julian Chan and Weifeng Zhong. He explains how an off-the-shelf application of deep learning—with an important twist—can help you detect structural break points in time series text data.