Over the last few years, convolutional neural networks (CNN) have risen in popularity, especially in computer vision. Anirudh Koul explains how to bring the power of deep learning to memory- and power-constrained devices like smartphones and drones.
Many mobile applications running on smartphones and wearable devices would potentially benefit from the accuracy of deep learning techniques. Also, local execution allows data to stay on the mobile device, hence avoiding latency issues of data transmission to the cloud and also alleviating privacy concerns. However, CNNs, by nature, are computationally expensive and memory intensive, making them challenging to deploy on a mobile device. Anirudh explores various strategies to circumvent these obstacles and build mobile-friendly shallow CNN architectures to significantly reduce the memory footprint, making CNNs easier to store on a smartphone. By comparing a family of model compression techniques to prune the network size for live image processing, you can build a CNN version optimized for inference on mobile devices. Anirudh also covers practical strategies to preprocess your data in a manner that makes the models more efficient in the real world.
Anirudh showcases these techniques using a real-world project, as well as tips and tricks, to demonstrate how to get started developing your own deep learning application suitable for deployment on storage- and power-constrained mobile devices. Similar techniques can also be applied to make deep neural nets more efficient when deploying in a regular cloud-based production environment, reducing the number of GPUs required and optimizing on cost.
Anirudh Koul is a head of AI and research at Aira, noted by Time magazine as one of the best inventions of 2018. He’s a noted AI expert and O’Reilly author, including the upcoming Practical Deep Learning for Cloud and Mobile. Previously, he was a scientist at Microsoft AI, where he founded Seeing AI, the most-used technology among the blind community after the iPhone. With features shipped to a billion users, he brings over a decade of production-oriented applied research experience on petabyte-scale datasets. He’s been developing technologies using AI techniques for augmented reality, robotics, speech, productivity, and accessibility. Some of his recent work, which IEEE has called “life-changing,” has been honored by CES, FCC, Cannes Lions, American Council of the Blind, showcased at events by the UN, the White House, the House of Lords, the World Economic Forum, Netflix, National Geographic, and applauded by world leaders including Justin Trudeau and Theresa May.
©2017, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • firstname.lastname@example.org
Apache Hadoop, Hadoop, Apache Spark, Spark, and Apache are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries, and are used with permission. The Apache Software Foundation has no affiliation with and does not endorse, or review the materials provided at this event, which is managed by O'Reilly Media and/or Cloudera.