Deep neural networks are responsible for many advances in natural language processing, computer vision, speech recognition, and even forecasting. However, these networks usually require vast amounts of data and are computationally expensive to train. Even if you obtain the hardware to enable the efficient training of deep neural networks, it can still take quite a long time to do so—especially since you also often have to tune the model and try different network architectures. As a result, it’s important to have an environment for building deep learning solutions in which one can explicitly consider trade-offs in this process between aspects such as training time versus cost. The environment should also allow you to conduct exploratory analyses, schedule programmatic training, enable real-time scoring, and enable batch scoring.
Danielle Dean and Wee Hyong Tok illustrate how cloud computing has been leveraged to complete several deep learning projects for the scenarios outlined above, using the Azure Data Science Virtual Machine with Deep Learning Toolkit, Azure Batch Shipyard, Spark on HDInsight, and even deep learning within SQL Server. Along the way, Danielle and Wee Hyong share several practical tips for approaching deep learning projects in several industry settings, including healthcare, manufacturing, and utilities, based on their real-world experience.
Although several of the technologies covered are specific to Microsoft’s Azure cloud computing platform, the different stages and approaches to a deep learning project are applicable across a range of technologies, many of them open source, such as Microsoft Cognitive Toolkit (previously known as CNTK) for deep learning.
Wee Hyong Tok is a principal data science manager with Microsoft. Wee Hyong has worn many hats in his career, including developer, program and product manager, data scientist, researcher, and strategist, and his track record of leading successful engineering and data science teams has given him unique superpowers to be a trusted AI advisor to customers. Wee Hyong coauthored several books on artificial intelligence, including Predictive Analytics Using Azure Machine Learning and Doing Data Science with SQL Server. Wee Hyong holds a PhD in computer science from the National University of Singapore.
Danielle Dean is a principal data scientist lead in AzureCAT within the Cloud AI Platform Division at Microsoft, where she leads an international team of data scientists and engineers to build predictive analytics and machine learning solutions with external companies utilizing Microsoft’s Cloud AI Platform. Previously, she was a data scientist at Nokia, where she produced business value and insights from big data through data mining and statistical modeling on data-driven projects that impacted a range of businesses, products, and initiatives. Danielle holds a PhD in quantitative psychology from the University of North Carolina at Chapel Hill, where she studied the application of multilevel event history models to understand the timing and processes leading to events between dyads within social networks.
©2017, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • firstname.lastname@example.org