Machine learning on source code is a new area of research in the field of artificial intelligence, which, unlike classical problems such as image segmentation, does not yet have established standard techniques. For instance there are standard methods for processing images that make machine learning algorithms pay attention to their two-dimensionality. However, there are currently no common techniques for encoding the semantic structure of source code. Therefore, you need new ways to mathematically represent the code of projects. This technology offers a variety of possible applications, for example, in the area of static code analysis or in the automatic selection of relevant test cases.
Thomas Endres and Samuel Hopstock share methods for transferring classic machine learning approaches to this new field of expertise. Along the way, Thomas and Samuel detail approaches for both automatic and manual training data generation and offer an overview of suitable models and machine learning frameworks for this challenge. They conclude by exploring the possibilities of using such models for the analysis of code.
Thomas Endres is a partner at TNG, where he works as an IT consultant. Besides his normal work for the company and the customers, he creates various prototypes, like a telepresence robotics system with which you can see reality through the eyes of a robot, or an augmented reality AI that shows the world from the perspective of an artist. He’s working on various applications in the fields of AR, VR, and AI and gesture control, putting them to use, for example, in autonomous or gesture-controlled drones. But he’s also involved in other open source projects written in Java, C#, and all kinds of JavaScript languages. Thomas studied IT at the TU Munich and is passionate about software development and all the other aspects of technology. As an Intel Software Innovator and Black Belt, he promotes new technologies like AI, AR, and VR and robotics around the world. For this he’s received, among others, a JavaOne Rockstar award.
Samuel Hopstock is working toward his bachelor’s degree in computer science at the Technical University of Munich. He’s also a working student at TNG Technology Consulting in Unterföhring, where he is currently involved in the development of software in the field of machine learning with Python and Java. He is interested in any new technological developments, especially involving Android.
For exhibition and sponsorship opportunities, email aisponsorships@oreilly.com
For information on trade opportunities with O'Reilly conferences, email partners@oreilly.com
View a complete list of AI contacts
©2018, O’Reilly UK Ltd • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • confreg@oreilly.com