For more than 16 years, Dow Jones has supplied risk and compliance data to banking and financial institutions, corporations, and governments across the world, sharing defined, structured content sets of people and entities used to manage third-party risk: anti-money laundering, antibribery and corruption, sanctions, and reputational risk. In order to achieve a comprehensive coverage guided by international regulation and guidance since 2002, the company follows very high editorial standards and research methodologies, combined with state-of the-art machine learning techniques, to manage 30 risk categories 24 hours a day in over 70 languages.
Dow Jones recently wanted to apply a new approach to the existing content delivery pipeline with the objectives to eliminate low-level, repeatable, manual processes, enabling researchers to focus on strategic tasks; gain intelligence from global media and research tools, scanning and monitoring almost 2 million articles per week; and achieve near-real-time risk data detection and delivery capabilities. Yulia Zvyagelskaya and Victor Llorent explain how the company created an AI-powered Risk & Compliance data research solution that uses natural language processing for risk profiles creation and management. Along the way, they highlight the unstructured data preprocessing stage, model selection criteria, and neural networks parameter tuning processes to provide scalability and performance in order to achieve mentioned key objectives.
Yulia Zvyagelskaya is a data scientist at Dow Jones, where she’s responsible for the development and implementation of machine learning applications. Yulia has developed several AI-driven projects in the fields of computer vision and natural language processing. She holds master’s degrees in NLP (computational linguistics and artificial intelligence) and big data management and analytics. Yulia has won several international artificial intelligence and big data competitions.
Victor Llorente is a technology program manager at Dow Jones, where he’s responsible for data strategy applications in the professional information business. Victor has worked in several workflow automation and AI-driven projects using BPM engines, big data, and data science technologies. He holds master’s degrees in computer engineering from Polytechnic University of Catalonia, Barcelona, and computer science from the Royal Institute of Technology, Stockholm, as well as an MBA from Instituto de Empresa, Madrid.
©2019, O'Reilly Media, Inc. • (800) 889-8969 or (707) 827-7019 • Monday-Friday 7:30am-5pm PT • All trademarks and registered trademarks appearing on oreilly.com are the property of their respective owners. • firstname.lastname@example.org