Sep 9–12, 2019
Alberto Andreotti

Alberto Andreotti
Senior Data Scientist, John Snow Labs


Alberto Andreotti is a senior data scientist on the Spark NLP team at John Snow Labs, where he implements state-of-the-art NLP algorithms on top of Spark. He has a decade of experience working for companies including Motorola, Intel, and Samsung and as a consultant, specializing in the field of machine learning. Alberto has written lots of low-level code in C/C++ and was an early Scala enthusiast and developer. A lifelong learner, he holds degrees in engineering and computer science and is working on a third in AI. Alberto was born in Argentina. He enjoys the outdoors, particularly hiking and camping in the mountains of Argentina.


4:00pm4:40pm Wednesday, September 11, 2019
Location: LL21 C/D
Stacy Ashworth (SelectData), Alberto Andreotti (John Snow Labs)
Much business data still exists as challenging scanned or snapped documents. Stacy Ashworth and Alberto Andreotti explore a real-world case of reading, understanding, classifying, and acting on facts extracted from such image files using state-of-the-art, open source, deep learning-based optical character recognition (OCR), natural language processing (NLP), and forecasting libraries at scale. Read more.

Contact us

For conference registration information and customer service

For more information on community discounts and trade opportunities with O’Reilly conferences

Become a sponsor

For information on exhibiting or sponsoring a conference

For media/analyst press inquires