Sep 9–12, 2019
Alberto Andreotti

Alberto Andreotti
Senior Data Scientist, John Snow Labs


Alberto Andreotti is a senior data scientist on the Spark NLP team at John Snow Labs, where he implements state-of-the-art NLP algorithms on top of Spark. He has a decade of experience working for companies including Motorola, Intel, and Samsung and as a consultant, specializing in the field of machine learning. Alberto has written lots of low-level code in C/C++ and was an early Scala enthusiast and developer. A lifelong learner, he holds degrees in engineering and computer science and is working on a third in AI. Alberto was born in Argentina. He enjoys the outdoors, particularly hiking and camping in the mountains of Argentina.


4:00pm4:40pm Wednesday, September 11, 2019
Location: LL21 C/D
Stacy Ashworth (SelectData), Alberto Andreotti (John Snow Labs)
Much business data still exists as challenging scanned or snapped documents. Stacy Ashworth and Alberto Andreotti explore a real-world case of reading, understanding, classifying, and acting on facts extracted from such image files using state-of-the-art, open source, deep learning-based optical character recognition (OCR), natural language processing (NLP), and forecasting libraries at scale. Read more.
  • Intel AI
  • O'Reilly
  • Amazon Web Services
  • IBM Watson
  • Dataiku
  • Dell Technologies
  • Intuit
  • Gamalon
  • Hewlett Packard Enterprise
  • MapR Technologies
  • Sisu Data
  • Intuit

Contact us

For conference registration information and customer service

For more information on community discounts and trade opportunities with O’Reilly conferences

Become a sponsor

For information on exhibiting or sponsoring a conference

For media/analyst press inquires