The biggest problem in data science is … the data itself.
It’s messy, it’s inconsistent, it arrives from myriad
sources, and it sometimes changes without warning. Such
hurdles distract you from your intended purpose: getting
meaningful insight out of your data.
Q Ethan McCallum, consultant and author of Parallel R
(O’Reilly), will walk through the various forms of bad data
and explore common pitfalls that can derail your research
efforts. Most of all, he’ll explain ways to handle bad data
so you can get back to work.
Q Ethan McCallum is a consultant, writer, and technology enthusiast, though perhaps not in that order. Most recently put the finishing touches on Parallel R (O’Reilly).
For information on exhibition and sponsorship opportunities at the conference, contact Susan Stewart at firstname.lastname@example.org.
For information on trade opportunities with O'Reilly conferences contact Kathy Yu at mediapartners
For media-related inquiries, contact Maureen Jennings at email@example.com
View a complete list of Strata contacts