Abstract: It starts innocently - a few misspellings here and there, an incorrect date or two, a few missing cells... and suddenly your hair turns grey and your friends consider funding your therapy. Does data cleaning really have to be a nightmare? How can you speed up the process and turn the bulk of your project into fun, rather than just waiting impatiently for the modelling phase?
This talk gathers the most helpful tricks and hacks, with special emphasis on using Machine Learning for cleaning tasks. The advice is domain-agnostic, so the proposed approaches can be applied regardless of the business area.
Bio: Currently Senior (Big) Data Scientist at InPost and Lecturer at Wroclaw University of Economics and Business, previously Head of Data Science at Objectivity, with a background in Mathematical Statistics. For almost 10 years, she has been discovering the potential of data in various business domains, from medical data, through retail, HR, finance, aviation, real estate, logistics, ... She deeply believes in the power of data in every area of life. Article writer, conference speaker and, privately, a passionate dancer and handmade jewellery creator.