The Evolution of Data Labeling


Data labeling, a job once outsourced by grad students to hungry undergrads paid in pizza, has grown into a $1B+ industry. In this talk I will trace the evolution of data labeling from the humble bounding box around a cat to software platforms and global operational teams that clean, sort, label, and report on labeled data. My goal is to shed light on today's data labeling challenges, both for ML/AI companies and for data labeling companies. Finally, I hope to leave you with guideposts and bread crumbs that lead to a better path to success in your data curation, creation, and operations endeavors.


For the last 4+ years, Soo has worked with computer vision engineers, machine learning engineers, and research scientists across industries to create training datasets. As a Solutions Architect at iMerit, she helps our clients by connecting the dots between the technical details of tooling, designing annotation workflows, and integrating a remote data labeling team for execution. Previously, Soo served as the Data Operations Manager at a geospatial analytics startup, where she built and scaled a Data Operations team from the ground up, leading a team of 10 analysts.

Open Data Science




One Broadway
Cambridge, MA 02142
