Predictive Bounding Boxes: A Machine Learning Powered Image Annotation Tool For Creating High Quality Training Data For Object Detection
Predictive Bounding Boxes: A Machine Learning Powered Image Annotation Tool For Creating High Quality Training Data For Object Detection


The state of the art deep learning models for object detection require large volume of high quality training data to perform well. While the advances of data collection technology have enabled the acquisition of a massive volume of data, labeling the data remains an expensive and time-consuming task
In this study, we propose a new interface for bounding box annotation which uses machine learning to accelerate the annotation and increase the quality of training data, lowering cost and time required to complete the task. In particular we study challenging tasks such as drawing bounding boxes for large images that contains many objects at different scales such as drone and satellite imagery. We study and measure the user interactions with the tool and their performance through launching multiple jobs on a crowdsourced based data labeling platform to understand the best design of the machine learning powered interface. Results demonstrate employing machine learning with the proposed design dramatically increases the accuracy of bounding boxes adjusted by humans.


Qazaleh Mirsharif is an applied researcher currently employed as machine learning scientist at Figure Eight, applying artificial intelligence, in particular computer vision, to a broad range of real- world problems. She earned her PhD in computer science from University of Houston, Texas while focusing on building computer vision models for studying and evaluating the development of visual attention in infants. She received her MSc in artificial intelligence focusing on processing retinal images to help with early detection of eye diseases, in particular diabetic retinopathy. She currently works on a large variety of projects applying computer vision techniques to digitize on- street parking rules, detect and classify objects in drone and satellite imagery, and rate gif images, just to name a few.

Open Data Science




Open Data Science
One Broadway
Cambridge, MA 02142

Privacy Settings
We use cookies to enhance your experience while using our website. If you are using our Services via a browser you can restrict, block or remove cookies through your web browser settings. We also use content and scripts from third parties that may use tracking technologies. You can selectively provide your consent below to allow such third party embeds. For complete information about the cookies we use, data we collect and how we process them, please check our Privacy Policy
Consent to display content from - Youtube
Consent to display content from - Vimeo
Google Maps
Consent to display content from - Google