Drift Detection in Structured and Unstructured Data

Abstract: 

Machine learning systems in production are subject to performance degradations due to many external factors and it is vital to actively monitor system stability and integrity. A common source of model degradation is due the inherent non-stationarity of the real world environment, commonly referred to as data drift. In this presentation, I will describe how to reliably quantify data drift in a variety of different data paradigms including Tabular data, Computer Vision data, and NLP data. Attendees of this talk will come away with a conceptual toolkit for thinking about data stability monitoring in their own models, with example use cases in common settings as well as in more challenging regimes.

Bio: 

Keegan is VP of Machine Learning at ArthurAI and is also an Adjunct Assistant Professor at Georgetown University. Previously, he was the Director of Machine Learning Research at Capital One and has also held roles at cyberdefense firms. He is a Co-Founder of the Conference on Applied Learning for Information Security (CAMLIS) and holds a PhD in Neuroscience from the University of Texas.

Open Data Science

 

 

 

Open Data Science
One Broadway
Cambridge, MA 02142
info@odsc.com

Privacy Settings
We use cookies to enhance your experience while using our website. If you are using our Services via a browser you can restrict, block or remove cookies through your web browser settings. We also use content and scripts from third parties that may use tracking technologies. You can selectively provide your consent below to allow such third party embeds. For complete information about the cookies we use, data we collect and how we process them, please check our Privacy Policy
Youtube
Consent to display content from Youtube
Vimeo
Consent to display content from Vimeo
Google Maps
Consent to display content from Google