Machine Learning Estimation of Heterogeneous Treatment Effects: the Microsoft EconML Library


One of the biggest promises of machine learning is the automation of decision making in a multitude of application domains. A core problem that arises in most data-driven personalized decision scenarios is the estimation of heterogeneous treatment effects: what is the effect of an intervention on an outcome of interest, as a function of a set of observable characteristics of the treated sample? For instance, this problem arises in personalized pricing, where the goal is to estimate the effect of a price discount on demand as a function of consumer characteristics. Similarly, it arises in medical trials, where the goal is to estimate the effect of a drug treatment on a patient's clinical response as a function of patient characteristics. In many such settings we have an abundance of observational data, in which the intervention was chosen via some unknown policy, and the ability to run controlled A/B tests is limited.

We will present recent research advances in machine-learning-based estimation of heterogeneous treatment effects. These methods offer great flexibility in modeling effect heterogeneity (via techniques such as random forests, boosting, lasso, and neural nets) while leveraging techniques from causal inference and econometrics to preserve the causal interpretation of the learned model; many also offer statistical validity through the construction of valid confidence intervals. We will also present and demo the Microsoft EconML library, an open source package developed by the ALICE project of Microsoft Research New England, which implements several recent estimation algorithms behind a common Python API.
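
To make the idea concrete, here is a minimal sketch of one estimator from this family, residual-on-residual ("double machine learning") estimation with cross-fitting. It is written from scratch in NumPy and is not EconML's API: the function names, the synthetic data, and the choice of plain linear regressions as nuisance models (in place of random forests, boosting, or similar) are all assumptions made for illustration.

```python
import numpy as np

def fit_linear(features, target):
    """Ordinary least squares with an intercept; returns the coefficients."""
    design = np.column_stack([np.ones(len(features)), features])
    coef, *_ = np.linalg.lstsq(design, target, rcond=None)
    return coef

def predict_linear(coef, features):
    design = np.column_stack([np.ones(len(features)), features])
    return design @ coef

def dml_linear_cate(Y, T, X, W, n_folds=2, seed=0):
    """Estimate theta(x) = intercept + x @ slopes (hypothetical helper).

    Stage 1: predict Y and T from (X, W) on held-out folds, keep residuals.
    Stage 2: regress the Y residual on the T residual interacted with [1, X].
    """
    n = len(Y)
    folds = np.random.default_rng(seed).permutation(n) % n_folds
    controls = np.column_stack([X, W])
    y_res, t_res = np.empty(n), np.empty(n)
    for k in range(n_folds):
        train, test = folds != k, folds == k
        y_hat = predict_linear(fit_linear(controls[train], Y[train]), controls[test])
        t_hat = predict_linear(fit_linear(controls[train], T[train]), controls[test])
        y_res[test] = Y[test] - y_hat
        t_res[test] = T[test] - t_hat
    interactions = np.column_stack([np.ones(n), X]) * t_res[:, None]
    theta, *_ = np.linalg.lstsq(interactions, y_res, rcond=None)
    return theta[0], theta[1:]

# Synthetic observational data (an assumption for this sketch):
# the true effect is theta(x) = 1 + 2x, and W drives both T and Y.
rng = np.random.default_rng(42)
n = 5000
X = rng.normal(size=(n, 1))             # characteristic the effect varies with
W = rng.normal(size=(n, 3))             # confounders
T = W @ np.array([0.5, -0.5, 0.0]) + rng.normal(size=n)
Y = (1.0 + 2.0 * X[:, 0]) * T + W @ np.array([2.0, -2.0, 1.0]) + rng.normal(size=n)

# Naive regression of Y on T alone ignores the confounders.
naive_slope = fit_linear(T[:, None], Y)[1]

intercept, slopes = dml_linear_cate(Y, T, X, W)
print("naive effect estimate:", naive_slope)
print("estimated theta(x) = %.2f + %.2f * x" % (intercept, slopes[0]))
```

In this synthetic setup the naive slope is pulled away from the average effect of 1 by the confounders, while the residualized estimate should land close to the true intercept and slope. A library such as EconML packages this pattern with flexible machine learning nuisance models and confidence intervals; consult its documentation for the actual estimator classes and signatures.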


Keith Battocchi is a software engineer at Microsoft Research New England, where he is currently working on software for applying machine learning algorithms to economic problems. Over the past decade, he has worked in a variety of areas including programming language research, building query classifiers for Bing, and building a system to assess television advertising effectiveness.

Open Data Science




