EconML
ALICE (Automated Learning and Intelligence for Causation and Economics) is a Microsoft Research project aimed at applying Artificial Intelligence concepts to economic decision making. One of its goals is to build a toolkit that combines state-of-the-art machine learning techniques with econometrics in order to bring automation to complex causal inference problems. To date, the ALICE Python SDK (econml) implements orthogonal machine learning algorithms such as the double machine learning work of Chernozhukov et al. This toolkit is designed to measure the causal effect of some treatment variable(s) t on an outcome variable y, controlling for a set of features x.
A Python package from Microsoft Research for estimating causal effects from observational data, measuring how a change affects different groups of people without needing a randomized trial.
EconML is a Python package from Microsoft Research that helps analysts and researchers answer a specific kind of question: if we change something, what effect does that change have, and does that effect differ for different groups of people? This is the problem of causal inference, and it sits at the heart of decisions like whether a drug works better for certain patients, whether a pricing change grows or shrinks revenue, or whether a policy program actually improves the outcomes it targets.
The core challenge EconML addresses is that most real-world data comes from observation rather than controlled experiments. You cannot always run a randomized trial, so you have to use statistical methods to estimate what would have happened under different conditions. EconML combines ideas from economics and statistics with modern machine learning to do this more accurately and at larger scale than older methods allowed. It implements several research-backed techniques, including double machine learning, which was developed to isolate causal signals from noisy datasets with many variables.
The package is designed around a common structure: you specify a treatment variable (the thing that was changed or that you want to understand), an outcome variable (what you are measuring), and a set of background features about each observation. EconML then estimates not just an average effect but how that effect varies across different subgroups, which is what the word "heterogeneous" refers to in its full name. It also produces confidence intervals so you can see how certain the estimates are.
EconML is installable via pip and works alongside the standard Python data science stack. The project includes Jupyter notebooks with worked examples, and the documentation site covers the main estimation methods, policy learning tools, and guidance on selecting the right approach for a given problem. It is actively maintained and has been receiving regular releases since 2019.
The full README is longer than what was shown.
Where it fits
- Estimate whether a pricing change increases or decreases revenue and how the effect varies by customer segment.
- Analyze observational health data to measure how well a treatment works across different patient subgroups.
- Apply double machine learning to isolate a causal signal from a noisy dataset with many variables.
- Build personalized policy recommendations based on how a treatment affects each individual differently.