Causal Inference under Low-Order Interference

Jun 17, 2024·
Matthew Eichhorn
Matthew Eichhorn
,
Samir Khan
,
Mayleen Cortez-Rodriguez
,
Johan Ugander
,
Christina Lee Yu
· 0 min read
Abstract
In many settings, we are interested in quantifying how a population will be affected by some global treatment: how will a vaccination campaign affect the rate of infection; how will the adoption of a new recommendation algorithm influence engagement on an online platform; will increased revenues driven by an advertising campaign be sufficient to offset its cost? To answer these questions, practitioners carry out randomized experiments and use the results to estimate a causal quantity known as the total treatment effect (TTE). In the presence of network interference, where the treatment of some individuals can spill over and affect the outcomes of their untreated peers, the classical Horvitz-Thompson TTE estimator has prohibitively high variance. To address this, existing literature largely falls into one of two disjoint approaches: imposing structural assumptions on individuals’ potential outcomes which can be leveraged by more sophisticated estimators, or imposing structural assumptions on the interference network which can be leveraged by more sophisticated experimental designs. Our work is one of the first to consider these approaches in tandem. I’ll posit a low-order interactions structure on the potential outcomes model that stipulates that individuals’ outcomes can be decomposed into additive effects from small treated subsets of their social neighbors. This assumption can be leveraged to design a TTE estimator, the Pseudoinverse (PI) estimator, that can be adapted to data collected under an arbitrary experimental design. Specializing to Bernoulli randomized designs, I’ll obtain a closed form for this estimator and demonstrate both theoretically and empirically that this estimator outperforms other existing approaches. Next, for an arbitrary experimental design, I’ll derive novel bias and variance bounds for the PI estimator in terms of properties of that design. Under graph cluster randomized (GCR) designs, these bounds illustrate a “best of both worlds” trade-off of the PI estimator: its variance scales as the smaller of the variance obtained from a low-order assumption and the variance obtained from cluster randomization, showing that combining these variance reduction strategies is preferable to using either individually. Finally, I’ll show the bias and variance bounds provide a principled approach for selecting a clustering to use for an experiment. Across a range of graphs and clustering algorithms, this method consistently selects clusterings that perform well on a range of potential outcomes models, suggesting that our bounds are useful to practitioners.
Date
Jun 17, 2024 1:30 PM
Event
Location

Québec City Convention Center

1000 Bd René-Lévesque E, Québec, QC G1R 5T8