PS 813 - Causal Inference
April 29, 2026
\[ \require{cancel} \]
We collapse the vector of assignments \(\mathbf{Z}\) and unit \(i\)’s characteristics \(\theta_i\) to its exposure \(D_i\)
\[D_i = f(\mathbf{Z}, \theta_i)\]
Our new “consistency” assumption under a known exposure mapping
\[Y_i = \sum_{k=1}^K I(D_i = d_k) y_i(d_k)\]
Essentially, once we know the value of the exposure, all assignments that generate that exposure are equivalent
We define causal estimands as differences across different types of exposure mappings
\[\mu(d_k) = \frac{1}{N}\sum_{i=1}^N y_i(d_k), \qquad \tau(d_k, d_l) = \mu(d_k) - \mu(d_l)\]
Examples
Problem:
Aronow and Samii (2017) suggest that we can adjust for the probability of observing a given exposure via IPW
We need to know the propensity of exposure
\[\pi_i(d_k) = \Pr(D_i = d_k)\]
If the design is known, this can be calculated directly
Can apply our usual IPW estimators using these known exposure probabilities
Our Horvitz-Thompson estimator is just
\[\widehat{y^T_{HT}}(d_k) = \sum_{i=1}^N I(D_i = d_k)\, \dfrac{Y_i}{\pi_i(d_k)}\]
Can also use Hajek estimator (normalize the weights to \(1\))
Need a positivity assumption on the exposures
Aronow and Samii (2017) propose a conservative variance estimator (conservative for similar reasons to the Neyman estimator)
\[\begin{align*}\widehat{\text{Var}}\big[\widehat{y^T_{HT}}(d_k)\big] =& \sum_{i \in U} I(D_i = d_k)\, \big[1 - \pi_i(d_k)\big]\, \bigg[\frac{Y_i}{\pi_i(d_k)}\bigg]^2\; \\ &+\; \sum_{i \in U}\sum_{j \in U \setminus i} I(D_i = d_k)\, I(D_j = d_k)\, \frac{\pi_{ij}(d_k) - \pi_i(d_k)\pi_j(d_k)}{\pi_{ij}(d_k)}\, \frac{Y_i\, Y_j}{\pi_i(d_k)\pi_j(d_k)}\end{align*}\]
Note the “off-diagonal” term - we need to account for the fact that the joint propensity of receiving exposure \(d_k\) might not be the same as the product ofthe marginals
In economics, a very popular style of design is the shift-share design
Consider the linear model
\[Y_i = \beta z_i + \epsilon_i\]
We’re interested in the average effect of \(z_i\) (\(\beta\))
Example (Autor et. al. (2013))
In a shift-share design, \(z_i\) has a particular known structure
\[z_i = \sum_{k=1}^K s_{ik} \times g_k\]
\(g = \{g_1, g_2, \dotsc, g_K\}\) are the shifts
\(s_i = \{s_{i1}, s_{i2}, \dotsc, s_{iK}\}\) are the unit-specific shares
Example (Autor et. al. (2013))
There are two approaches to identification in the shift-share setting
Random shares (Goldsmith-Pinkham et. al., 2020)
Random shocks (Borusyak, Hull, Jaravel, 2022)
But payments are non-random! Some places were “hurt” more by the trade war. Some places have more farmers, etc…
In the 2019 Market Facilitation Program (MFP), counties were assigned a single per-acre payment rate (our instrument for overall county payments)
The instrument had a formula structure that looked a lot like a shift-share!
\[(\text{Rate})_i^{\text{\$/acre}} = \frac{\sum_{c=1}^C (\text{Acreage})_{i,c}^{\text{acre}} \times (\text{Yield})_{i,c}^{\text{unit/acre}} \times \text{(Rate)}_{c}^{\text{\$/unit}}}{\sum_{c=1}^C \text{(Acreage)}_{i,c}^{\text{acre}}}\]
Outcome: Difference between 2020 and 2016 Trump county vote share (percentage points)
Treatment: 2019 MFP Payments
Instrument: Re-centered payment-rate instrument
Design-based randomization inference
Our test statistic - the sample covariance of \(Y\) and the de-meaned instrument (Borusyak and Hull, 2023)
PS 813 - University of Wisconsin - Madison