Research
Presented at the BITSS Annual Meeting in March 2024: Video
Under Review
How many experimental studies would have come to different conclusions had they been run on larger samples? I show how to estimate the expected number of statistically significant results that a set of experiments would have reported had their sample sizes all been counterfactually increased by a chosen factor. The deconvolution estimator is consistent and asymptotically normal. Unlike existing methods, my approach requires no assumptions about the distribution of true treatment effects of the interventions being studied other than continuity. This method includes an adjustment for publication bias in the reported t-scores. An application to randomized controlled trials (RCTs) published in top economics journals finds that doubling every experiment's sample size would only increase the power of two-sided t-tests by 7.2 percentage points on average. I argue that this effect is small by showing that it is comparable to the effect for systematic replication projects in laboratory psychology where previous studies enabled accurate power calculations ex ante. These effects are both smaller than for non-RCTs. This comparison suggests that RCTs are on average relatively insensitive to sample size increases. The policy implication is that grant givers should generally fund more experiments rather than fewer, larger ones. Submission, Transparency R package, Arxiv.
Presented at: CEPR Development Economics Annual Symposium, Urban Economics Association Annual Meeting, and Microeconometrics Class of 2024 Conference in September 2024
Under Review
We study the problem of estimating the average causal effect of treating every member of a population, as opposed to none, using an experiment that treats only some. We consider settings where spillovers have global support and decay slowly with (a generalized notion of) distance. We derive the minimax rate over both estimators and designs, and show that it increases with the spatial rate of spillover decay. Estimators based on OLS regressions like those used to analyze recent large-scale experiments are consistent (though only after de-weighting), achieve the minimax rate when the DGP is linear, and converge faster than IPW-based alternatives when treatment clusters are small, providing one justification for OLS's ubiquity. When the DGP is nonlinear they remain consistent but converge slowly. We further address inference and bandwidth selection. Applied to the cash transfer experiment studied by Egger et al. (2022) these methods yield a 20% larger estimated effect on consumption. Arxiv
Social Effects, Spillovers, and Scale-up of Teacher Training in Uganda: an RCT (with Vesall Nourani, Moustafa El-Kashlan, and Sara Tamayo)
While nearly half of Ugandan schoolchildren enter secondary school, fewer than 10% complete it. Low teaching quality may be a factor. We study the effects and spillovers of training secondary school teachers in rural Uganda with an RCT. Teachers were randomly assigned to an innovative training program run by Kimanya-Ngeyo in November 2021 and training is ongoing in waves. Our RCT design allows us to study teacher-to-teacher spillovers over time by randomly assigning half of treated schools to treat teachers in "cliques", where treated teachers know each other well vs. the other half of treated schools who were assigned to treat teachers in "anti-cliques", where treated teachers do not know each other well. AEA Registration here.