Séminaire HOPE / BxSE
Pedro Torres
LSE
Quantile Adjustment: Bias reduction in income imputation when external covariates matter
Abstract:
Empirical research often relies on combining information from multiple datasets when key variables are not jointly observed. A widely used solution is the two-sample two-stage (TSTS) procedure, which imputes missing variables from a donor dataset and then estimates relationships using the imputed values. While TSTS enables otherwise infeasible analyses, it introduces two sources of bias: variance bias, due to reduced variability in imputed variables, and projection bias, stemming from Berkson measurement error. These biases affect estimands differently depending on whether the imputed variable is used as a regressor or as an outcome. Existing correction methods typically address only one source of bias—either restoring variance through stochastic imputation or correcting projection bias via rescaling—leaving the other unresolved. This paper proposes a novel quantile-based imputation approach that simultaneously targets both biases by adjusting first-stage predictions at the quantile level. The method fully restores the variance of the imputed variable and partially recovers the covariance lost during imputation, thereby improving the consistency of second-stage estimates.
About the author: Pedro holds a BSc in Economics from the Universidad Iberoamericana and an MSc in Applied Social Data Science from the London School of Economics. His PhD focuses on inequalities, with a particular emphasis on the intergenerational transmission of inequalities and statistical methods for its analysis.
His current research focuses on methodological aspects of analysing the intergenerational transmission of inequalities, particularly through the use of machine learning. In addition, he has focused on the analysis of inequalities and their impact on the Mexican population.
