Downloads

WP202025-Regression-with-an-imputed-dependent-variable.pdf
PDF | 544.37 KB
Researchers are often interested in the relationship between two variables, with no single data set containing both. A common strategy is to use proxies for the dependent variable that are common to two surveys to impute the dependent variable into the data set containing the independent variable. We show that commonly employed regression or matching-based imputation procedures lead to inconsistent estimates. We offer an easily-implemented correction and correct asymptotic standard errors. We illustrate these with empirical examples using data from the US Consumer Expenditure Survey (CE) and the Panel Study of Income Dynamics (PSID).
Authors

Research Fellow University of Michigan
Tom is a Research Fellow at IFS, a Research Professor for the Institute for Social Research at the University of Michigan.

Deputy Research Director
Peter joined in 2009. He has published several papers on the microeconomics of household spending and labour supply decisions over the life-cycle.

PhD Student University of Essex
Working Paper details
- DOI
- 10.1920/wp.ifs.2020.2520
- Publisher
- The IFS
Suggested citation
T, Crossley and P, Levell and S, Poupakis. (2020). Regression with an imputed dependent variable. London: The IFS. Available at: https://ifs.org.uk/publications/regression-imputed-dependent-variable (accessed: 19 May 2025).
More from IFS
Understand this issue

Gender norms, violence and adolescent girls’ trajectories: Evidence from India
24 October 2022

Why is the government reforming health-related benefits?
We discuss the government's welfare reforms aimed at helping sick and disabled people into work, and what the changes mean for health-related benefits
14 May 2025

Drastic times need drastic action: breaking the 50-year tax taboo
Rachel Reeves should consider increasing the basic rate, just as Denis Healey did in 1975
14 April 2025
Policy analysis

Which places have the highest standard of living?
Measuring living standards using average household spending gives a starkly different picture of regional inequalities than using average income.
11 April 2025

ABC of SV: Limited Information Likelihood Inference in Stochastic Volatility Jump-Diffusion Models
We develop novel methods for estimation and filtering of continuous-time models with stochastic volatility and jumps using so-called Approximate Bayesian Compu- tation which build likelihoods based on limited information.
12 August 2014

Assessing the economic benefits of education: reconciling microeconomic and macroeconomic approaches
This CAYT report discusses the strengths and limitations of several approaches to assessing the effect of education on productivity.
14 March 2013
Academic research

Estimating intra-household sharing from time-use data
Estimating intra-household sharing is crucial to understanding overall inequality. However, expenditure data is almost always at the household level.
2 May 2025

Focal pricing constraints and pass-through of input cost changes
I show that the adoption and extent of focal pricing practices in an industry in general do not lower average pass-through of input cost changes.
2 May 2025

Using tax records to correct for under-representation of top income sources in surveys
We show that the survey correction method of Blanchet, Flores and Morgan (2022) can fail to correct its structure by components.
6 May 2025