On the structure of IV estimands

doi:10.1016/j.jeconom.2018.12.017

Journal of Econometrics

Volume 211, Issue 1, July 2019, Pages 294-307

https://doi.org/10.1016/j.jeconom.2018.12.017 Get rights and content

Abstract

When the overidentifying restrictions of the constant-effect linear instrumental variables model fail, common IV estimators converge to different probability limits. I characterize the estimands of two stage least squares, two step GMM, and limited information maximum likelihood as functions of the single-instrument estimands from the just-identified IV regressions which consider each instrument separately. The limited information maximum likelihood estimand is found to be discontinuous on a set of dimension equal to the number of instruments minus one, and to equal the full parameter space on a set of dimension equal to the number of instruments minus two.

Introduction

A wide variety of estimators have been proposed for the constant-effect linear instrumental variables (IV) model, all of which converge to the true parameter value when the model is correctly specified and an instrument relevance condition holds. When the IV model is misspecified, on the other hand, common IV estimators typically converge to different probability limits.

The goal of this paper is to characterize the behavior of commonly-used estimators under model misspecification in linear IV models with a single endogenous regressor. In particular, the paper considers two-stage least squares (TSLS), two-step generalized method of moments (TSGMM), limited information maximum likelihood (LIML), and continuous updating generalized method of moments (CUGMM). The probability limits (estimands) of TSLS, TSGMM, and LIML are characterized as functions of the estimands in the just-identified models that use one instrument at a time, holding other features of the data generating process fixed. More limited results are derived for the CUGMM estimand.

As is well understood, the TSLS estimand is linear in the single-instrument estimands with linear combination weights summing to one. By contrast, the TSGMM estimand is generally nonlinear, though continuous, in the single-instrument estimands. More surprisingly the LIML estimand is highly nonlinear in the single-instrument estimands and is discontinuous on a set of dimension equal to the number of instruments minus one. If the controls include a constant, I show that the LIML estimand is discontinuous if and only if the vector of single-instrument estimands is such that (a) the TSLS estimand coincides with the ordinary least squares (OLS) estimand and (b) the $R^{2}$ from the reduced-form regression of the outcome on the instruments is greater than the $R^{2}$ from the first-stage regression of the endogenous regressor on the instruments. As the TSLS estimand approaches the OLS estimand from above the LIML estimand diverges to positive infinity, while as the TSLS estimand approaches the OLS estimand from below the LIML estimand diverges to negative infinity. Moreover, when the TSLS and OLS estimands coincide and the reduced-form $R^{2}$ is equal to the first-stage $R^{2}$ , the population LIML objective function does not depend on the structural parameter value considered, so the minimizer is the full parameter space.

Analytical results for the CUGMM estimand are more elusive, but the level sets of this estimand (viewed as a function of the vector of single-instrument estimands) have a structure similar to those of LIML, and I find similar behavior for the LIML and CUGMM estimands in a calibration to data from Yogo (2004).

The approach taken in this paper is distinct from that in the literature on heterogeneous treatment effects. A large literature originating with Imbens and Angrist (1994) characterizes the probability limits of IV estimators as combinations of heterogenous treatment effects under exogeneity and monotonicity assumptions. By contrast my approach, based on single-instrument IV estimands, is agnostic about the source and form of misspecification and so can accommodate heterogeneous treatment effects, invalidity of the instruments, or misspecification of the linear functional form. Further, my results apply directly to IV applications which are difficult to cast into the treatment effects framework, for example Yogo (2004). At the same time, however, my results only relate IV estimands to the single-instrument estimands and other statistical objects, rather than to the causal or structural parameters of interest. Hence, by remaining agnostic about the source of misspecification my approach accommodates models beyond the scope of the heterogeneous treatment effect literature but obtains correspondingly weaker results.

Two papers from the literature on heterogeneous treatment effects of particular relevance to my results are Kolesar (2013) and Mogstad et al. (2018). Kolesar (2013) shows that the LIML estimand can lie outside the convex hull of the individual treatment effects in a heterogeneous treatment effect model. Kolesar’s results do not imply the discontinuity of LIML estimand but do suggest peculiar behavior for this quantity, which my results strongly confirm. Mogstad et al. (2018) derive expressions for a wide variety of estimands in terms of the potential outcomes in the treated and untreated states in a heterogenous treatment effect model with a binary treatment. Their results could be used to link the expressions in the present paper to causal effects in that setting, though further exploration of this possibility is left for future work. Other related work includes Hall and Inoue (2003), who examine the large-sample behavior of GMM estimators under misspecification, and Lee (2017), who proposes an asymptotic variance estimator for TSLS in models with heterogenous treatment effects.

In the next Section I formally introduce the IV model and define the IV estimands. Section 3 then presents analytical results on the structure of IV estimands, while Section 4 illustrates these results in a calibration to data from Yogo (2004). All proofs are given in the Appendix.

Section snippets

The linear IV model and estimands

Suppose we observe a sample of $T$ observations $(Y_{t}, X_{t}, Z_{t})$ drawn from distribution $F_{T}$ , where $Y_{t}$ is an outcome variable, $X_{t}$ is a potentially endogenous regressor, and $Z_{t}$ is a $k \times 1$ vector of instrumental variables. Let us stack these observations into $T \times 1$ vectors $Y$ and $X$ with row $t$ equal to $Y_{t}$ and $X_{t}$ respectively, and a $T \times k$ matrix $Z$ with row $t$ equal to $Z_{t}^{'}$ . Suppose the data obey the linear model $Y = X β + ε,$ where $β$ is the scalar parameter of interest. Conventional IV methods impose two further

The structure of IV estimands

As noted in the previous section, all IV estimands coincide in just-identified models, provided the instrument relevance condition holds. In over-identified models, by contrast, each instrument implies a corresponding IV estimand and the question is how to combine the single-instrument estimands into an overall estimate. The different IV estimators discussed above imply different answers to this question, and the goal of this section is to characterize the behavior of the IV estimands $β_{W}$ as

IV estimands in an example

To illustrate the analytic results above, Fig. 1, Fig. 2, Fig. 3, Fig. 4 plot the contours of the IV estimands $β_{W}$ as functions of single-instrument estimands $b$ in a calibration based on Yogo (2004). Yogo studies the effect of weak instruments on estimation of the elasticity of intertemporal substitution using a linear Euler equation model and data from a number of countries. Here I calibrate all elements of $ψ$ other than $E [Z_{t} Y_{t}]$ to values estimated from the quarterly US data series used by Yogo,

Conclusion

When the over-identifying restrictions of the classical IV model fail, common IV estimators converge to distinct probability limits. Characterizing these limits as a function of the single-instrument IV estimands, I find that the LIML estimand is discontinuous and, further, is sometimes equal to the full parameter space. If the set of controls includes a constant, these issues arise when the OLS and TSLS estimands are equal and the reduced-form $R^{2}$ is weakly larger than the first-stage $R^{2}$ . While

Acknowledgments

I am grateful to Jushan Bai, Anna Mikusheva, Whitney Newey, Christoph Rothe, Miikka Rokkanen, Jim Stock, participants in the Fall 2015 Conference in Honor of Jerry Hausman, and two anonymous referees for helpful comments. Support from the Silverman (1978) Family Career Development Chair at MIT and from the National Science Foundation under grant number 1654234 is gratefully acknowledged.

References (11)

HallA.R. et al.
The large sample behaviour of the generalized method of moments estimator in misspecified models
J. Econometrics
(2003)
HausmanJ.
Handbook of Econometrics, Vol. 1
AngristJ. et al.
Two-stage least squares estimation of average causal effects in models with variable treatment intensity
J. Amer. Statist. Assoc.
(1995)
ChamberlainG.
Decision theory applied to an instrumental variable model
Econometrica
(2007)
ImbensG. et al.
Identificaton and estimation of local average treatment effects
Econometrica
(1994)

There are more references available in the full text version of this article.

Cited by (9)

A doubly corrected robust variance estimator for linear GMM
2022, Journal of Econometrics
Citation Excerpt :
Hansen and Lee (2020) provide robust inference theory for iterated GMM. Rotemberg (1983) and Andrews (2019) characterize the estimands of the linear GMM under misspecification. Andrews et al. (2017) propose to measure the effect of model misspecification on the sensitivity of parameter estimates for the minimum distance estimators.
We propose a new finite sample corrected variance estimator for the linear generalized method of moments (GMM) including the one-step, two-step, and iterated estimators. Our formula also corrects the over-identification bias in variance estimation on top of the commonly used finite sample correction of Windmeijer (2005), which corrects the bias from estimating the efficient weight matrix, so is doubly corrected. An important feature of the proposed double correction is that it automatically provides robustness to misspecification of the moment condition. In contrast, the conventional variance estimator and the Windmeijer correction are inconsistent under misspecification. That is, the double correction formula proposed in this paper provides a convenient way to obtain improved inference under correct specification and robustness against misspecification at the same time.
Annals Issue in Honor of Jerry A. Hausman: Editors’ Introduction
2019, Journal of Econometrics
HIGHER-ORDER APPROXIMATION OF IV ESTIMATORS WITH INVALID INSTRUMENTS
2022, Econometric Theory
GMM is Inadmissible Under Weak Identification
2022, arXiv
Inference for Iterated GMM Under Misspecification
2021, Econometrica
Bartik instruments: What, when, why, and how
2020, American Economic Review

View all citing articles on Scopus

View full text

On the structure of IV estimands

Abstract

Introduction

Section snippets

The linear IV model and estimands

The structure of IV estimands

IV estimands in an example

Conclusion

Acknowledgments

J. Econometrics

Two-stage least squares estimation of average causal effects in models with variable treatment intensity

J. Amer. Statist. Assoc.

Decision theory applied to an instrumental variable model

Econometrica

Identificaton and estimation of local average treatment effects

Econometrica