2017 journal article

A SEMIPARAMETRIC INFERENCE TO REGRESSION ANALYSIS WITH MISSING COVARIATES IN SURVEY DATA

STATISTICA SINICA, 27(1), 261–285.

By: S. Yang & J. Kim

author keywords: Asymptotic linearization representation; fractional imputation; nonparametric maximum likelihood estimator; nonresponse
UN Sustainable Development Goal Categories
Source: Web Of Science
Added: August 6, 2018

Parameter estimation in parametric regression models with missing covariates is considered under a survey sampling setup. Under missingness at random, a semiparametric maximum likelihood approach is proposed which requires no parametric specification of the marginal covariate distribution. By drawing from the von Mises calculus and V-Statistics theory, we obtain an asymptotic linear representation of the semiparametric maximum likelihood estimator (SMLE) of the regression parameters, which allows for a consistent estimator of asymptotic variance. An EM algorithm for computation is then developed to implement the proposed method using fractional imputation. Simulation results suggest that the SMLE method is robust, whereas the fully parametric method is subject to severe bias under model misspecification. A rangeland study from the National Resources Inventory (NRI) is used to illustrate the practical use of the proposed methodology.