|
American Statistical Association
|
We consider estimating the mean of a variable Y subject to missing values, with the aid of a set of fully-observed predictors X, assuming the data are missing at random. We compare a number of estimators that combine information from a model relating Y to X (the prediction model) and a model relating the propensity to respond to X (the propensity model). These methods all have a double robustness (DR) property, namely, they yield a consistent estimate if either the mean structure of the prediction model or the propensity model is correctly specified. One method is to include the inverse of the propensity score as a linear term in the imputation model (Firth and Bennett, 1998; Scharfstein, Rotnitzky and Robins, 1999; Bang and Robins, 2005). Another method is to calibrate the predictions from a parametric model by adding the mean of the weighted residuals (Robins, Rotnitzky and Zhao, 1994; Scharfstein, Rotnitzky and Robins, 1999). The Penalized Spline Propensity Prediction (PSPP) method achieves double robustness by including the propensity score in the prediction model nonparametrically (Little and An, 2004; Zhang and Little, 2005). All these methods are consistent and have good asymptotic properties, but here we study their efficiency and confidence coverage over a range of sample sizes. Simulations tend to favor the PSPP method, suggesting in particular that calibration does not improve the inference when the relationship between Y and the estimated propensity is modeled using a flexible spline model. Extensions to inferences other than the mean are also discussed.
| Date: | Friday, December 14, 2007 |
|---|---|
| Time: | 11:00 A.M. - 12:00 P.M. |
| Location: |
Memorial Sloan-Kettering Cancer Center
Department of Epidemiology and Biostatistics 307 East 63rd Street (between First and Second Avenues) 3rd Floor Conference Room New York, New York Note: To gain access to the building, please follow the directions by the telephone in the foyer. |