A new estimation with minimum trace of asymptotic covariance matrix for incomplete longitudinal data with a surrogate process

Baojiang Chen, Jing Qin

Research output: Contribution to journalArticle


Missing data is a very common problem in medical and social studies, especially when data are collected longitudinally. It is a challenging problem to utilize observed data effectively. Many papers on missing data problems can be found in statistical literature. It is well known that the inverse weighted estimation is neither efficient nor robust. On the other hand, the doubly robust (DR) method can improve the efficiency and robustness. As is known, the DR estimation requires a missing data model (i.e., a model for the probability that data are observed) and a working regression model (i.e., a model for the outcome variable given covariates and surrogate variables). Because the DR estimating function has mean zero for any parameters in the working regression model when the missing data model is correctly specified, in this paper, we derive a formula for the estimator of the parameters of the working regression model that yields the optimally efficient estimator of the marginal mean model (the parameters of interest) when the missing data model is correctly specified. Furthermore, the proposed method also inherits the DR property. Simulation studies demonstrate the greater efficiency of the proposed method compared with the standard DR method. A longitudinal dementia data set is used for illustration.

Original languageEnglish (US)
Pages (from-to)4763-4780
Number of pages18
JournalStatistics in Medicine
Issue number27
Publication statusPublished - Jan 1 2013



  • Longitudinal data
  • Missing data
  • Optimal
  • Surrogate outcome

ASJC Scopus subject areas

  • Epidemiology
  • Statistics and Probability

Cite this