Misspecification of a binary dependent variable in the logistic model controlling for the repeated longitudinal measures

J Appl Stat. 2021 Oct 4;50(1):155-169. doi: 10.1080/02664763.2021.1982877. eCollection 2023.

Abstract

Many medical applications are interested to know the disease status. The disease status can be related to multiple serial measurements. Nevertheless, owing to various reasons, the binary outcome can be measured incorrectly. The estimators derived from the misspecified outcome can be biased. This paper derives the complete data likelihood function to incorporate both the multiple serial measurements and the misspecified outcome. Owing to the latent variables, EM algorithm is used to derive the maximum-likelihood estimators. Monte Carlo simulations are conducted to compare the impact of misspecification on the estimates. A retrospective data for the recurrence of atrial fibrillation is used to illustrate the usage of the proposed model.

Keywords: 62P10; Atrial fibrillation; EM algorithm; joint likelihood function; logistic regression; misspecification; random effect model.

Grants and funding

This research was partially supported in part by the Ministry of Science and Technology in Taiwan [grant numberMOST 106-2118-M-305-006-MY2].