Regression analysis of mixed panel-count data with application to cancer studies

Stat Biosci. 2021 Apr;13(1):178-195. doi: 10.1007/s12561-020-09291-2. Epub 2020 Aug 17.

Abstract

Both panel-count data and panel-binary data are common data types in recurrent event studies. Because of inconsistent questionnaires or missing data during follow-up, mixed data types frequently need to be addressed. A recently proposed semiparametric approach uses a proportional means model to facilitate regression analyses of mixed panel-count and panel-binary data. This method can use all available information regardless of the record type and provides unbiased estimates. However, the large number of nuisance parameters in the nonparametric baseline hazard function makes the estimating procedure very complicated and time-consuming. We approximated the baseline hazard function to simplify the estimating procedure. Simulation studies showed that our method performed similarly to the previous semiparametric likelihood-based method but was much faster. Approximating the baseline hazard not only reduced the computational burden but also made it possible to implement the estimating procedure in standard software such as SAS.
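
To illustrate how a single likelihood can use both record types, the following is a minimal sketch and not the authors' procedure. It assumes events follow a Poisson process whose mean is a baseline function multiplied by exp(beta'z), and it assumes the baseline is approximated by a piecewise-constant rate; the abstract specifies neither choice. The names `baseline_mean`, `log_likelihood`, `knots`, `rates`, and the record layout are hypothetical.

```python
import math

def baseline_mean(t, knots, rates):
    """Cumulative baseline mean at time t for a piecewise-constant rate.

    knots: increasing right endpoints of the intervals (first interval starts at 0).
    rates: assumed constant rate on each interval.
    The mean stays flat beyond the last knot.
    """
    total, left = 0.0, 0.0
    for right, rate in zip(knots, rates):
        if t <= left:
            break
        total += rate * (min(t, right) - left)
        left = right
    return total

def log_likelihood(beta, rates, knots, records):
    """Log-likelihood for mixed panel-count and panel-binary records.

    records: list of (z, t_prev, t_curr, kind, value), where
      kind == "count":  value is the number of events in (t_prev, t_curr]
      kind == "binary": value is 1 if any event occurred in the interval, else 0
    Assumes a Poisson process with mean increment
      (mu0(t_curr) - mu0(t_prev)) * exp(beta'z).
    """
    ll = 0.0
    for z, t_prev, t_curr, kind, value in records:
        lin = sum(b * x for b, x in zip(beta, z))
        increment = baseline_mean(t_curr, knots, rates) - baseline_mean(t_prev, knots, rates)
        m = increment * math.exp(lin)
        if kind == "count":
            # Poisson log-probability of observing `value` events
            ll += value * math.log(m) - m - math.lgamma(value + 1)
        else:
            # P(no event) = exp(-m); P(at least one event) = 1 - exp(-m)
            ll += math.log1p(-math.exp(-m)) if value else -m
    return ll
```

Under these assumptions the nuisance parameters reduce to the handful of values in `rates`, so `beta` and `rates` could be estimated jointly with a general-purpose optimizer applied to the negative of `log_likelihood`.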