Semiparametric inference for the scale-mixture of normal partial linear regression model with censored data

J Appl Stat. 2021 May 25;49(12):3022-3043. doi: 10.1080/02664763.2021.1931821. eCollection 2022.

Abstract

In the censored data exploration, the classical linear regression model which assumes normally distributed random errors is perhaps one of the commonly used frameworks. However, practical studies have often criticized the classical linear regression model because of its sensitivity to departure from the normality and partial nonlinearity. This paper proposes to solve these potential issues simultaneously in the context of the partial linear regression model by assuming that the random errors follow a scale-mixture of normal (SMN) family of distributions. The postulated method allows us to model data with great flexibility, accommodating heavy tails and outliers. By implementing the B-spline approximation and using the convenient hierarchical representation of the SMN distributions, a computationally analytical EM-type algorithm is developed for obtaining maximum likelihood (ML) parameter estimates. Various simulation studies are conducted to investigate the finite sample properties, as well as the robustness of the model in dealing with the heavy tails distributed datasets. Real-world data examples are finally analyzed for illustrating the usefulness of the proposed methodology.

Keywords: B-spline; EM-type algorithm; interval-censored data; scale-mixture of normal family of distributions; semiparametric modeling.

Grants and funding

This work is based upon research supported by the South Africa National Research Foundation and South Africa Medical Research Council (South Africa DST-NRF-SAMRC SARChI Research Chair in Biostatistics, Grant number 114613, and STATOMET), as well as by the National Research Foundation of South Africa [grant Number 127727]. Opinions expressed and conclusions arrived at are those of the author and are not necessarily to be attributed to the NRF.