Multivariate semiparametric control charts for mixed-type data

Stat Methods Med Res. 2023 Apr;32(4):671-690. doi: 10.1177/09622802221142528. Epub 2023 Feb 14.

Abstract

A useful tool that has gained popularity in the Quality Control area is the control chart which monitors a process over time, identifies potential changes, understands variations, and eventually improves the quality and performance of the process. This article introduces a new class of multivariate semiparametric control charts for monitoring multivariate mixed-type data, which comprise both continuous and discrete random variables (rvs). Our methodology leverages ideas from clustering and Statistical Process Control to develop control charts for MIxed-type data. We propose four control chart schemes based on modified versions of the KAy-means for MIxed LArge KAMILA data clustering algorithm, where we assume that the two existing clusters represent the reference and the test sample. The charts are semiparametric, the continuous rvs follow a distribution that belongs in the class of elliptical distributions. Categorical scale rvs follow a multinomial distribution. We present the algorithmic procedures and study the characteristics of the new control charts. The performance of the proposed schemes is evaluated on the basis of the False Alarm Rate and in-control Average Run Length. Finally, we demonstrate the effectiveness and applicability of our proposed methods utilizing real-world data.

Keywords: Artificial intelligence; KAMILA algorithm; average run length; clustering; false alarm rate; kernel density estimation.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*