Weighted Random Forests to Improve Arrhythmia Classification

Krzysztof Gajowniczek; Iga Grzegorczyk; Tomasz Ząbkowski; Chandrajit Bajaj

doi:10.3390/electronics9010099

Electronics (Basel). 2020 Jan;9(1):99. doi: 10.3390/electronics9010099. Epub 2020 Jan 3.

Authors

Krzysztof Gajowniczek¹, Iga Grzegorczyk², Tomasz Ząbkowski¹, Chandrajit Bajaj³

Affiliations

¹ Department of Artificial Intelligence, Institute of Information Technology, Warsaw University of Life Sciences - SGGW, 02-776 Warsaw, Poland.
² Department of Physics of Complex Systems, Faculty of Physics, Warsaw University of Technology, 00-662 Warsaw, Poland.
³ Department of Computer Science, Institute for Computational Engineering and Sciences, University of Texas at Austin, Austin, TX 78712.

Abstract

Constructing an ensemble model means combining many diverse base predictive learners. This raises the questions of how to weight each model and how to tune the parameters of the weighting process. The most straightforward approach is simply to average the base models. However, numerous studies have shown that a weighted ensemble can give better predictions than a simple average of models. The main goals of this article are to propose a new weighting algorithm applicable to each tree in the Random Forest model and to examine comprehensively how its parameters should be tuned. The approach is motivated by its flexibility, good performance, stability, and resistance to overfitting. The proposed scheme is evaluated on the PhysioNet/Computing in Cardiology Challenge 2015 data set, which consists of signals (electrocardiograms and pulsatory waveforms) from intensive care patients that triggered an alarm for one of five cardiac arrhythmia types (Asystole, Bradycardia, Tachycardia, Ventricular Tachycardia, and Ventricular Flutter/Fibrillation). The classification problem is to decide whether the alarm should or should not have been generated. The results show that, compared with the standard Random Forest model, the proposed weighting approach improved classification accuracy for the three most challenging of the five investigated arrhythmias.

Keywords: arrhythmia; false alarm; machine learning; weighted random forest.
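
The abstract contrasts a simple average of base models with a weighted ensemble. The sketch below illustrates that general idea only; it is not the weighting algorithm proposed in the article, which the abstract does not specify. The function names, the per-tree quality scores, and the `power` tuning parameter are all hypothetical, chosen here for illustration.

```python
def average_vote(tree_probs):
    """Unweighted ensemble: plain mean of the per-tree probabilities for one sample."""
    return sum(tree_probs) / len(tree_probs)


def weighted_vote(tree_probs, tree_scores, power=1.0):
    """Weighted ensemble: each tree's probability is weighted by score ** power.

    `tree_scores` is an assumed per-tree quality measure (for example, accuracy
    on held-out data) and `power` is an assumed tuning parameter. With power=0
    every weight is 1, so the result equals the simple average.
    """
    weights = [score ** power for score in tree_scores]
    total = sum(weights)
    if total == 0:
        raise ValueError("at least one tree must have a non-zero weight")
    return sum(w * p for w, p in zip(weights, tree_probs)) / total


# Illustrative numbers only: three trees estimating P(alarm is true) for one sample.
tree_probs = [0.9, 0.8, 0.2]
tree_scores = [0.9, 0.8, 0.5]  # hypothetical per-tree quality scores

plain = average_vote(tree_probs)
weighted = weighted_vote(tree_probs, tree_scores, power=2.0)
print(f"simple average: {plain:.3f}, weighted: {weighted:.3f}")
```

In this toy case the weaker third tree pulls the simple average down, while the weighted vote discounts it. How the weights are derived and tuned is exactly the question the article addresses.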

Grants and funding

R01 GM117594/GM/NIGMS NIH HHS/United States