Horizontal Pod Autoscaling in Kubernetes for Elastic Container Orchestration

Thanh-Tung Nguyen; Yu-Jin Yeom; Taehong Kim; Dae-Heon Park; Sehan Kim

doi:10.3390/s20164621

Sensors (Basel). 2020 Aug 17;20(16):4621. doi: 10.3390/s20164621.

Authors

Thanh-Tung Nguyen¹, Yu-Jin Yeom¹, Taehong Kim¹, Dae-Heon Park², Sehan Kim²

Affiliations

¹ School of Information and Communication Engineering, Chungbuk National University, Cheongju, Chungbuk 28644, Korea.
² Electronics and Telecommunications Research Institute, Daejeon 34129, Korea.

Abstract

Kubernetes, an open-source container orchestration platform, enables high availability and scalability through several autoscaling mechanisms: the Horizontal Pod Autoscaler (HPA), the Vertical Pod Autoscaler, and the Cluster Autoscaler. Among them, HPA helps provide seamless service by dynamically increasing or decreasing the number of resource units, called pods, without restarting the whole system. By default, Kubernetes monitors Resource Metrics, which include the CPU and memory usage of host machines and their pods. By contrast, Custom Metrics, provided by external software such as Prometheus, can be customized to monitor a wide range of metrics. In this paper, we investigate HPA through diverse experiments to provide critical knowledge of its operational behaviors. We also discuss the essential differences between Kubernetes Resource Metrics (KRM) and Prometheus Custom Metrics (PCM) and how they affect HPA's performance. Lastly, we provide deeper insights and lessons on how to optimize the performance of HPA for researchers, developers, and system administrators who work with Kubernetes.
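
As a rough illustration of the scaling decision that HPA makes, the sketch below follows the replica-count rule described in the Kubernetes documentation: the desired number of pods is the current number multiplied by the ratio of the observed metric value to the target value, rounded up, with no change when the ratio is within a tolerance of 1.0 (0.1 by default). This is not code from the paper. The function name, parameter names, and the example bounds are illustrative assumptions, and the real controller also handles details that are omitted here, such as pods that are not ready, missing metrics, and stabilization windows.

```python
import math


def desired_replicas(current_replicas, current_value, target_value,
                     tolerance=0.1, min_replicas=1, max_replicas=10):
    """Simplified sketch of the HPA replica calculation (hypothetical helper).

    current_value and target_value are in the same unit, for example
    average CPU utilization in percent across the pods.
    """
    if target_value <= 0:
        raise ValueError("target_value must be positive")

    ratio = current_value / target_value

    # Within the tolerance band around 1.0, the replica count is left as is.
    if abs(ratio - 1.0) <= tolerance:
        return current_replicas

    # Scale proportionally to the metric ratio and round up.
    desired = math.ceil(current_replicas * ratio)

    # Clamp to the configured bounds (assumed values for illustration).
    return max(min_replicas, min(max_replicas, desired))


if __name__ == "__main__":
    # 4 pods at 90% average CPU with a 50% target: scale up.
    print(desired_replicas(4, 90, 50))
    # 4 pods at 10% average CPU with a 50% target: scale down.
    print(desired_replicas(4, 10, 50))
```

The same rule applies whether the metric value comes from the Resource Metrics pipeline or from a custom metric exposed through Prometheus; only the source of the observed value changes, along with how often it is refreshed.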

Keywords: Docker; Horizontal Pod Autoscaling (HPA); Kubernetes; Prometheus; cloud computing; container orchestration; custom metrics; edge computing; resource metrics.
