Quantization avoids saddle points in distributed optimization

Proc Natl Acad Sci U S A. 2024 Apr 23;121(17):e2319625121. doi: 10.1073/pnas.2319625121. Epub 2024 Apr 19.

Abstract

Distributed nonconvex optimization underpins key functionalities of numerous distributed systems, ranging from power systems, smart buildings, cooperative robots, and vehicle networks to sensor networks. Recently, it has also emerged as a promising solution to handle the enormous growth in data and model sizes in deep learning. A fundamental problem in distributed nonconvex optimization is avoiding convergence to saddle points, which significantly degrade optimization accuracy. We find that the process of quantization, which is necessary for all digital communications, can be exploited to enable saddle-point avoidance. More specifically, we propose a stochastic quantization scheme and prove that it can effectively escape saddle points and ensure convergence to a second-order stationary point in distributed nonconvex optimization. With an easily adjustable quantization granularity, the approach allows a user to control the number of bits sent per iteration and, hence, to aggressively reduce the communication overhead. Numerical experimental results using distributed optimization and learning problems on benchmark datasets confirm the effectiveness of the approach.
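To illustrate the general idea, the following is a minimal sketch of a generic unbiased stochastic (dithered) quantizer with an adjustable granularity; the function name, the parameter delta, and the specific rounding rule are illustrative assumptions for this sketch, not the paper's exact construction. Randomized rounding makes the quantization error behave as unbiased noise, which is the kind of perturbation that can help iterates escape saddle points, while a coarser grid (larger delta) corresponds to fewer bits per transmitted value.

    import numpy as np

    def stochastic_quantize(x, delta, rng):
        # Dithered rounding of each coordinate of x to a grid of spacing delta.
        # The rounding direction is random, with probabilities chosen so that
        # E[Q(x)] = x, i.e., the quantization error is unbiased noise.
        scaled = np.asarray(x) / delta
        lower = np.floor(scaled)
        prob_up = scaled - lower                      # fractional part in [0, 1)
        rounded = lower + (rng.random(scaled.shape) < prob_up)
        return delta * rounded

    rng = np.random.default_rng(0)
    grad = rng.normal(size=5)                         # e.g., a local gradient to be sent to neighbors
    print(grad)
    print(stochastic_quantize(grad, delta=0.1, rng=rng))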

Keywords: distributed nonconvex optimization; quantization; saddle-point avoidance.