Pooling Operations in Deep Learning: From "Invariable" to "Variable"

Zhou Tao; Chang XiaoYu; Lu HuiLing; Ye XinYu; Liu YunCan; Zheng XiaoMin

doi:10.1155/2022/4067581

Pooling Operations in Deep Learning: From "Invariable" to "Variable"

Biomed Res Int. 2022 Jun 20:2022:4067581. doi: 10.1155/2022/4067581. eCollection 2022.

Authors

Zhou Tao^{1

2}, Chang XiaoYu¹, Lu HuiLing³, Ye XinYu¹, Liu YunCan¹, Zheng XiaoMin⁴

Affiliations

¹ School of Computer Science and Engineering, North Minzu University, Yinchuan 750021, China.
² Key Laboratory of Image and Graphics Intelligent Processing of State Ethnic Affairs Commission, North Minzu University, Yinchuan 750021, China.
³ School of Science, Ningxia Medical University, Yinchuan 750004, China.
⁴ Research Institute for Reproductive Medicine and Genetic Diseases, Wuxi Maternity and Child Health Hospital, Wuxi 214002, China.

Abstract

Deep learning has become a research hotspot in multimedia, especially in the field of image processing. Pooling operation is an important operation in deep learning. Pooling operation can reduce the feature dimension, the number of parameters, the complexity of computation, and the complexity of time. With the development of deep learning models, pooling operation has made great progress. The main contributions of this paper on pooling operation are as follows: firstly, the steps of the pooling operation are summarized as the pooling domain, pooling kernel, step size, activation value, and response value. Secondly, the expression form of pooling operation is standardized. From the perspective of "invariable" to "variable," this paper analyzes the pooling domain and pooling kernel in the pooling operation. Pooling operation can be classified into four categories: invariable of pooling domain, variable of pooling domain, variable of pooling kernel, and the pooling of invariable "+" variable. Finally, the four types of pooling operation are summarized and discussed with their advantages and disadvantages. There is great significance to the research of pooling operations and the iterative updating of deep learning models.

Publication types

Review

MeSH terms

Deep Learning*
Image Processing, Computer-Assisted / methods