Signals classification based on IA-optimal CNN

Neural Comput Appl. 2021;33(15):9703-9721. doi: 10.1007/s00521-021-05736-x. Epub 2021 May 27.

Abstract

The versatility of the existing A-optimal-based CNN for solving multiple types of signals classification problems has not been verified by different signals datasets. Moreover, the existing A-optimal-based CNN uses a simplified approximate function as the optimization objective function instead of precise analytical function, which affects the signals classification accuracy to a certain extent. In this paper, a classification method called IA-optimal CNN is proposed. To improve the stability of the classifier, the trace of the covariance matrix of the weights of the fully connected layer is used as the optimization objective function, and the parameter optimization model is established without any simplification of the optimization objective function. In addition, to avoid the difficulty of not being able to obtain the analytical expression formula of the partial derivative of the inverse matrix with regard to the networks parameters, a novel dual function is introduced to transform the optimization problem into an equivalent binary function optimization problem. Furthermore, based on the above analytical solution results, the parameters are updated using the alternate iterative optimization method and the accurate weight update formula is deduced in detail. Five signals datasets are used to test the universality of the IA-optimal CNN in signals classification fields. The performance of IA-optimal CNN is showed, and the experimental results are compared with the existing A-optimal-based classification algorithm. Lastly, the following conclusion is proved theoretically: For the A-optimal-based CNN, the trace of the covariance matrix will continue to decrease and approach a convergence value in the iterative process, but it is impossible for the networks to strictly reach the A-optimal state.

Keywords: A-optimal; Alternate iterative optimization; Convolutional neural networks; Dual function; Signals classification.