Bearing Fault Diagnosis Based on an Enhanced Image Representation Method of Vibration Signal and Conditional Super Token Transformer

Jiaying Li; Han Liu; Jiaxun Liang; Jiahao Dong; Bin Pang; Ziyang Hao; Xin Zhao

doi:10.3390/e24081055

Bearing Fault Diagnosis Based on an Enhanced Image Representation Method of Vibration Signal and Conditional Super Token Transformer

Entropy (Basel). 2022 Jul 31;24(8):1055. doi: 10.3390/e24081055.

Authors

Jiaying Li^{1

2

3}, Han Liu^{1

2

3}, Jiaxun Liang^{1

2

3}, Jiahao Dong^{1

2

3}, Bin Pang^{1

2

3}, Ziyang Hao^{1

2

3}, Xin Zhao^{1

2

3}

Affiliations

¹ National & Local Joint Engineering Research Center of Metrology Instrument and System, Hebei University, Baoding 071002, China.
² Hebei Technology Innovation Center for Lightweight of New Energy Vehicle Power System, Hebei University, Baoding 071002, China.
³ College of Quality and Technical Supervision, Hebei University, Baoding 071002, China.

Abstract

Multipoint Optimal Minimum Entropy Deconvolution Adjusted (MOMEDA) is an advanced deconvolution method, which can effectively inhibit the interference of background noise and distinguish the fault period by calculating the multipoint kurtosis values. However, multipoint kurtosis (MKurt) could lead to misjudgment since it is sensitive to spurious noise spikes. Considering that L-kurtosis has good robustness with noise, this paper proposes a multipoint envelope L-kurtosis (MELkurt) method for establishing the temporal features. Then, an enhanced image representation method of vibration signals is proposed by employing the Gramian Angular Difference Field (GADF) method to convert the MELkurt series into images. Furthermore, to effectively learn and extract the features of GADF images, this paper develops a deep learning method named Conditional Super Token Transformer (CSTT) by incorporating the Super Token Transformer block, Super Token Mixer module, and Conditional Positional Encoding mechanism into Vision Transformer appropriately. Transfer learning is introduced to enhance the diagnostic accuracy and generalization capability of the designed CSTT. Consequently, a novel bearing fault diagnosis framework is established based on the presented enhanced image representation and CSTT. The proposed method is compared with Vision Transformer and some CNN-based models to verify the recognition effect by two experimental datasets. The results show that MELkurt significantly improves the fault feature enhancement ability with superior noise robustness to kurtosis, and the proposed CSTT achieves the highest diagnostic accuracy and stability.

Keywords: Vision Transformer; fault diagnosis; fault visualization; multipoint envelope L-kurtosis; rolling bearing.

Abstract

Grants and funding