Fast and Efficient Image Novelty Detection Based on Mean-Shifts

Matthias Hermann; Georg Umlauf; Bastian Goldlücke; Matthias O Franz

doi:10.3390/s22197674

Sensors (Basel). 2022 Oct 10;22(19):7674. doi: 10.3390/s22197674.

Authors

Matthias Hermann¹, Georg Umlauf¹, Bastian Goldlücke², Matthias O Franz¹

Affiliations

¹ Institute for Optical Systems, HTWG Konstanz-University of Applied Sciences, Alfred-Wachtel-Straße 8, 78462 Konstanz, Germany.
² Department of Computer Science, University of Konstanz, Universitätsstraße 10, 78464 Konstanz, Germany.

Abstract

Image novelty detection is a recurring task in computer vision: anomalous images must be detected using a training dataset that consists solely of normal reference data. Neural networks in particular have been found to be well suited to this task. Our approach first transforms the training and test images into ensembles of patches, which makes it possible to assess mean-shifts between normal data and outliers. Because a mean-shift is only detectable when the outlier ensemble and the inlier distribution are separated from each other in feature space, a rich feature space, such as that of a pre-trained neural network, needs to be chosen to represent the extracted patches. The mean-shift is estimated with the Hotelling T² test. The patch size turned out to be a crucial hyperparameter, and choosing it requires additional domain knowledge about the spatial size of the expected anomalies (local vs. global). The patch size also affects model selection and the choice of feature space, because commonly used Convolutional Neural Networks and Vision Transformers have very different receptive field sizes. To showcase the state-of-the-art capabilities of our approach, we compare its results with classical and deep learning methods on the popular CIFAR-10 dataset, and we demonstrate its real-world applicability in a large-scale industrial inspection scenario using the MVTec dataset. Because of its inexpensive design, our method can be implemented with a single additional 2D-convolution and pooling layer; it allows particularly fast prediction times while being very data-efficient.
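The mean-shift scoring described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes patch features are already available as NumPy arrays (random Gaussian vectors stand in for features from a pre-trained network), that the mean and covariance are estimated from the normal training patches only, and that a small ridge term is added to keep the covariance invertible. The function names fit_normal_model and hotelling_t2 are hypothetical.

```python
import numpy as np


def fit_normal_model(features, ridge=1e-6):
    """Estimate mean and inverse covariance from normal patch features, shape (N, d)."""
    mu = features.mean(axis=0)
    cov = np.cov(features, rowvar=False)
    # Small ridge term (an assumption of this sketch) to keep the matrix invertible.
    cov = cov + ridge * np.eye(cov.shape[0])
    return mu, np.linalg.inv(cov)


def hotelling_t2(patch_features, mu, cov_inv):
    """One-sample Hotelling T^2 score for an ensemble of n patches, shape (n, d).

    The covariance is taken from the training data rather than re-estimated
    from the test ensemble, which is a simplification made for this sketch.
    """
    n = patch_features.shape[0]
    diff = patch_features.mean(axis=0) - mu
    return float(n * diff @ cov_inv @ diff)


# Synthetic stand-ins for patch features from a pre-trained network.
rng = np.random.default_rng(0)
d = 8
normal_train = rng.normal(size=(5000, d))
mu, cov_inv = fit_normal_model(normal_train)

inlier_patches = rng.normal(size=(64, d))             # same distribution as training
outlier_patches = rng.normal(loc=1.0, size=(64, d))   # mean shifted by 1 per dimension

score_in = hotelling_t2(inlier_patches, mu, cov_inv)
score_out = hotelling_t2(outlier_patches, mu, cov_inv)
print(f"inlier T^2: {score_in:.1f}, outlier T^2: {score_out:.1f}")
```

In this toy setting the shifted ensemble receives a much larger score than the unshifted one, which is the behavior a threshold on the T² statistic would exploit. How patches are extracted and which network layer supplies the features are left open here, since the abstract does not specify them.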

Keywords: deep learning; defect detection; image novelty detection; mean-shift.

MeSH terms

Image Processing, Computer-Assisted* / methods
Neural Networks, Computer*

Grants and funding

01IS19083A/Federal Ministry of Education and Research