Unsupervised Conditional Diffusion Models in Video Anomaly Detection for Monitoring Dust Pollution

Sensors (Basel). 2024 Feb 23;24(5):1464. doi: 10.3390/s24051464.

Abstract

Video surveillance is widely used in monitoring environmental pollution, particularly harmful dust. Currently, manual video monitoring remains the predominant method for analyzing potential pollution, which is inefficient and prone to errors. In this paper, we introduce a new unsupervised method based on latent diffusion models. Specifically, we propose a spatio-temporal network structure, which better integrates the spatial and temporal features of videos. Our conditional guidance mechanism samples frames of input videos to guide high-quality generation and obtains frame-level anomaly scores, comparing generated videos with original ones. We also propose an efficient compression strategy to reduce computational costs, allowing the model to perform in a latent space. The superiority of our method was demonstrated by numerical experiments in three public benchmarks and practical application analysis in coal mining over previous SOTA methods with better AUC, of at most over 3%. Our method accurately detects abnormal patterns in multiple challenging environmental monitoring scenarios, illustrating the potential application possibilities in the environmental protection domain and beyond.

Keywords: conditional diffusion models; dust pollution; latent compression; video anomaly detection.

Grants and funding

This research of Dianpeng Wang was funded by the National Natural Science Foundation of China (Grant no. NSFC 12171033).