Multi-instance dictionary learning for detecting abnormal events in surveillance videos

Jing Huo; Yang Gao; Wanqi Yang; Hujun Yin

doi:10.1142/S0129065714300101

Multi-instance dictionary learning for detecting abnormal events in surveillance videos

Int J Neural Syst. 2014 May;24(3):1430010. doi: 10.1142/S0129065714300101. Epub 2014 Jan 26.

Authors

Jing Huo¹, Yang Gao, Wanqi Yang, Hujun Yin

Affiliation

¹ State Key Laboratory for Novel Software Technology, Nanjing University, P. R. China.

PMID: 24552509
DOI: 10.1142/S0129065714300101

Abstract

In this paper, a novel method termed Multi-Instance Dictionary Learning (MIDL) is presented for detecting abnormal events in crowded video scenes. With respect to multi-instance learning, each event (video clip) in videos is modeled as a bag containing several sub-events (local observations); while each sub-event is regarded as an instance. The MIDL jointly learns a dictionary for sparse representations of sub-events (instances) and multi-instance classifiers for classifying events into normal or abnormal. We further adopt three different multi-instance models, yielding the Max-Pooling-based MIDL (MP-MIDL), Instance-based MIDL (Inst-MIDL) and Bag-based MIDL (Bag-MIDL), for detecting both global and local abnormalities. The MP-MIDL classifies observed events by using bag features extracted via max-pooling over sparse representations. The Inst-MIDL and Bag-MIDL classify observed events by the predicted values of corresponding instances. The proposed MIDL is evaluated and compared with the state-of-the-art methods for abnormal event detection on the UMN (for global abnormalities) and the UCSD (for local abnormalities) datasets and results show that the proposed MP-MIDL and Bag-MIDL achieve either comparable or improved detection performances. The proposed MIDL method is also compared with other multi-instance learning methods on the task and superior results are obtained by the MP-MIDL scheme.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms*
Artificial Intelligence*
Humans
Image Interpretation, Computer-Assisted
Learning*
Models, Theoretical*
Signal Detection, Psychological*
Video Recording