Training Data Extraction and Object Detection in Surveillance Scenario

Artur Wilkowski; Maciej Stefańczyk; Włodzimierz Kasprzak

doi:10.3390/s20092689

Sensors (Basel). 2020 May 8;20(9):2689. doi: 10.3390/s20092689.

Authors

Artur Wilkowski¹, Maciej Stefańczyk¹, Włodzimierz Kasprzak¹

Affiliation

¹ Institute of Control and Computation Engineering, Warsaw University of Technology, Nowowiejska 15/19, 00-665 Warszawa, Poland.

Abstract

Police and various security services use video analysis to secure public spaces and mass events and to investigate criminal activity. Because of the huge amount of data supplied to surveillance systems, some automatic data processing is a necessity. In one typical scenario, an operator marks an object in an image frame and searches for all occurrences of that object in other frames or even in other image sequences. This problem is hard in general. Algorithms supporting this scenario must reconcile several seemingly contradictory requirements: training and detection speed, detection reliability, and learning from small data sets. In the system proposed here, we use a two-stage detector. The first stage, region proposal, is based on a Cascade Classifier, while the second stage, classification, is based on either Support Vector Machines (SVMs) or Convolutional Neural Networks (CNNs). The proposed configuration ensures both speed and detection reliability. In addition, an object tracking and background-foreground separation algorithm, supported by the GrabCut algorithm and a sample synthesis procedure, is used to collect rich training data for the detector. Experiments show that the system is effective, useful, and applicable to practical surveillance tasks.
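
The two-stage arrangement described in the abstract can be illustrated with a minimal Python sketch. It shows only the control flow: a cheap, high-recall proposal stage followed by a stricter classification stage. It is not the authors' implementation. All names (`two_stage_detect`, `toy_propose`, `toy_classify`, `Detection`) are hypothetical, and the toy functions are stand-ins for the Cascade Classifier and the SVM/CNN classifier, which the abstract does not specify in detail.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple

Box = Tuple[int, int, int, int]   # (x, y, width, height)
Frame = Sequence[Sequence[int]]   # toy frame: rows of 0/1 pixel values


@dataclass
class Detection:
    box: Box
    score: float


def two_stage_detect(
    frame: Frame,
    propose: Callable[[Frame], List[Box]],
    classify: Callable[[Frame, Box], float],
    threshold: float = 0.5,
) -> List[Detection]:
    """Run a fast proposal stage, then score each proposal with a second stage."""
    detections = []
    for box in propose(frame):           # stage 1: many candidates, cheap
        score = classify(frame, box)     # stage 2: slower, more reliable
        if score >= threshold:
            detections.append(Detection(box, score))
    # Highest-scoring detections first.
    return sorted(detections, key=lambda d: d.score, reverse=True)


def toy_propose(frame: Frame, size: int = 2) -> List[Box]:
    """Hypothetical stand-in for a cascade: keep any window with a lit pixel."""
    height, width = len(frame), len(frame[0])
    boxes = []
    for y in range(height - size + 1):
        for x in range(width - size + 1):
            if any(frame[y + dy][x + dx]
                   for dy in range(size) for dx in range(size)):
                boxes.append((x, y, size, size))
    return boxes


def toy_classify(frame: Frame, box: Box) -> float:
    """Hypothetical stand-in for an SVM/CNN score: fraction of lit pixels."""
    x, y, w, h = box
    lit = sum(frame[y + dy][x + dx] for dy in range(h) for dx in range(w))
    return lit / (w * h)


if __name__ == "__main__":
    # 6x6 frame with a single 2x2 "object" at column 2, row 2.
    frame = [[0] * 6 for _ in range(6)]
    for row in (2, 3):
        for col in (2, 3):
            frame[row][col] = 1
    for det in two_stage_detect(frame, toy_propose, toy_classify, threshold=0.75):
        print(det)
```

In this toy setup the first stage passes every window that touches the object, and the second stage keeps only the window that covers it fully. This mirrors the division of labor the abstract describes, with speed coming from the first stage and reliability from the second.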

Keywords: CNN; SVM; cascade classifier; few shot learning; object detection; video surveillance.

Grants and funding

CYBERSECIDENT/369195/I/NCBR/2017/Narodowe Centrum Badań i Rozwoju