HOTS: A Hierarchy of Event-Based Time-Surfaces for Pattern Recognition

Xavier Lagorce; Garrick Orchard; Francesco Galluppi; Bertram E Shi; Ryad B Benosman

doi:10.1109/TPAMI.2016.2574707

HOTS: A Hierarchy of Event-Based Time-Surfaces for Pattern Recognition

IEEE Trans Pattern Anal Mach Intell. 2017 Jul 1;39(7):1346-1359. doi: 10.1109/TPAMI.2016.2574707.

Authors

Xavier Lagorce¹, Garrick Orchard², Francesco Galluppi¹, Bertram E Shi³, Ryad B Benosman¹

Affiliations

¹ Vision and Natural Computation Group, Institut National de la Santé et de la Recherche Médicale, Sorbonne Universités, Institut de la Vision, Université Paris 06, Paris, Paris, FranceFrance.
² Singapore Institute for Neurotechnology (SINAPSE), National University of Singapore, Singapore.
³ Department of Electronic and Computer Engineering.

PMID: 27411216
DOI: 10.1109/TPAMI.2016.2574707

Abstract

This paper describes novel event-based spatio-temporal features called time-surfaces and how they can be used to create a hierarchical event-based pattern recognition architecture. Unlike existing hierarchical architectures for pattern recognition, the presented model relies on a time oriented approach to extract spatio-temporal features from the asynchronously acquired dynamics of a visual scene. These dynamics are acquired using biologically inspired frameless asynchronous event-driven vision sensors. Similarly to cortical structures, subsequent layers in our hierarchy extract increasingly abstract features using increasingly large spatio-temporal windows. The central concept is to use the rich temporal information provided by events to create contexts in the form of time-surfaces which represent the recent temporal activity within a local spatial neighborhood. We demonstrate that this concept can robustly be used at all stages of an event-based hierarchical model. First layer feature units operate on groups of pixels, while subsequent layer feature units operate on the output of lower level feature units. We report results on a previously published 36 class character recognition task and a four class canonical dynamic card pip task, achieving near 100 percent accuracy on each. We introduce a new seven class moving face recognition task, achieving 79 percent accuracy.This paper describes novel event-based spatio-temporal features called time-surfaces and how they can be used to create a hierarchical event-based pattern recognition architecture. Unlike existing hierarchical architectures for pattern recognition, the presented model relies on a time oriented approach to extract spatio-temporal features from the asynchronously acquired dynamics of a visual scene. These dynamics are acquired using biologically inspired frameless asynchronous event-driven vision sensors. Similarly to cortical structures, subsequent layers in our hierarchy extract increasingly abstract features using increasingly large spatio-temporal windows. The central concept is to use the rich temporal information provided by events to create contexts in the form of time-surfaces which represent the recent temporal activity within a local spatial neighborhood. We demonstrate that this concept can robustly be used at all stages of an event-based hierarchical model. First layer feature units operate on groups of pixels, while subsequent layer feature units operate on the output of lower level feature units. We report results on a previously published 36 class character recognition task and a four class canonical dynamic card pip task, achieving near 100 percent accuracy on each. We introduce a new seven class moving face recognition task, achieving 79 percent accuracy.

Keywords: Biosensors; Cameras; Character recognition; Feature extraction; Object recognition; Visualization.

Publication types

Research Support, Non-U.S. Gov't