An event-oriented diffusion-refinement method for sparse events completion

Bo Zhang; Yuqi Han; Jinli Suo; Qionghai Dai

doi:10.1038/s41598-024-57333-2

Sci Rep. 2024 Mar 21;14(1):6802. doi: 10.1038/s41598-024-57333-2.

Authors

Bo Zhang¹, Yuqi Han¹, Jinli Suo²,³,⁴, Qionghai Dai¹,⁵

Affiliations

¹ Department of Automation, Tsinghua University, Beijing, 100084, China.
² Department of Automation, Tsinghua University, Beijing, 100084, China. jlsuo@tsinghua.edu.cn.
³ Institute for Brain and Cognitive Sciences, Tsinghua University, Beijing, 100084, China. jlsuo@tsinghua.edu.cn.
⁴ Shanghai Artificial Intelligence Laboratory, Shanghai, 200232, China. jlsuo@tsinghua.edu.cn.
⁵ Institute for Brain and Cognitive Sciences, Tsinghua University, Beijing, 100084, China.

Abstract

Event cameras, or dynamic vision sensors (DVS), record asynchronous responses to brightness changes instead of conventional intensity frames, and feature ultra-high sensitivity at low bandwidth. This sensing mechanism has clear advantages in challenging scenarios with fast motion and large dynamic range. However, the recorded events can be highly sparse, owing either to limited hardware bandwidth or to extreme photon starvation in harsh environments. To unlock the full potential of event cameras, we propose an event sequence completion approach that conforms to the unique characteristics of event data in both the processing stage and the output form. Specifically, we treat event streams as 3D event clouds in the spatiotemporal domain, develop a diffusion-based generative model that generates dense clouds in a coarse-to-fine manner, and recover exact timestamps so that the temporal resolution of the raw data is maintained. To validate the method comprehensively, we perform extensive experiments on three widely used public datasets with different spatial resolutions, and additionally collect a new event dataset covering diverse scenarios with highly dynamic motion and harsh illumination. Besides generating high-quality dense events, our method benefits downstream applications such as object classification and intensity frame reconstruction.
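As a rough illustration of what treating an event stream as a 3D event cloud could look like, the sketch below maps events to points in a normalized (x, y, t) cube and maps the time coordinate back to timestamps. It is not the authors' implementation: the function names, the (x, y, t, polarity) row layout, and the min-max normalization are all assumptions made for this example, and the diffusion and refinement stages are not shown.

```python
import numpy as np


def events_to_cloud(events, width, height):
    """Map events to points in the unit cube [0, 1]^3.

    Assumed layout (hypothetical): each row is (x, y, t, polarity).
    Polarity is ignored here; only the spatiotemporal position is kept.
    Returns the cloud and the (t0, span) pair needed to undo the
    time normalization.
    """
    events = np.asarray(events, dtype=np.float64)
    x, y, t = events[:, 0], events[:, 1], events[:, 2]
    t0 = t.min()
    span = t.max() - t0
    if span == 0:
        span = 1.0  # avoid division by zero when all timestamps coincide
    cloud = np.stack(
        [x / max(width - 1, 1), y / max(height - 1, 1), (t - t0) / span],
        axis=1,
    )
    return cloud, (t0, span)


def cloud_to_timestamps(cloud, t_range):
    """Invert the time normalization to get timestamps in original units."""
    t0, span = t_range
    return cloud[:, 2] * span + t0


# Three toy events on a 4x3 sensor, timestamps in microseconds (made-up values).
toy_events = [[0, 0, 100, 1], [3, 1, 300, -1], [1, 2, 200, 1]]
cloud, t_range = events_to_cloud(toy_events, width=4, height=3)
timestamps = cloud_to_timestamps(cloud, t_range)
```

Because the time axis is only rescaled, not binned into frames, the round trip returns the original timestamps, which is the property a cloud representation needs if the raw temporal resolution is to be kept.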
