Key frame extraction method for lecture videos based on spatio-temporal subtitles

Yunzuo Zhang; Yi Li; Zhaoquan Cai; Xuejun Wang; Jiayu Zhang; Shui Lam

doi:10.1007/s11042-023-15829-5

Key frame extraction method for lecture videos based on spatio-temporal subtitles

Multimed Tools Appl. 2023 Jun 2:1-14. doi: 10.1007/s11042-023-15829-5. Online ahead of print.

Authors

Yunzuo Zhang¹, Yi Li¹, Zhaoquan Cai², Xuejun Wang¹, Jiayu Zhang¹, Shui Lam³

Affiliations

¹ Shijiazhuang Tiedao University, Shijiazhuang, People's Republic of China.
² Shanwei Institute of Technology, Shanwei, People's Republic of China.
³ California State University, Long Beach, CA USA.

Abstract

Affected by the Corona Virus Disease 2019 (COVID-19), online lecture videos have witnessed an explosive growth. In the face of massive videos, this paper proposes a method for extracting key frames of lecture videos based on spatio-temporal subtitles, which can efficiently and quickly obtain effective information. Firstly, the spatio-temporal slices of subtitle area of the video sequence are extracted and spliced along the time axis to construct the video spatio-temporal subtitle. Then, the video spatio-temporal subtitle is processed in binarization, and the projection method is used to construct the SSPA curve of the video spatio-temporal subtitle. Finally, a selection method for steady-state key frame is designed, that is, the key frame extraction is realized by combining curve edge detection and subtitle existence threshold, which ensures the robustness of the proposed method. The test results of 8 videos show that the average value of the comprehensive index F₁-score of the key frame extracted by the algorithm can reach 0.97, the average precision is 0.97, and the average recall rate is 0.98. It can effectively extract the key frames in lecture videos, and compared with other algorithms, the average running time is reduced to 0.072 of the original, which is helpful to extract video information quickly and accurately.

Keywords: Key frame extraction; Lecture video; Spatio-temporal subtitle; Steady-state key frame.

© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2023. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.