Rectification and Super-Resolution Enhancements for Forensic Text Recognition

Pablo Blanco-Medina; Eduardo Fidalgo; Enrique Alegre; Rocío Alaiz-Rodríguez; Francisco Jáñez-Martino; Alexandra Bonnici

doi:10.3390/s20205850

Sensors (Basel). 2020 Oct 16;20(20):5850. doi: 10.3390/s20205850.

Authors

Pablo Blanco-Medina¹,², Eduardo Fidalgo¹,², Enrique Alegre¹,², Rocío Alaiz-Rodríguez¹,², Francisco Jáñez-Martino¹,², Alexandra Bonnici³

Affiliations

¹ Department of Electrical, Systems and Automation, Universidad de León, 24007 León, Spain.
² INCIBE (Spanish National Cybersecurity Institute), 24005 León, Spain.
³ Faculty of Engineering, University of Malta, MSD2080 Msida, Malta.

Abstract

Retrieving text embedded within images is a challenging task in real-world settings. Problems such as low resolution and the orientation of the text can hinder the extraction of information. These problems are common in environments such as the Tor Darknet and Child Sexual Abuse images, where text extraction is crucial for the prevention of illegal activities. In this work, we evaluate eight text recognizers and, to improve text transcription performance, we combine these recognizers with rectification networks and super-resolution algorithms. We test our approach on four state-of-the-art datasets and two custom datasets: TOICO-1K, based on text retrieved from the Tor Darknet, and Child Sexual Abuse (CSA)-text, based on text retrieved from Child Sexual Exploitation Material. On the TOICO-1K dataset, we obtained a score of 0.3170 for correctly recognized words when we combined Deep Convolutional Neural Networks (CNN) and rectification-based recognizers. For the CSA-text dataset, applying resolution enhancements achieved a final score of 0.6960. The highest performance increase was achieved on the ICDAR 2015 dataset, with an improvement of 4.83% when combining the MORAN recognizer and the Residual Dense resolution approach. We conclude that rectification outperforms super-resolution when the two are applied separately, while their combination achieves the best average improvements on the chosen datasets.
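
The scores quoted above are fractions of correctly recognized words. As a rough illustration only, the following Python sketch computes such a fraction under the assumption that a word counts as correct when its transcription matches the ground truth exactly, ignoring case by default. The function name `word_accuracy`, its parameters, and the matching rule are hypothetical choices made for this example; the abstract does not state the exact evaluation protocol.

```python
def word_accuracy(predictions, ground_truths, case_sensitive=False):
    """Fraction of words whose predicted transcription equals the ground truth.

    Assumption: exact string match per word (case-insensitive by default).
    This is an illustrative metric, not necessarily the paper's protocol.
    """
    if len(predictions) != len(ground_truths):
        raise ValueError("predictions and ground_truths must have equal length")
    if not ground_truths:
        return 0.0  # no words to evaluate

    def normalize(word):
        return word if case_sensitive else word.lower()

    correct = sum(
        normalize(pred) == normalize(truth)
        for pred, truth in zip(predictions, ground_truths)
    )
    return correct / len(ground_truths)


# Hypothetical recognizer outputs and their ground-truth labels.
predicted = ["hello", "w0rld", "Tor"]
expected = ["hello", "world", "tor"]
score = word_accuracy(predicted, expected)
print(f"{score:.4f}")
```

Under this assumed definition, comparing the score of a recognizer on the original images against its score on rectified or super-resolved versions of the same images gives the kind of improvement figure reported above.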

Keywords: Tor Darknet; computer forensics; super-resolution; text recognition; text spotting.

Grants and funding

821966/European Commission