Identification of document paper using hybrid feature extraction

J Forensic Sci. 2023 Sep;68(5):1808-1815. doi: 10.1111/1556-4029.15330. Epub 2023 Jul 7.

Abstract

Document forgery is a significant issue in Korea, with around ten thousand cases reported every year. Analyzing paper plays a crucial role in examining questionable documents such as marketable securities and contracts, which can aid in solving criminal cases of document forgery. Paper analysis can also provide essential insights in other types of criminal cases, serving as an important clue for solving cases such as the source of a blackmail letter. The papermaking process generates distinct forming fabric marks and formations, which are critical features for paper classification. These characteristics are observable under transmitted light and are created by the forming fabric pattern and the distribution of pulp fibers, respectively. In this study, we propose a novel approach for paper identification based on hybrid features. This method combines texture features extracted from images converted using the gray-level co-occurrence matrix (GLCM) approach and a convolutional neural network (CNN), with another set of features extracted by the CNN using the same images as input. We applied the proposed method to classification tasks for seven major paper brands available in the Korean market, achieving an accuracy of 97.66%. The results confirm the applicability of this method for visually inspecting paper products and demonstrate its potential for assisting in solving criminal cases involving document forgery.

Keywords: convolutional neural network (CNN); forensic document analysis; gray-level co-occurrence matrix (GLCM); paper analysis; paper forming fabric; questioned document examination.