A dataset of 1050-tampered color and grayscale images (CG-1050)

Data Brief. 2019 Nov 21:28:104864. doi: 10.1016/j.dib.2019.104864. eCollection 2020 Feb.

Abstract

This paper presents the CG-1050 dataset, which consists of 100 original images, 1050 tampered images, and their corresponding masks. The dataset is organized into four directories, holding the original images, the tampered images, the mask images, and a description file. The directory of original images includes 15 color and 85 grayscale images. The directory of tampered images contains 1050 images, each obtained through one of the following types of tampering: copy-move, cut-paste, retouching, and colorizing. The mask directory contains the true mask between each original image and its tampered counterpart (1380 masks). For each entry, the description file lists the names of the images (i.e., original, tampered, and mask), the image description, the photo location, the type of tampering, and the manipulated object in the image. With this dataset, researchers can train and validate fake image classification methods, either for labelling an image as tampered or for pixel-level forgery detection.

Keywords: Colorizing; Copy-move; Cut-paste; Fake image; Forgery detection; Retouching; Tampering detection.
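
To illustrate what a mask between an original image and its tampered counterpart represents, the following is a minimal sketch in plain Python. It assumes the mask is a binary map marking the pixels that differ between the two images. The abstract does not state how the CG-1050 masks were actually produced, so the thresholded absolute difference used here, along with the names `difference_mask`, `tampered_fraction`, and `threshold`, are hypothetical and chosen only for illustration.

```python
from typing import List

# Assumption: a grayscale image is represented as rows of 0-255 intensities.
Image = List[List[int]]


def difference_mask(original: Image, tampered: Image, threshold: int = 0) -> Image:
    """Return a binary mask: 1 where the images differ by more than threshold."""
    same_shape = len(original) == len(tampered) and all(
        len(row_o) == len(row_t) for row_o, row_t in zip(original, tampered)
    )
    if not same_shape:
        raise ValueError("images must have identical dimensions")
    return [
        [1 if abs(a - b) > threshold else 0 for a, b in zip(row_o, row_t)]
        for row_o, row_t in zip(original, tampered)
    ]


def tampered_fraction(mask: Image) -> float:
    """Return the share of pixels flagged as tampered in a binary mask."""
    total = sum(len(row) for row in mask)
    return sum(sum(row) for row in mask) / total if total else 0.0


# Toy 2x3 images (not taken from the dataset): one strongly altered pixel
# and one pixel with a small intensity change.
original = [[10, 10, 10], [10, 10, 10]]
tampered = [[10, 200, 10], [10, 10, 12]]

mask = difference_mask(original, tampered, threshold=5)
print(mask)
print(tampered_fraction(mask))
```

With `threshold=5`, the small intensity change is ignored and only the strongly altered pixel is flagged. A nonzero threshold of this kind is one possible way to tolerate compression noise, at the cost of missing subtle retouching.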