Deepfake attack prevention using steganography GANs

Iram Noreen; Muhammad Shahid Muneer; Saira Gillani

doi:10.7717/peerj-cs.1125

PeerJ Comput Sci. 2022 Oct 20:8:e1125. doi: 10.7717/peerj-cs.1125. eCollection 2022.

Authors

Iram Noreen¹, Muhammad Shahid Muneer¹, Saira Gillani¹

Affiliation

¹ Department of Computer Science, Bahria University, Islamabad, Lahore Campus, Pakistan.

Abstract

Background: Deepfakes are fake images or videos generated by deep learning algorithms. Ongoing progress in deep learning techniques such as auto-encoders and generative adversarial networks (GANs) is approaching a level that makes deepfake detection practically impossible. A deepfake is created by swapping video, image, or audio content with that of a target, which increases digital media threats on the internet. Much work has been done to detect deepfake videos through feature detection using convolutional neural networks (CNNs), recurrent neural networks (RNNs), and spatiotemporal CNNs. However, these techniques may not remain effective as GANs continue to improve. Style GANs can create fake videos with high accuracy that cannot be easily detected. Hence, deepfake prevention, rather than detection alone, is urgently needed.

Methods: Recently, blockchain-based ownership methods, image tags, and watermarks in video frames have been used to prevent deepfakes. However, these approaches are not fully effective: a frame can still be faked by copying its watermark and reusing it to create a deepfake. In this research, an enhanced version of the steganography technique RivaGAN is used to address the issue. The proposed approach encodes watermarks into features of the video frames by training an "attention model" with the ReLU activation function to speed up training.
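
The idea of attention-guided embedding described above can be illustrated with a toy sketch. This is not the authors' RivaGAN-based implementation: it is an untrained, pure-Python illustration that assumes a single grayscale frame flattened to a list of pixel values in [0, 1], a one-hidden-layer attention network with ReLU, and a hypothetical `strength` parameter bounding the per-pixel change. The names `init_params`, `attention_weights`, and `embed` are invented for this example.

```python
import math
import random


def relu(x):
    """Rectified linear unit: passes positive values, zeroes out the rest."""
    return max(0.0, x)


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def init_params(num_bits, hidden, seed=0):
    """Random (untrained) weights; a real model would learn these."""
    rng = random.Random(seed)
    weights_hidden = [(rng.uniform(-1, 1), rng.uniform(-1, 1)) for _ in range(hidden)]
    weights_out = [[rng.uniform(-1, 1) for _ in range(hidden)] for _ in range(num_bits)]
    return weights_hidden, weights_out


def attention_weights(pixel, weights_hidden, weights_out):
    """Per-pixel attention: a distribution over the watermark bits."""
    hidden = [relu(w * pixel + b) for w, b in weights_hidden]
    scores = [sum(h * w for h, w in zip(hidden, row)) for row in weights_out]
    return softmax(scores)


def embed(frame, bits, params, strength=0.01):
    """Add an attention-weighted mix of the watermark bits to each pixel."""
    signed = [2 * b - 1 for b in bits]  # map {0, 1} to {-1, +1}
    marked = []
    for pixel in frame:
        attn = attention_weights(pixel, *params)
        attended = sum(a * s for a, s in zip(attn, signed))  # lies in [-1, 1]
        marked.append(min(1.0, max(0.0, pixel + strength * attended)))
    return marked


frame = [0.0, 0.25, 0.5, 1.0]  # hypothetical 4-pixel frame
bits = [1, 0, 1, 1]            # hypothetical 4-bit watermark
params = init_params(num_bits=len(bits), hidden=3)
marked = embed(frame, bits, params)
```

Because the attention weights form a convex combination of values in {-1, +1}, each pixel moves by at most `strength`, which is what keeps the watermark visually unobtrusive in this toy setting. A trained system would additionally need a decoder that recovers the bits from the marked frames.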

Results: The proposed attention-generating approach was validated with multiple activation functions and learning rates. It achieved 99.7% accuracy in embedding watermarks into the frames of the video. After the attention model was generated, the generative adversarial network was trained using DeepFaceLab 2.0, and deepfake attack prevention was tested on watermark-embedded videos comprising 8,074 frames from different benchmark datasets. The proposed approach achieved a 100% success rate in preventing deepfake attacks. Our code is available at https://github.com/shahidmuneer/deepfakes-watermarking-technique.
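
One common way to score watermark embedding is bit accuracy: the fraction of embedded bits that are recovered unchanged. Whether the 99.7% figure above uses exactly this definition is an assumption; the abstract does not say. The sketch below shows the metric with a hypothetical `bit_accuracy` helper and made-up bit strings.

```python
def bit_accuracy(embedded_bits, recovered_bits):
    """Fraction of positions where the recovered bit matches the embedded bit."""
    if len(embedded_bits) != len(recovered_bits):
        raise ValueError("bit sequences must have the same length")
    if not embedded_bits:
        raise ValueError("bit sequences must not be empty")
    matches = sum(1 for a, b in zip(embedded_bits, recovered_bits) if a == b)
    return matches / len(embedded_bits)


# Hypothetical example: one of four bits is flipped during recovery.
accuracy = bit_accuracy([1, 0, 1, 1], [1, 0, 0, 1])
```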

Keywords: CNN; Deep learning; Deepfake; Encryption; GANs; Prevention; Steganographic; Watermark.

Grants and funding

This work was supported by the Offices of Research, Innovation, and Commercialization (ORIC), Bahria University, Islamabad, Pakistan. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.