A New Deep Learning-Based Methodology for Video Deepfake Detection Using XGBoost

Aya Ismail; Marwa Elpeltagy; Mervat S Zaki; Kamal Eldahshan

doi:10.3390/s21165413

A New Deep Learning-Based Methodology for Video Deepfake Detection Using XGBoost

Sensors (Basel). 2021 Aug 10;21(16):5413. doi: 10.3390/s21165413.

Authors

Aya Ismail¹, Marwa Elpeltagy², Mervat S Zaki³, Kamal Eldahshan³

Affiliations

¹ Mathematics Department, Tanta University, Tanta 31511, Egypt.
² Systems and Computers Department, Al-Azhar University, Cairo 11884, Egypt.
³ Mathematics Department, Al-Azhar University (Girls Branch), Cairo 11884, Egypt.

Abstract

Currently, face-swapping deepfake techniques are widely spread, generating a significant number of highly realistic fake videos that threaten the privacy of people and countries. Due to their devastating impacts on the world, distinguishing between real and deepfake videos has become a fundamental issue. This paper presents a new deepfake detection method: you only look once-convolutional neural network-extreme gradient boosting (YOLO-CNN-XGBoost). The YOLO face detector is employed to extract the face area from video frames, while the InceptionResNetV2 CNN is utilized to extract features from these faces. These features are fed into the XGBoost that works as a recognizer on the top level of the CNN network. The proposed method achieves 90.62% of an area under the receiver operating characteristic curve (AUC), 90.73% accuracy, 93.53% specificity, 85.39% sensitivity, 85.39% recall, 87.36% precision, and 86.36% F1-measure on the CelebDF-FaceForencics++ (c23) merged dataset. The experimental study confirms the superiority of the presented method as compared to the state-of-the-art methods.

Keywords: XGBoost; YOLO; convolutional neural network; deepfake; face detector; fake video detection.

MeSH terms

Deep Learning*
Humans
Neural Networks, Computer
ROC Curve