Blind source computer device identification from recorded VoIP calls for forensic investigation

Forensic Sci Int. 2017 Mar:272:111-126. doi: 10.1016/j.forsciint.2017.01.010. Epub 2017 Jan 17.

Abstract

VoIP services provide fertile ground for criminal activity, so identifying the transmitting computer devices from recorded VoIP calls may help the forensic investigator to reveal useful information. It also proves the authenticity of the call recording submitted to the court as evidence. This paper extended the previous study on the use of recorded VoIP calls for blind source computer device identification. Although the initial results were promising, the theoretical reasoning behind them is yet to be found. The study suggested computing the entropy of mel-frequency cepstrum coefficients (entropy-MFCC) from near-silent segments as an intrinsic feature set that captures the device response function arising from the tolerances in the electronic components of individual computer devices. By applying the supervised learning techniques of naïve Bayesian, linear logistic regression, neural networks and support vector machines to the entropy-MFCC features, state-of-the-art identification accuracy of nearly 99.9% was achieved on different sets of computer devices for both call recording and microphone recording scenarios. Furthermore, unsupervised learning techniques, including simple k-means, expectation-maximization and density-based spatial clustering of applications with noise (DBSCAN), provided promising results for the call recording dataset by assigning the majority of instances to their correct clusters.
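
The entropy-MFCC idea described above can be illustrated with a minimal sketch. The abstract does not give the paper's parameters, so everything specific here is an assumption made for illustration: the frame length and hop, the 20-filter mel bank, the 12 coefficients, the rule that treats the lowest-energy 20% of frames as "near-silent", and the 16-bin histogram used to estimate Shannon entropy. All function names are hypothetical, and the input is synthetic noise rather than a real call recording.

```python
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sr):
    # Triangular filters spaced evenly on the mel scale.
    mel_pts = np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_filters + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_pts) / sr).astype(int)
    fb = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(1, n_filters + 1):
        left, center, right = bins[i - 1], bins[i], bins[i + 1]
        for k in range(left, center):
            fb[i - 1, k] = (k - left) / max(center - left, 1)
        for k in range(center, right):
            fb[i - 1, k] = (right - k) / max(right - center, 1)
    return fb

def frame_signal(x, frame_len, hop):
    # Split the signal into overlapping frames, one frame per row.
    n_frames = 1 + (len(x) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    return x[idx]

def near_silent_mask(frames, quantile=0.2):
    # Assumption: "near-silent" means the lowest-energy fraction of frames.
    energy = np.sum(frames ** 2, axis=1)
    return energy <= np.quantile(energy, quantile)

def mfcc(frames, sr, n_filters=20, n_coeffs=12):
    n_fft = frames.shape[1]
    windowed = frames * np.hamming(n_fft)
    power = np.abs(np.fft.rfft(windowed, axis=1)) ** 2 / n_fft
    log_mel = np.log(power @ mel_filterbank(n_filters, n_fft, sr).T + 1e-12)
    # Unnormalized DCT-II of the log mel energies gives the cepstral coefficients.
    basis = np.cos(np.pi * np.outer(np.arange(n_coeffs),
                                    np.arange(n_filters) + 0.5) / n_filters)
    return log_mel @ basis.T

def shannon_entropy(values, n_bins=16):
    # Histogram-based entropy estimate in bits.
    counts, _ = np.histogram(values, bins=n_bins)
    p = counts[counts > 0] / counts.sum()
    return float(-np.sum(p * np.log2(p)))

def entropy_mfcc(x, sr, frame_len=256, hop=128):
    # One entropy value per MFCC coefficient, computed over near-silent frames.
    frames = frame_signal(np.asarray(x, dtype=float), frame_len, hop)
    quiet = frames[near_silent_mask(frames)]
    coeffs = mfcc(quiet, sr)
    return np.array([shannon_entropy(coeffs[:, j]) for j in range(coeffs.shape[1])])

# Synthetic stand-in for a recording: a low noise floor with one louder tone burst.
rng = np.random.default_rng(0)
sr = 8000
signal = 0.001 * rng.standard_normal(2 * sr)
signal[4000:8000] += 0.5 * np.sin(2 * np.pi * 440 * np.arange(4000) / sr)
features = entropy_mfcc(signal, sr)
print(features.shape)  # (12,)
```

Each recording is reduced to a fixed-length vector, which is the form expected by the supervised and unsupervised learners named in the abstract. Restricting the computation to quiet frames is meant to keep speech content out of the features so that device noise dominates; whether this simple energy threshold matches the paper's segment selection is not stated in the abstract.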

Keywords: Audio acoustic features; Audio forensics; Audio source device attribution; Forensic categorization of digital devices.

MeSH terms

  • Algorithms
  • Computer Communication Networks*
  • Forensic Sciences
  • Humans
  • Logistic Models
  • Machine Learning
  • Neural Networks, Computer
  • Phonation*
  • Support Vector Machine