Investigation into the performance of different models for predicting stutter

Jo-Anne Bright; James M Curran; John S Buckleton

doi:10.1016/j.fsigen.2013.04.008

Investigation into the performance of different models for predicting stutter

Forensic Sci Int Genet. 2013 Jul;7(4):422-7. doi: 10.1016/j.fsigen.2013.04.008. Epub 2013 May 21.

Authors

Jo-Anne Bright¹, James M Curran, John S Buckleton

Affiliation

¹ ESR, Private Bag 92021, Auckland 1025, New Zealand. Jo.bright@esr.cri.nz

PMID: 23768314
DOI: 10.1016/j.fsigen.2013.04.008

Abstract

In this paper we have examined five possible models for the behaviour of the stutter ratio, SR. These were two log-normal models, two gamma models, and a two-component normal mixture model. A two-component normal mixture model was chosen with different behaviours of variance; at each locus SR was described with two distributions, both with the same mean. The distributions have difference variances: one for the majority of the observations and a second for the less well-behaved ones. We apply each model to a set of known single source Identifiler™, NGM SElect™ and PowerPlex(®) 21 DNA profiles to show the applicability of our findings to different data sets. SR determined from the single source profiles were compared to the calculated SR after application of the models. The model performance was tested by calculating the log-likelihoods and comparing the difference in Akaike information criterion (AIC). The two-component normal mixture model systematically outperformed all others, despite the increase in the number of parameters. This model, as well as performing well statistically, has intuitive appeal for forensic biologists and could be implemented in an expert system with a continuous method for DNA interpretation.

Publication types

Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Humans
Likelihood Functions
Models, Genetic*
Stuttering / genetics*