Time-domain event detection using single-instruction, multiple-thread gpGPU architectures in single-molecule biophysical data

Comput Phys Commun. 2024 Jul:300:109191. doi: 10.1016/j.cpc.2024.109191. Epub 2024 Mar 28.

Abstract

Discrete amplitude levels in ordered, time-domain data often represent different underlying latent states of the system that is being interrogated. Analysis and feature extraction from these data sets generally require considering the order of each individual point; this approach cannot take advantage of contemporary general-purpose graphics processing units (gpGPU) and single-instruction multiple-data (SIMD) instruction set architectures. Two sources of such data from single-molecule biological measurements are nanopores and single-molecule field effect transistor (smFET) nanotube devices; both generate streams of time-ordered current or voltage data, typically sampled near 1 MS/s, with run times of minutes, yielding terabyte-scale datasets. Here, we present three gpGPU-based algorithms to overcome limitations associated with serial event detection in time series data, resulting in a 250× improvement in the rate with which we can detect salient features in nanopore and smFET datasets. The code is freely available.