Improving Biomedical Signal Search Results in Big Data Case-Based Reasoning Environments

Pervasive Mob Comput. 2016 Jun:28:69-80. doi: 10.1016/j.pmcj.2015.09.006. Epub 2015 Oct 27.

Abstract

Time series subsequence matching has importance in a variety of areas in healthcare informatics. These include case-based diagnosis and treatment as well as discovery of trends among patients. However, few medical systems employ subsequence matching due to high computational and memory complexities. This manuscript proposes a randomized Monte Carlo sampling method to broaden search criteria with minimal increases in computational and memory complexities over R-NN indexing. Information gain improves while producing result sets that approximate the theoretical result space, query results increase by several orders of magnitude, and recall is improved with no signi cant degradation to precision over R-NN matching.

Keywords: Biomedical Signal Search; Case-Based Reasoning; Locality Sensitive Hashing; Monte Carlo Sampling; Time-Series Subsequence Matching.