Towards High Performance Data Analytic on Heterogeneous Many-core Systems: A Study on Bayesian Sequential Partitioning

J Parallel Distrib Comput. 2018 Dec:122:36-50. doi: 10.1016/j.jpdc.2018.07.011. Epub 2018 Jul 25.

Abstract

Bayesian Sequential Partitioning (BSP) is a statistically effective density estimation method to comprehend the characteristics of a high dimensional data space. The intensive computation of the statistical model and the counting of enormous data have caused serious design challenges for BSP to handle the growing volume of the data. This paper proposes a high performance design of BSP by leveraging a heterogeneous CPU/GPGPU system that consists of a host CPU and a K80 GPGPU. A series of techniques, on both data structures and execution management policies, is implemented to extensively exploit the computation capability of the heterogeneous many-core system and alleviate system bottlenecks. When compared with a parallel design on a high-end CPU, the proposed techniques achieve 48x average runtime enhancement while the maximum speedup can reach 78.76x.

Keywords: Data processing; design and optimization; heterogeneous system; many-core system; performance analysis.