Implementation and Evaluation of an Algorithm for Cryptographically Private Principal Component Analysis on Genomic Data

IEEE/ACM Trans Comput Biol Bioinform. 2018 Sep-Oct;15(5):1427-1432. doi: 10.1109/TCBB.2018.2858818. Epub 2018 Jul 23.

Abstract

We improve the quality of cryptographically privacy-preserving genome-wide association studies by correctly handling population stratification-the inherent genetic difference of patient groups, e.g., people with different ancestries. Our approach is to use principal component analysis to reduce the dimensionality of the problem so that we get less spurious correlations between traits of interest and certain positions in the genome. While this approach is commonplace in practical genomic analysis, it has not been used within a privacy-preserving setting. In this paper, we use cryptographically secure multi-party computation to tackle principal component analysis, and present an implementation and experimental results showing the performance of the approach.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Computer Security
  • Databases, Genetic*
  • Genetic Privacy*
  • Genomics / methods*
  • Principal Component Analysis / methods*