Classification of crystallization outcomes using deep convolutional neural networks

PLoS One. 2018 Jun 20;13(6):e0198883. doi: 10.1371/journal.pone.0198883. eCollection 2018.

Abstract

The Machine Recognition of Crystallization Outcomes (MARCO) initiative has assembled roughly half a million annotated images of macromolecular crystallization experiments from various sources and setups. Here, state-of-the-art machine learning algorithms are trained and tested on different parts of this data set. We find that more than 94% of the test images can be correctly labeled, irrespective of their experimental origin. Because crystal recognition is key to high-density screening and the systematic analysis of crystallization experiments, this approach opens the door to both industrial and fundamental research applications.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms
  • Crystallization*
  • Crystallography, X-Ray*
  • Datasets as Topic
  • Image Processing, Computer-Assisted*
  • Neural Networks, Computer*

Grants and funding

PC acknowledges support from National Science Foundation Grant no. NSF DMR-1749374. VV and DRS are employed by Google Inc., and SPW is employed by GlaxoSmithKline Inc. These funders provided support in the form of salaries for authors VV, DRS, and SPW, respectively, but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific roles of these authors are articulated in the ‘author contributions’ section. Additionally, GlaxoSmithKline provided data for the study which is freely available from the MARCO site (https://marco.ccr.buffalo.edu/).