New Python-based methods for data processing

Acta Crystallogr D Biol Crystallogr. 2013 Jul;69(Pt 7):1274-82. doi: 10.1107/S0907444913000863. Epub 2013 Jun 18.

Abstract

Current pixel-array detectors produce diffraction images at extreme data rates (of up to 2 TB h(-1)) that make severe demands on computational resources. New multiprocessing frameworks are required to achieve rapid data analysis, as it is important to be able to inspect the data quickly in order to guide the experiment in real time. By utilizing readily available web-serving tools that interact with the Python scripting language, it was possible to implement a high-throughput Bragg-spot analyzer (cctbx.spotfinder) that is presently in use at numerous synchrotron-radiation beamlines. Similarly, Python interoperability enabled the production of a new data-reduction package (cctbx.xfel) for serial femtosecond crystallography experiments at the Linac Coherent Light Source (LCLS). Future data-reduction efforts will need to focus on specialized problems such as the treatment of diffraction spots on interleaved lattices arising from multi-crystal specimens. In these challenging cases, accurate modeling of close-lying Bragg spots could benefit from the high-performance computing capabilities of graphics-processing units.

Keywords: cctbx; data processing; multiprocessing; reusable code.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms
  • Crystallography, X-Ray*
  • Data Interpretation, Statistical*
  • Electronic Data Processing / methods*
  • Electrons
  • Humans
  • Lasers*
  • Muramidase / chemistry*
  • Software*
  • Synchrotrons / instrumentation*

Substances

  • Muramidase