Successes and Struggles with Computational Reproducibility: Lessons from the Fragile Families Challenge

Socius. 2019 Jan-Dec:5:10.1177/2378023119849803. doi: 10.1177/2378023119849803. Epub 2019 Sep 10.

Abstract

Reproducibility is fundamental to science, and an important component of reproducibility is computational reproducibility: the ability of a researcher to recreate the results of a published study using the original author's raw data and code. Although most people agree that computational reproducibility is important, it is still difficult to achieve in practice. In this article, the authors describe their approach to enabling computational reproducibility for the 12 articles in this special issue of Socius about the Fragile Families Challenge. The approach draws on two tools commonly used by professional software engineers but not widely used by academic researchers: software containers (e.g., Docker) and cloud computing (e.g., Amazon Web Services). These tools made it possible to standardize the computing environment around each submission, which will ease computational reproducibility both today and in the future. Drawing on their successes and struggles, the authors conclude with recommendations to researchers and journals.

Keywords: cloud computing; computational reproducibility; computational social science; open and reproducible research; software containers.