Ten simple rules for writing Dockerfiles for reproducible data scienceOpen Access

Nüst Daniel, Sochat Vanessa, Marwick Ben, Eglen Stephen J., Head Tim, Hirst Tony, Evans Benjamin D.

Forschungsartikel (Zeitschrift) | Peer reviewed

Zusammenfassung

Computational science has been greatly improved by the use of containers for packaging software and data dependencies. In a scholarly context, the main drivers for using these containers are transparency and support of reproducibility; in turn, a workflow's reproducibility can be greatly affected by the choices that are made with respect to building containers. In many cases, the build process for the container's image is created from instructions provided in a Dockerfile format. In support of this approach, we present a set of rules to help researchers write understandable Dockerfiles for typical data science workflows. By following the rules in this article, researchers can create containers suitable for sharing with fellow scientists, for including in scholarly communication such as education or scientific papers, and for effective and sustainable personal workflows.

Details zur Publikation

FachzeitschriftPLoS Computational Biology
Jahrgang / Bandnr. / Volume16
Ausgabe / Heftnr. / Issue11
StatusVeröffentlicht
Veröffentlichungsjahr2020 (10.11.2020)
Sprache, in der die Publikation verfasst istEnglisch
DOI10.1371/journal.pcbi.1008316
Link zum Volltexthttps://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1008316&type=printable
Stichwörterreproducible research; containerisation; Docker

Autor*innen der Universität Münster

Nüst, Daniel
Professur für Geoinformatik (Prof. Pebesma)

Projekte, aus denen die Publikation entstanden ist

Laufzeit: 01.03.2019 - 31.07.2021 | 2. Förderperiode
Gefördert durch: DFG - Wissenschaftliche Literaturversorgung und Informationssysteme
Art des Projekts: Gefördertes Einzelprojekt

Promotionen, aus denen die Publikation resultiert

Infrastructures and Practices for Reproducible Research in Geography, Geosciences, and GIScience
Promovend*in: Nüst, Daniel | Betreuer*innen: Pebesma, Edzer
Zeitraum: 04.01.2016 - 14.02.2022
Promotionsverfahren erfolgt(e) an: Promotionsverfahren an der Universität Münster