TY - JOUR
T1 - Facilitating bioinformatics reproducibility with QIIME 2 provenance Replay
AU - Keefe, Christopher R.
AU - Dillon, Matthew R.
AU - Gehret, Elizabeth
AU - Herman, Chloe
AU - Jewell, Mary
AU - Wood, Colin V.
AU - Bolyen, Evan
AU - Gregory Caporaso, J.
N1 - Publisher Copyright:
Copyright: © 2023 Keefe et al.
PY - 2023/11
Y1 - 2023/11
N2 - Study reproducibility is essential to corroborate, build on, and learn from the results of scientific research but is notoriously challenging in bioinformatics, which often involves large data sets and complex analytic workflows involving many different tools. Additionally, many biologists are not trained in how to effectively record their bioinformatics analysis steps to ensure reproducibility, so critical information is often missing. Software tools used in bioinformatics can automate provenance tracking of the results they generate, removing most barriers to bioinformatics reproducibility. Here we present an implementation of that idea, Provenance Replay, a tool for generating new executable code from results generated with the QIIME 2 bioinformatics platform, and discuss considerations for bioinformatics developers who wish to implement similar functionality in their software.
AB - Study reproducibility is essential to corroborate, build on, and learn from the results of scientific research but is notoriously challenging in bioinformatics, which often involves large data sets and complex analytic workflows involving many different tools. Additionally, many biologists are not trained in how to effectively record their bioinformatics analysis steps to ensure reproducibility, so critical information is often missing. Software tools used in bioinformatics can automate provenance tracking of the results they generate, removing most barriers to bioinformatics reproducibility. Here we present an implementation of that idea, Provenance Replay, a tool for generating new executable code from results generated with the QIIME 2 bioinformatics platform, and discuss considerations for bioinformatics developers who wish to implement similar functionality in their software.
UR - http://www.scopus.com/inward/record.url?scp=85179622447&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85179622447&partnerID=8YFLogxK
U2 - 10.1371/journal.pcbi.1011676
DO - 10.1371/journal.pcbi.1011676
M3 - Article
C2 - 38011287
AN - SCOPUS:85179622447
SN - 1553-734X
VL - 19
JO - PLoS Computational Biology
JF - PLoS Computational Biology
IS - 11
M1 - e1011676
ER -