DeBasher: a flow-based programming bash extension for the implementation of complex and interactive workflows with stateful processes

dc.contributor.author
Ortiz Martínez, Daniel
dc.date.accessioned
2026-02-20T05:54:16Z
dc.date.available
2026-02-20T05:54:16Z
dc.date.issued
2026-02-19T10:47:49Z
dc.date.issued
2026-02-19T10:47:49Z
dc.date.issued
2025-04-16
dc.date.issued
2026-02-19T10:47:51Z
dc.identifier
1471-2105
dc.identifier
https://hdl.handle.net/2445/227061
dc.identifier
765395
dc.identifier.uri
https://hdl.handle.net/2445/227061
dc.description.abstract
Background: Bioinformatics data analysis faces significant challenges. As data analysis often takes the form of pipelines or workflows, workflow managers (WfMs) have become essential. Data flow programming constitutes the preferred approach in WfMs, enabling parallel processes activated reactively based on input availability. While this paradigm typically follows a linear, acyclic progression, cyclic workflows are sometimes necessary in bioinformatics analyses. These cyclic workflows also present an oppor‑ tunity to explore workflow interactivity, a feature not widely implemented in existing WfMs. Results: We propose DeBasher, a tool that adopts the flow‑based programming (FBP) paradigm, in which the workflow components are in control of their life cycle and can store state information, allowing the execution of complex workflows that include cycles. DeBasher also incorporates a powerful model of interactivity, where the user can alter the behavior of a running workflow. Additionally, DeBasher allows the user to define triggers so as to initiate the execution of a complete workflow or a part of it. The ability to execute processes with state and in control of their life cycle also has applications in dynamic scheduling tasks. Furthermore, DeBasher presents a series of extra features, including the combination of multiple workflows at runt‑ ime through a feature we have called runtime piping, switching to static scheduling to increase scalability, or implementing processes in multiple languages. DeBasher has been successfully used to process 131.7 TB of genomic data by means of a variant calling pipeline. Conclusions: DeBasher is an FBP Bash extension that can be useful in a wide range of situations and in particular when implementing complex workflows, workflows with interactivity or triggers, or when a high scalability is required. Keywords: Workflow, Pipeline, Workflow manager, Data flow programming, Flow‑ based programming, Bash, Workflow interactivity, Workflow triggers
dc.format
25 p.
dc.format
application/pdf
dc.language
eng
dc.publisher
BioMed Central
dc.relation
Reproducció del document publicat a: https://doi.org/10.1186/s12859-025-06108-1
dc.relation
BMC Bioinformatics, 2025, vol. 26
dc.relation
https://doi.org/10.1186/s12859-025-06108-1
dc.rights
cc-by (c) Daniel Ortiz-Martínez, 2025
dc.rights
http://creativecommons.org/licenses/by/4.0/
dc.rights
info:eu-repo/semantics/openAccess
dc.subject
Automatització
dc.subject
Processament de dades
dc.subject
Bioinformàtica
dc.subject
Automation
dc.subject
Data processing
dc.subject
Bioinformatics
dc.title
DeBasher: a flow-based programming bash extension for the implementation of complex and interactive workflows with stateful processes
dc.type
info:eu-repo/semantics/article
dc.type
info:eu-repo/semantics/publishedVersion


Fitxers en aquest element

FitxersGrandàriaFormatVisualització

No hi ha fitxers associats a aquest element.

Aquest element apareix en la col·lecció o col·leccions següent(s)