COSMOS: Python library for massively parallel workflows
Open Access
- 30 June 2014
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 30 (20), 2956-2958
- https://doi.org/10.1093/bioinformatics/btu385
Abstract
Summary: Efficient workflows to shepherd clinically generated genomic data through the multiple stages of a next-generation sequencing pipeline are of critical importance in translational biomedical science. Here we present COSMOS, a Python library for workflow management that allows formal description of pipelines and partitioning of jobs. In addition, it includes a user interface for tracking the progress of jobs, abstraction of the queuing system and fine-grained control over the workflow. Workflows can be created on traditional computing clusters as well as cloud-based services. Availability and implementation: Source code is available for academic non-commercial research purposes. Links to code and documentation are provided at http://lpm.hms.harvard.edu and http://wall-lab.stanford.edu . Contact:dpwall@stanford.edu or peter_tonellato@hms.harvard.edu . Supplementary information : Supplementary data are available at Bioinformatics online.This publication has 6 references indexed in Scilit:
- The Taverna workflow suite: designing and executing workflows of Web Services on the desktop, web or in the cloudNucleic Acids Research, 2013
- Snakemake—a scalable bioinformatics workflow engineBioinformatics, 2012
- Bpipe: a tool for running and managing bioinformatics pipelinesBioinformatics, 2012
- Ruffus: a lightweight Python library for computational pipelinesBioinformatics, 2010
- Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciencesGenome Biology, 2010
- Standardization of an API for Distributed Resource Management SystemsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2007