Workflows#
Our workflows are written in such a way that they depend on available docker images. Therefor execution of our workflows do not need a lot dependencies even though there are many different tools used but rely (heavily) on docker and publically available images. We either use already available community images or we create and host them ourself when needed.
Requirements:
Docker
A cwl-runner (we use cwltool)
UNLOCK CWL repository: m-unlock/cwl/
Installation and usage
Short introduction on how setup and use these workflows:
Setup section.
These workflows are also published on the WorkflowHub: https://workflowhub.eu/projects/16#workflows. On here you can see in more detail the in- and outputs and the steps involved.
Some are summarized here.
Workflow Metagenomics Assembly#
Workflow for assembly from Illumina reads and/or longreads. Customizable to a certain extend on which steps to run. (can also be used for isolates)
Main steps involved:
Workflow Illumina Quality
Workflow Longread Quality
Assembly: SPAdes / Flye
Short read polishing (Pilon)
ONT read polishing (Medaka)
QUAST (Assembly quality report)
Workflow Metagenomics Binning
Worklow Metagenomics GEM
Workflow Illumina Quality#
FastQC quality plots (before and after filtering)
fastp quality filtering
Reference/contamination filtering (mapped or unmapped)
Kraken2 taxonomic read classification (before and after)
PhiX removal
rRNA removal
Workflow Longreads Quality#
NanoPlot Quality plot and reports (before and after filtering)
Filtlong Longreads quality filtering
Reference/contamination filtering (mapped or unmapped)
Kraken2 taxonomic read classification
Workflow Metagenomics Binning#
Metabat2 / MaxBin2 / SemiBin binning
DAS Tool bin refinement
EukRep (eukaryotic classification)
CheckM bin quality
BUSCO bin quality
GTDB-Tk bin taxonomic classification
Workflow Metagenomic GEM#
!! Important caveat with this workflow:
The CarveMe, MEMOTA and SMETANA Docker container images that we use have the licenced CPLEX Optimizer build in. Therefor we can not make these images public unfortunately. This means this workflow will not work out-of-the-box. We did however make the Docker Build files available here
Prodigal protein prediction
CarveMe GEnome-scale Metabolic model reconstruction
MEMOTE for metabolic model testing
SMETANA Species METabolic interaction ANAlysis