Get Started

Please install nextflow Nextflow and either Singularity or Docker before.

For installing Nextflow you need a POSIX compatible system (Linux, OS X, etc). It requires Bash 3.2 (or later) and Java 11 (or later, up to 17). Windows system is supported through WSL. For the installation of Nextflow just run:

curl -s https://get.nextflow.io | bash

To install the pipeline you need to download the repo:

git clone --depth 1 --recurse-submodules https://github.com/biocorecrg/MoP3.git

Installing Guppy

You can use INSTALL.sh and the version of Guppy you want to download.

Note

Please consider that the support of VBZ compression of fast5 started with version 3.4.X.

cd MoP3; bash INSTALL.sh 6.0.1

or for installing the default 3.4.5

cd MoP3; bash INSTALL.sh

Guppy custom models for RNA basecalling will be downloaded from our repository https://biocore.crg.eu/public/mop3_pub/models.tar and placed automatically within the right path inside the pipeline.

You can install different versions of Guppy but only one will be run during the pipeline execution. For switching among them you need to run INSTALL.sh with the version you prefer.

Testing

cd mop_preprocess

nextflow run mop_preprocess.nf -params-file params.f5.yaml -with-singularity -bg -profile local > log

Tip

You can replace `-with-singularity` with `-with-docker` if you want to use the docker engine.

Profiles

Some nextflow configuration files are stored within the folder conf and can be selected using different profiles. Currently, we have:

  • ci: for continuous integration testing (low resources)

  • local: for being used in a laptop without GPU support

  • m1mac: for running the containers in emulation for being used on M1/M2/M3 Apple processors.

  • sge: for being used in an HPC with Sun Grid Engine

  • cluster or crg: for being used in the custom HPC environment at CRG

  • slurm: for being used in an HPC with SLURM

  • awsbatch: for being used in Amazon AWS cloud infrastructure