Quickstart

This quickstart lets you run a subset of the tools implemented by the Toolkit on a single machine to process two datasets. The tutorial has mainly been tested on a machine with 29 GB of RAM and 14 cores running Ubuntu. You will need at least 250 GB of disk space. The disk where your Docker images and containers are created must be at least 17 GB.
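Before starting, it can help to compare your machine against the figures above. The following is a minimal sketch, assuming a Linux machine with standard `df`, `awk`, and `nproc` available; the helper name `avail_gb` and the variable `need_disk_gb` are illustrative and not part of the Toolkit. The RAM and core counts describe the tested machine, so they are only printed, not enforced.

```shell
#!/usr/bin/env bash
# Disk space figure taken from the text above.
need_disk_gb=250

# Print the free space (in whole GB) of the filesystem holding the given path.
# df -P guarantees one line per filesystem; column 4 is available 1K-blocks.
avail_gb() {
    df -Pk "$1" | awk 'NR==2 {print int($4 / 1024 / 1024)}'
}

have_gb=$(avail_gb .)
if [ "$have_gb" -ge "$need_disk_gb" ]; then
    echo "disk ok: ${have_gb} GB available in $(pwd)"
else
    echo "disk low: ${have_gb} GB available, ${need_disk_gb} GB needed"
fi

# Informational only: the tutorial was tested with 14 cores and 29 GB of RAM.
echo "cores: $(nproc)"
if [ -r /proc/meminfo ]; then
    awk '/^MemTotal:/ {print "ram: " int($2 / 1024 / 1024) " GB"}' /proc/meminfo
fi
```

Run it from the directory in which you plan to start the Toolkit, since the databases and outputs are written relative to the current working directory.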

Requirements

Docker: Install Docker by following the official Docker installation instructions.
Java: Nextflow requires Java. On Ubuntu, you can install it with sudo apt install default-jre.
Nextflow: Install Nextflow by following the official Nextflow installation instructions.
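To confirm that the three requirements are in place, a quick check of the PATH is usually enough. This is a minimal sketch; the function name `check_cmd` is hypothetical, and the script only tests that each command can be found, not that its version is suitable.

```shell
#!/usr/bin/env bash
# Report whether a command is available on the PATH.
check_cmd() {
    if command -v "$1" >/dev/null 2>&1; then
        echo "found: $1"
    else
        echo "missing: $1"
        return 1
    fi
}

# Count how many of the required tools are missing.
missing=0
for tool in docker java nextflow; do
    check_cmd "$tool" || missing=$((missing + 1))
done
echo "missing tools: $missing"
```

If a tool is reported as missing, install it as described above and open a new shell so that the updated PATH takes effect.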

Run the Toolkit

The following command starts a subset of the modules offered by the Toolkit. All databases will be downloaded to the databases directory in your current working directory.

NXF_VER=24.10.4 nextflow run metagenomics/metagenomics-tk \
      -profile standard \
      -entry wFullPipeline \
      -params-file https://raw.githubusercontent.com/metagenomics/metagenomics-tk/refs/heads/master/default/quickstart.yml \
      --logDir logs \
      --s3SignIn false \
      --scratch false \
      --output output \
      --databases "$(pwd)/databases" \
      --input.paired.path https://raw.githubusercontent.com/metagenomics/metagenomics-tk/refs/heads/master/test_data/fullPipeline/quickstart.tsv

You can read more about the outputs, which are placed in a directory named output, in the corresponding Modules sections.
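Once the run has finished, you may want a quick overview of what was written. The sketch below lists the directories up to two levels below the output directory. The helper name `list_outputs` is illustrative, and the actual directory layout is described in the Modules sections, so no particular structure is assumed here.

```shell
#!/usr/bin/env bash
# List directories up to two levels below an output directory (default: output).
list_outputs() {
    dir="${1:-output}"
    if [ -d "$dir" ]; then
        find "$dir" -mindepth 1 -maxdepth 2 -type d | sort
    else
        echo "no such directory: $dir" >&2
        return 1
    fi
}

# Do not abort if the run has not produced any output yet.
list_outputs output || true
```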
