Parabricks
Introduction
NVIDIA’s Clara Parabricks brings next generation sequencing to GPUs, accelerating an array of gold-standard tooling such as BWA-MEM, GATK4, Google’s DeepVariant, and many more. Users can achieve a 30-60x acceleration and 99.99% accuracy for variant calling when comparing against CPU-only BWA-GATK4 pipelines, meaning a single server can process up to 60 whole genomes per day. These tools can be easily integrated into current pipelines with drop-in replacement commands to quickly bring speed and data-center scale to a range of applications including germline, somatic and RNA workflows.
Note
Please follow the recommended citation guidelines from the developers when you use the tool in research.
Versions
Cluster |
Version(s) |
|---|---|
ANVIL |
4.0.0-1 |
GILBRETH |
4.0.0-1 |
SCHOLAR |
4.0.0-1 |
Commands
pbrun
Module
You can load the modules by:
module load biocontainers
module load parabricks
Example job
Warning
Using #!/bin/sh -l as shebang in the slurm job script will cause the failure of some biocontainer modules. Please use #!/bin/bash instead.
To run parabricks on our clusters:
#!/bin/bash
#SBATCH -A myallocation # Allocation name
#SBATCH -p wholenode # Partition name
#SBATCH -t 1:00:00
#SBATCH -N 1
#SBATCH -n 1
#SBATCH --job-name=parabricks
#SBATCH --mail-type=FAIL,BEGIN,END
#SBATCH --error=%x-%J-%u.err
#SBATCH --output=%x-%J-%u.out
module --force purge
module biocontainers parabricks
# Your parabricks workflow...
#!/bin/bash
#SBATCH -A mygroup # Group name
#SBATCH -p cpu # Partition name
#SBATCH -q normal # QOS name (optional)
#SBATCH -t 1:00:00
#SBATCH -N 1
#SBATCH -n 1
#SBATCH --job-name=parabricks
#SBATCH --mail-type=FAIL,BEGIN,END
#SBATCH --error=%x-%J-%u.err
#SBATCH --output=%x-%J-%u.out
module --force purge
module biocontainers parabricks
# Your parabricks workflow...
#!/bin/bash
#SBATCH -A mygroup # Group name
#SBATCH -p ai # Partition name
#SBATCH --gres=gpu:1 # Number of GPUs
#SBATCH -q normal # QOS name (optional)
#SBATCH -t 1:00:00
#SBATCH -N 1
#SBATCH -n 1
#SBATCH --job-name=parabricks
#SBATCH --mail-type=FAIL,BEGIN,END
#SBATCH --error=%x-%J-%u.err
#SBATCH --output=%x-%J-%u.out
module --force purge
module biocontainers parabricks
# Your parabricks workflow...
#!/bin/bash
#SBATCH -A mygroup # Group name
#SBATCH -p a100 # Partition name
#SBATCH --gres=gpu:1 # Number of GPUs
#SBATCH --mem=2G # Memory
#SBATCH -q normal # QOS name (optional)
#SBATCH -t 1:00:00
#SBATCH -N 1
#SBATCH -n 1
#SBATCH --job-name=parabricks
#SBATCH --mail-type=FAIL,BEGIN,END
#SBATCH --error=%x-%J-%u.err
#SBATCH --output=%x-%J-%u.out
module --force purge
module biocontainers parabricks
# Your parabricks workflow...
#!/bin/bash
#SBATCH -A queue # Queue name
#SBATCH -t 1:00:00
#SBATCH -N 1
#SBATCH -n 1
#SBATCH --job-name=parabricks
#SBATCH --mail-type=FAIL,BEGIN,END
#SBATCH --error=%x-%J-%u.err
#SBATCH --output=%x-%J-%u.out
module --force purge
module biocontainers parabricks
# Your parabricks workflow...