/category/bio
Cell Ranger and UVA HPC
Description A set of analysis piplines that perform sample demultiplexing, barcode processing, and single cell 3’ gene counting.
Software Category: bio
For detailed information, visit the Cell Ranger
website.
Available Versions To find the available versions and learn how to load them, run:
module spider cellranger The output of the command shows the available Cell Ranger
module versions.
For detailed information about a particular Cell Ranger
module, including how to load the module, run the module spider command with the module’s full version label. For example:
module spider cellranger/8.0.0 ModuleVersion Module Load Command cellranger8.0.0 module load cellranger/8.0.0
Cell Ranger ATAC and UVA HPC
Description Cell Ranger ATAC is a set of analysis pipelines that process
Chromium Single Cell ATAC data.
Software Category: bio
For detailed information, visit the Cell Ranger ATAC
website.
Available Versions To find the available versions and learn how to load them, run:
module spider cellranger-atac The output of the command shows the available Cell Ranger ATAC
module versions.
For detailed information about a particular Cell Ranger ATAC
module, including how to load the module, run the module spider command with the module’s full version label. For example:
module spider cellranger-atac/2.0.0 ModuleVersion Module Load Command cellranger-atac2.0.0 module load cellranger-atac/2.0.0
CellProfiler and UVA HPC
Description CellProfiler is a free open-source software designed to enable biologists without training in computer vision or programming to quantitatively measure phenotypes from thousands of images automatically.
Software Category: bio
For detailed information, visit the CellProfiler
website.
Available Versions The current installation of CellProfiler
incorporates the most popular packages. To find the available versions and learn how to load them, run:
module spider cellprofiler The output of the command shows the available CellProfiler
module versions.
For detailed information about a particular CellProfiler
module, including how to load the module, run the module spider command with the module’s full version label. For example:
DeepLabCut and UVA HPC
Description DeepLabCut is a toolbox for markerless pose estimation of animals performing various tasks.
Software Category: bio
For detailed information, visit the DeepLabCut
website.
Available Versions To find the available versions and learn how to load them, run:
module spider deeplabcut The output of the command shows the available DeepLabCut
module versions.
For detailed information about a particular DeepLabCut
module, including how to load the module, run the module spider command with the module’s full version label. For example:
module spider deeplabcut/3.0.0rc4 ModuleVersion Module Load Command deeplabcut3.0.0rc4 module load apptainer/1.2.2 deeplabcut/3.0.0rc4 Dockerfile We cannot use the official Docker image on the UVA HPC system because:
Snakemake and UVA HPC
Description The Snakemake workflow management system is a tool to create reproducible and scalable data analyses. Software Category: tools For detailed information, visit the Snakemake
website.
Available Versions The current installation of Snakemake
incorporates the most popular packages. To find the available versions and learn how to load them, run:
module spider snakemake The output of the command shows the available Snakemake
module versions.
For detailed information about a particular Snakemake
module, including how to load the module, run the module spider command with the module’s full version label. For example:
module spider snakemake/7.24.2 ModuleVersion Module Load Command snakemake7.24.2 module load snakemake/7.
Workflow Managers and UVA HPC
Workflow managers are used to create reproducible and scalable analysis pipelines. These managers are useful when you have a series of scripts that you want to tie together in the form of a pipeline.
The most popular workflow managers on the HPC system are listed below:
Snakemake Snakemake is a workflow management system written in Python. It integrates with both conda environments and singularities. Cromwell Cromwell is a Workflow Management System geared towards scientific workflows Nextflow Nextflow enables scalable and reproducible scientific workflows using software containers
Samtools and UVA HPC
Description SAM Tools provide various utilities for manipulating alignments in the SAM format,
including sorting, merging, indexing and generating alignments in a per-position format.
Software Category: bio
For detailed information, visit the Samtools
website.
Available Versions To find the available versions and learn how to load them, run:
module spider samtools The output of the command shows the available Samtools
module versions.
For detailed information about a particular Samtools
module, including how to load the module, run the module spider command with the module’s full version label. For example:
module spider samtools/1.17 ModuleVersion Module Load Command samtools1.17 module load samtools/1.17 Build Your Own Version Users may build their own versions of Samtools
Amber and UVA HPC
Description Amber (originally Assisted Model Building with Energy Refinement) is software for performing
molecular dynamics and structure prediction.
Software Category: chem
For detailed information, visit the Amber
website
Local support is not available.
Available Versions To find the available versions and learn how to load them, run:
module spider amber The output of the command shows the available Amber
module versions.
For detailed information about a particular Amber
module, including how to load the module, run the module spider command with the module’s full version label. For example:
module spider amber/22.0 ModuleVersion Module Load Command amber22.0 module load gcc/11.4.0 openmpi/4.1.4 amber/22.
Bowtie2 and UVA HPC
Description Bowtie 2 is an ultrafast and memory-efficient tool for aligning sequencing reads
to long reference sequences. It is particularly good at aligning reads of about 50 up to 100s or 1,000s
of characters, and particularly good at aligning to relatively long (e.g. mammalian) genomes.
Bowtie 2 indexes the genome with an FM Index to keep its memory footprint small: for the human genome,
its memory footprint is typically around 3.2 GB. Bowtie 2 supports gapped, local, and paired-end alignment modes.
Software Category: bio
For detailed information, visit the Bowtie2
website.
Available Versions The current installation of Bowtie2
incorporates the most popular packages.
BWA and UVA HPC
Description Burrows-Wheeler Aligner (BWA) is an efficient program that aligns
relatively short nucleotide sequences against a long reference sequence such as the human genome.
BWA provides three alignment algorithms:
BWA-backtrack BWA-SW BWA-MEM The BWA-backtrack algorithm is exclusively used for short sequence reads up to 100bp, the latter two can be used for sequence reads of up to 1MB. The BWA-MEM algorithm can also be used for high-quality short Illumina sequence reads (< 100bp) in many cases with better performance compared to the original BWA-backtrack algorithm. Therefore, the more universal BWA-MEM algorithm is recommended as a starting point for most alignment scenarios.
Cromwell and UVA HPC
Description Cromwell is a Workflow Management System geared towards scientific workflows.
Software Category: tools For detailed information, visit the Cromwell
website.
Available Versions The current installation of Cromwell
incorporates the most popular packages. To find the available versions and learn how to load them, run:
module spider cromwell The output of the command shows the available Cromwell
module versions.
For detailed information about a particular Cromwell
module, including how to load the module, run the module spider command with the module’s full version label. For example:
module spider cromwell/30.1 ModuleVersion Module Load Command cromwell30.1 module load cromwell/30.1 The Backend Configuration In order to allow Cromwell to interact with the HPC system (via SLURM), we need to define a backend to dispatch jobs.
GATK and UVA HPC
Description The Genome Analysis Toolkit or GATK is a software package developed at the Broad Institute
to analyse next-generation resequencing data. The toolkit offers a wide variety of tools,
with a primary focus on variant discovery and genotyping as well as strong emphasis on
data quality assurance. Its robust architecture, powerful processing engine and
high-performance computing features make it capable of taking on projects of any size.
Software Category: bio
For detailed information, visit the GATK
website.
For a GitHub reference, visit: https://github.com/broadinstitute/gatk
Available Versions The current installation of GATK
incorporates the most popular packages. To find the available versions and learn how to load them, run:
NCBI Blast and UVA HPC
Description Basic Local Alignment Search Tool, or BLAST, is an algorithm
for comparing primary biological sequence information, such as the amino-acid
sequences of different proteins or the nucleotides of DNA sequences.
Software Category: bio
For detailed information, visit the NCBI Blast
website.
Available Versions The current installation of NCBI Blast
incorporates the most popular packages. To find the available versions and learn how to load them, run:
module spider blast The output of the command shows the available NCBI Blast
module versions.
For detailed information about a particular NCBI Blast
module, including how to load the module, run the module spider command with the module’s full version label.
Picard and UVA HPC
Description A set of tools (in Java) for working with next generation sequencing data in the BAM format. Software Category: bio
For detailed information, visit the Picard
website.
Available Versions To find the available versions and learn how to load them, run:
module spider picard The output of the command shows the available Picard
module versions.
For detailed information about a particular Picard
module, including how to load the module, run the module spider command with the module’s full version label. For example:
module spider picard/2.27.5 ModuleVersion Module Load Command picard2.27.5 module load picard/2.27.5
SmrtLink and UVA HPC
Description PacBio’s open-source SMRT Analysis software suite is designed for use with Single Molecule,
Real-Time (SMRT) Sequencing data. You can analyze, visualize, and manage your data through an intuitive GUI
or command-line interface. You can also integrate SMRT Analysis in your existing data workflow through
the extensive set of APIs provided
Software Category: bio
For detailed information, visit the SmrtLink
website.
Available Versions To find the available versions and learn how to load them, run:
module spider smrtlink The output of the command shows the available SmrtLink
module versions.
For detailed information about a particular SmrtLink
module, including how to load the module, run the module spider command with the module’s full version label.
WDLTool and UVA HPC
Description A Java command-line tool co-developed with WDL that performs utility functions, including syntax validation and generation of input JSON templates.
Software Category: tools For detailed information, visit the WDLTool
website.
Available Versions To find the available versions and learn how to load them, run:
module spider wdltool The output of the command shows the available WDLTool
module versions.
For detailed information about a particular WDLTool
module, including how to load the module, run the module spider command with the module’s full version label. For example:
module spider wdltool/0.14 ModuleVersion Module Load Command wdltool0.14 module load wdltool/0.14