genome files, short read aligned indices etc. on both Tusker and
Crane.
In order to use these resources, the "**biodata**" module needs to be
loaded first.
For how to load module, please check [Module
Commands](Module-Commands_332464.html).
Loading the "**biodata**" module will pre-set many environment
variables, but most likely you will only need a subset of them.
Environment variables can be used in your command or script by prefixing
a **$** sign to the name.
Loading the "**biodata**" module will pre-set many environment variables, but most likely you will only need a subset of them. Environment variables can be used in your command or script by prefixing `$` to the name.
The major environment variables are:
**$DATA** - main directory
**$BLAST** - Directory containing all available BLAST (nucleotide and
protein) databases
**$BLAST** - Directory containing all available BLAST (nucleotide and protein) databases
**$KEGG** - KEGG database main entry point (requires license)
**$PANTHER** - PANTHER database main entry point (latest)
**$IPR** - InterProScan database main entry point (latest)
**$GENOMES** - Directory containing all available genomes (multiple
sources, builds possible
**$INDICES** - Directory containing indices for bowtie, bowtie2, bwa for
all available genomes
**$UNIPROT** - Directory containing latest release of full UniProt
database
In order to check what genomes are available, you can just type:
**Check available GENOMES**
``` syntaxhighlighter-pre
ls $GENOMES
```
**$GENOMES** - Directory containing all available genomes (multiple sources, builds possible
**$INDICES** - Directory containing indices for bowtie, bowtie2, bwa for all available genomes
**$UNIPROT** - Directory containing latest release of full UniProt database
\\
\\
\\
In order to check what genomes are available, you can type:
{{<highlightbash>}}
$ ls $GENOMES
{{</highlight>}}
\\
In order to check what BLAST databases are available, you can just type:
**Check available BLAST databases**
``` syntaxhighlighter-pre
ls $BLAST
```
An example of how to run Bowtie2 local alignment on Tusker utilizing the
default Horse, *Equus caballus* index (*BOWTIE2\_HORSE*) with paired-end
fasta files and 8 CPUs is shown below:
**bowtie2\_alignment.submit**
\#!/bin/sh
\#SBATCH --job-name=Bowtie2
\#SBATCH --nodes=1
\#SBATCH --ntasks-per-node=8
\#SBATCH --time=168:00:00
\#SBATCH --mem=50gb
\#SBATCH --output=Bowtie2.%J.out
\#SBATCH --error=Bowtie2.%J.err
module load biodata/1.0
{{<highlightbash>}}
$ ls $BLAST
{{</highlight>}}
\\
An example of how to run Bowtie2 local alignment on Crane utilizing the default Horse, *Equus caballus* index (*BOWTIE2\_HORSE*) with paired-end fasta files and 8 CPUs is shown below:
The organisms and their appropriate environmental variables for all genomes and chromosome files, as well as for short read aligned indices are shown on the link below: