diff --git a/content/applications/app_specific/bioinformatics_tools/data_manipulation_tools/sratoolkit.md b/content/applications/app_specific/bioinformatics_tools/data_manipulation_tools/sratoolkit.md index 7efbaf2b1b322a5f96ac2d4486f7feaef3169862..7c3697e1361c56203c61a14771017875c356ffc5 100644 --- a/content/applications/app_specific/bioinformatics_tools/data_manipulation_tools/sratoolkit.md +++ b/content/applications/app_specific/bioinformatics_tools/data_manipulation_tools/sratoolkit.md @@ -44,6 +44,15 @@ fastq-dump --split-files input_reads.sra {{% /panel %}} This script outputs two fastq paired end reads `input_reads_1.fastq` and `input_reads_2.fastq`. + +To download `bam` files from NCBI using the SRA identification, the following commands can be used: +{{< highlight bash >}} +$ module load SRAtoolkit/2.11 samtools +$ sam-dump <sra_id> | samtools view -bS - > <sra_id>.bam +{{< /highlight >}} +where `<sra_id>` is the assigned SRA identification in NCBI (e.g., SRR1482462). + + All SRAtoolkit commands are single threaded, and therefore both `#SBATCH --nodes` and `#SBATCH --ntasks-per-node` in the SLURM script are set to **1**. @@ -64,3 +73,7 @@ Other frequently used SRAtoolkit tools are: - **vdb-encrypt**: encrypt non-SRA dbGaP data - **vdb-decrypt**: decrypt non-SRA dbGaP data - **vdb-validate**: validate the integrity of downloaded SRA data + +{{% notice info %}} +**For older versions of SRA Toolkit, the location of the caching can be changed with `vdb-config -i`.** +{{% /notice %}}