Bureaucrats, cc_docs_admin, cc_staff
2,306
edits
No edit summary |
|||
Line 13: | Line 13: | ||
==Initialization== | ==Initialization== | ||
In order to be used correctly, MetaPhlAn needs to download certain databases from a remote server and then compute indices derived from the components of these databases. On those clusters which do not permit Internet access from the compute nodes, these databases will have to be downloaded using a login node, | In order to be used correctly, MetaPhlAn needs to download certain databases from a remote server and then compute indices derived from the components of these databases. On those clusters which do not permit Internet access from the compute nodes, these databases will have to be downloaded using a login node using a tool such as wget, | ||
{{Command|prompt=(ENV) [name@server ~]| | {{Command|prompt=(ENV) [name@server ~]|wget http://cmprod1.cibio.unitn.it/biobakery3/metaphlan_databases/mpa_v31_CHOCOPhlAn_201901.tar}} | ||
You can then untar the database file and compute the indices using a job, so as not to put an undue computational burden on the shared login node which you are using. A sample script is the following, | |||
{{File | |||
|name=job.sh | |||
|lang="sh" | |||
|contents= | |||
#!/bin/bash | |||
#SBATCH --account=def-someuser | |||
#SBATCH --time=01:00:00 | |||
#SBATCH --cpus-per-task=4 | |||
#SBATCH --mem=10G | |||
module load gcc blast samtools bedtools bowtie2 python/3.9 | |||
cd $HOME | |||
source ENV/bin/activate | |||
metaphlan --install --index mpa_v31_CHOCOPhlAn_201901 --bowtie2db $PWD | |||
}} |