MrBayes: Difference between revisions

Latest revision as of 16:49, 28 June 2024

Other languages:

English
français

MrBayes is a program for Bayesian inference and model choice across a wide range of phylogenetic and evolutionary models. MrBayes uses Markov chain Monte Carlo (MCMC) methods to estimate the posterior distribution of model parameters.

Finding available modules[edit]

[name@server ~]$ module spider mrbayes

For more on finding and selecting a version of MrBayes using module commands see Using modules

Examples[edit]

Sequential[edit]

The following job script uses only one CPU core (--cpus-per-task=1). The example uses an input file (primates.nex) distributed with MrBayes.

File : submit-mrbayes-seq.sh

#!/bin/bash
#SBATCH --account=def-someuser  # replace with your PI account
#SBATCH --cpus-per-task=1 
#SBATCH --mem-per-cpu=3G        # increase as needed
#SBATCH --time=1:00:00          # increase as needed

module load mrbayes/3.2.7
cd $SCRATCH 
cp -v $EBROOTMRBAYES/share/examples/mrbayes/primates.nex .

mb primates.nex

The job script can be submitted with

[name@server ~]$ sbatch submit-mrbayes-seq.sh

Parallel[edit]

MrBayes can be run on multiple cores, on multiple nodes, and on GPUs.

MPI[edit]

The following job script will use 8 CPU cores in total, on one or more nodes. Like the previous example, it uses an input file (primates.nex) distributed with MrBayes.

File : submit-mrbayes-parallel.sh

#!/bin/bash
#SBATCH --account=def-someuser  # replace with your PI account
#SBATCH --ntasks=8 				# increase as needed
#SBATCH --mem-per-cpu=3G        # increase as needed
#SBATCH --time=1:00:00          # increase as needed

module load mrbayes/3.2.7
cd $SCRATCH 
cp -v $EBROOTMRBAYES/share/examples/mrbayes/primates.nex .

srun mb primates.nex

The job script can be submitted with

[name@server ~]$ sbatch submit-mrbayes-parallel.sh

GPU[edit]

The following job script will use a GPU. Like the previous examples, it uses an input file (primates.nex) distributed with MrBayes.

File : submit-mrbayes-gpu.sh

#!/bin/bash
#SBATCH --account=def-someuser  # replace with your PI account
#SBATCH --cpus-per-task=1
#SBATCH --gpus=1
#SBATCH --mem-per-cpu=3G        # increase as needed
#SBATCH --time=1:00:00          # increase as needed

module load gcc cuda/12 mrbayes/3.2.7
cd $SCRATCH 
cp -v $EBROOTMRBAYES/share/examples/mrbayes/primates.nex .

srun mb primates.nex

The job script can be submitted with

[name@server ~]$ sbatch submit-mrbayes-gpu.sh

Checkpointing[edit]

If you need very long runs of MrBayes, we suggest you break up the work into several small jobs rather than one very long job. Long jobs have are more likely to be interrupted by hardware failure or maintenance outage. Fortunately, MrBayes has a mechanism for creating checkpoints, in which progress can be saved from one job and continued in a subsequent job.

Here is an example of how to split a calculation into two Slurm jobs which will run one after the other. Create two files, job1.nex and job2.nex, as shown below. Notice that the key difference between them is the presence of the append keyword in the second.

File : job1.nex

execute primates.nex;

mcmc ngen=10000000 nruns=2 temp=0.02 mcmcdiag=yes samplefreq=1000 
stoprule=yes stopval=0.005 relburnin=yes burninfrac=0.1 printfreq=1000 
checkfreq=1000;

File : job2.nex

execute primates.nex;

mcmc ngen=20000000 nruns=2 temp=0.02 mcmcdiag=yes samplefreq=1000
stoprule=yes stopval=0.005 relburnin=yes burninfrac=0.1 printfreq=1000
append=yes checkfreq=1000;

Then create a job script. This example is a job array, which means that one script and one sbatch command will be sufficient to launch two Slurm jobs, and therefore both parts of the calculation. See Job arrays for more about the --array parameter and the $SLURM_ARRAY_TASK_ID variable used here.

File : submit-mrbayes-cp.sh

#!/bin/bash
#SBATCH --account=def-someuser  # replace with your PI account
#SBATCH --ntasks=8 				# increase as needed
#SBATCH --mem-per-cpu=3G        # increase as needed
#SBATCH --time=1:00:00          # increase as needed
#SBATCH --array=1-2%1           # match the number of sub-jobs, only 1 at a time

module load gcc mrbayes/3.2.7
cd $SCRATCH 
cp -v $EBROOTMRBAYES/share/examples/mrbayes/primates.nex .

srun mb job${SLURM_ARRAY_TASK_ID}.nex

The example can be submitted with

[name@server ~]$ sbatch submit-mrbayes-cp.sh

@@ Line 8: / Line 8: @@
 {{Command|module spider mrbayes}}
-For more on finding and selecting a specific version of MrBayes using <code>module</code> commands see [[Utiliser_des_modules/en|Using modules]]
+<!--T:34-->
+For more on finding and selecting a version of MrBayes using <code>module</code> commands see [[Utiliser_des_modules/en|Using modules]]
 == Examples == <!--T:3-->
@@ Line 16: / Line 17: @@
 The example uses an input file (<code>primates.nex</code>) distributed with MrBayes.
+<!--T:35-->
 {{File
    |name=submit-mrbayes-seq.sh
@@ Line 44: / Line 46: @@
 ==== MPI ==== <!--T:11-->
 The following job script will use 8 CPU cores in total, on one or more nodes.
-Like the previous example it uses an input file (<code>primates.nex</code>) distributed with MrBayes.
+Like the previous example, it uses an input file (<code>primates.nex</code>) distributed with MrBayes.
+<!--T:36-->
 {{File
    |name=submit-mrbayes-parallel.sh
@@ Line 71: / Line 74: @@
 ==== GPU ==== <!--T:17-->
 The following job script will use a GPU.
-Like the previous examples it uses an input file (<code>primates.nex</code>) distributed with MrBayes.
+Like the previous examples, it uses an input file (<code>primates.nex</code>) distributed with MrBayes.
+<!--T:37-->
 {{File
    |name=submit-mrbayes-gpu.sh
@@ Line 98: / Line 102: @@
 == Checkpointing == <!--T:23-->
-If you need very long runs of MrBayes, we suggest you break up the work into several small jobs rather than one very long job. Long jobs have are more likely to be interrupted a hardware failure or a maintenance outage. Fortunately, MrBayes has a mechanism for creating checkpoints, in which progress can be saved from one job and continued in a subsequent job.
+If you need very long runs of MrBayes, we suggest you break up the work into several small jobs rather than one very long job. Long jobs have are more likely to be interrupted by hardware failure or maintenance outage. Fortunately, MrBayes has a mechanism for creating checkpoints, in which progress can be saved from one job and continued in a subsequent job.
+<!--T:38-->
 Here is an example of how to split a calculation into two Slurm jobs which will run one after the other.  Create two files, <code>job1.nex</code> and <code>job2.nex</code>, as shown below.  Notice that the key difference between them is the presence of the <code>append</code> keyword in the second.
@@ Line 132: / Line 137: @@
 one <code>sbatch</code> command will be sufficient to launch two Slurm jobs, and therefore
 both parts of the calculation.  See [[Job arrays]] for more about the <code>--array</code>
-directive and the <code>$SLURM_ARRAY_TASK_ID</code> variable used here.
+parameter and the <code>$SLURM_ARRAY_TASK_ID</code> variable used here.
+<!--T:39-->
 {{File
    |name=submit-mrbayes-cp.sh
@@ Line 150: / Line 156: @@
 cp -v $EBROOTMRBAYES/share/examples/mrbayes/primates.nex .
+<!--T:40-->
 srun mb job${SLURM_ARRAY_TASK_ID}.nex
 }}

MrBayes: Difference between revisions

Latest revision as of 16:49, 28 June 2024

Contents

Finding available modules[edit]

Examples[edit]

Sequential[edit]

Parallel[edit]

MPI[edit]

GPU[edit]

Checkpointing[edit]

Navigation menu

MrBayes: Difference between revisions

Latest revision as of 16:49, 28 June 2024

Finding available modules[edit]

Examples[edit]

Sequential[edit]

Parallel[edit]

MPI[edit]

GPU[edit]

Checkpointing[edit]

Navigation menu

Search