Genomics data
This site replaces the former Compute Canada documentation site, and is now being managed by the Digital Research Alliance of Canada.
This is not a complete article: This is a Draft, a work in progress that is intended to be published into an article, which may or may not be ready for inclusion in the main wiki. It should not necessarily be considered factual or authoritative.
In partnership with C3G, we maintain several genome databases that are available on Compute Canada's general purpose clusters (Béluga, Cedar, Graham). In addition to the FASTA sequence, many genomes include aligner indices and annotation files.
Where available, the genomics data are located in /cvmfs/ref.mugqic/genomes.
We encourage you to browse the directory to get more information.
[user@cedar5 ~]$ ls -1 /cvmfs/ref.mugqic/genomes
blast_db
chimera_gold_db
chimera_unite_db
greengenes_db
mirbase
pfam_db
silva_db
species
temp
unite_db
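To go one level deeper, the genomes themselves can be listed from the `species` subdirectory. The following is a minimal sketch, not an official tool: the function name `list_genomes` and its optional root argument are hypothetical, and the only thing taken from this page is the default path.

```shell
# Hypothetical helper: list the entries under <root>/species, one per line.
# The default root is the path given on this page; pass another root to override it.
list_genomes() {
    local root="${1:-/cvmfs/ref.mugqic/genomes}"
    if [ ! -d "$root/species" ]; then
        echo "species directory not found under $root" >&2
        return 1
    fi
    ls -1 "$root/species"
}
```

Calling `list_genomes` with no argument on a cluster login node should print one genome directory per line; if the repository is not mounted, the function prints an error and returns a non-zero status.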
Available genomes in species/
Common name | Species | Builds |
---|---|---|
Human | Homo sapiens | |
Mouse | Mus musculus | |
Rat | Rattus norvegicus | |
Monkey | Macaca mulatta | |
Chimpanzee | Pan troglodytes | |
Baboon | Papio anubis | |
Dog | Canis familiaris | |
Cow | Bos taurus | |
Chicken | Gallus gallus | |
Fly | Drosophila melanogaster | |
C. elegans | Caenorhabditis elegans | |
Yeast | Saccharomyces cerevisiae | |
 | Schizosaccharomyces pombe | |
Bacteria | Escherichia coli str k_12 substr dh10b | |
 | Pseudomonas aeruginosa pa14 | |
 | Pseudomonas aeruginosa UCBPP_PA14 | |
Plants | Arabidopsis thaliana | |
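Since many genomes ship with a FASTA sequence, one way to locate it for a given species is to search under `species/`. This is a sketch under stated assumptions: the helper name `find_fasta` is hypothetical, and the `.fa`/`.fasta` extensions and the search depth of three levels are guesses about the layout, which this page does not describe. Browse the directory to confirm the actual structure.

```shell
# Hypothetical helper: print FASTA files under <root>/species whose path
# contains KEYWORD (case-insensitive), for example part of a species name.
find_fasta() {
    local keyword="$1"
    local root="${2:-/cvmfs/ref.mugqic/genomes}"
    if [ -z "$keyword" ]; then
        echo "usage: find_fasta KEYWORD [ROOT]" >&2
        return 2
    fi
    # -L follows symbolic links. The extensions and -maxdepth 3 are
    # assumptions about the layout, not documented facts.
    find -L "$root/species" -maxdepth 3 -type f \
        \( -name '*.fa' -o -name '*.fasta' \) \
        -ipath "*${keyword}*" 2>/dev/null | sort
}
```

For example, `find_fasta homo_sapiens` would print any matching FASTA paths. An empty result means only that nothing matched these assumed patterns, not that the genome is absent.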