Cloud: Difference between revisions

From Alliance Doc
Jump to navigation Jump to search
m (Removing link to Arbutus user docs- obsolete page)
m (fix heading levels)
 
(10 intermediate revisions by 4 users not shown)
Line 5: Line 5:


<!--T:2-->
<!--T:2-->
A user of the cloud will typically create or "spin up" one or more virtual machines (VMs or "instances"). He or she then logs into the VM with administrative privileges, installs any desired software, and runs the software applications needed. These applications could be as diverse as a CPU-intensive analysis of particle physics data, or a web service directed towards scholars of literature and the humanities. The advantage is that the user has complete control over the collection of installed software (the "software stack"). The disadvantage is that the user must have some degree of experience in installing software and otherwise managing a computer.
A user of the cloud will typically create or <i>spin up</i> one or more virtual machines (VMs or <i>instances</i>). He or she then logs into the VM with administrative privileges, installs any desired software, and runs the software applications needed. These applications could be as diverse as a CPU-intensive analysis of particle physics data, or a web service directed towards scholars of literature and the humanities. The advantage is that the user has complete control over the collection of installed software (the <i>software stack</i>). The disadvantage is that the user must have some degree of experience in installing software and otherwise managing a computer.


<!--T:3-->
<!--T:3-->
Virtual machines can be easily replicated. One can take a "snapshot" of a VM which can then be started again elsewhere. This makes it easy to replicate or scale up a service, and to recover from (for example) a power interruption.
Virtual machines can be easily replicated. One can take a <i>snapshot</i> of a VM which can then be started again elsewhere. This makes it easy to replicate or scale up a service, and to recover from (for example) a power interruption.


<!--T:6-->
<!--T:6-->
If you can fit your work easily into the [https://en.wikipedia.org/wiki/Supercomputer HPC] [https://en.wikipedia.org/wiki/Batch_processing batch] submission workflow and environment (see [[What is a scheduler?]]) it is preferable to work outside the cloud, as there are more [[National systems|resources available]] for HPC and software is already configured and installed for many common needs. There are also tools like [[Singularity]] to run custom software stacks inside containers within our HPC clusters.
If you can fit your work easily into the [https://en.wikipedia.org/wiki/Supercomputer HPC] [https://en.wikipedia.org/wiki/Batch_processing batch] submission workflow and environment (see [[What is a scheduler?]]) it is preferable to work outside the cloud, as there are more [[National systems|resources available]] for HPC and software is already configured and installed for many common needs. There are also tools like [[Apptainer]] to run custom software stacks inside containers within our HPC clusters.
If your need isn't served by Singularity or HPC batch, then the cloud is your solution.
If your need isn't served by Apptainer or HPC batch, then the cloud is your solution.


=Getting a Cloud project= <!--T:10-->
==Getting a cloud project== <!--T:10-->
* Review and understand the [[Cloud shared security responsibility model|important role]] you are about to take on to [http://www.science.gc.ca/eic/site/063.nsf/eng/h_97955.html safeguard your research] and the shared cloud infrastructure.
* Review and understand the [[Cloud shared security responsibility model|important role]] you are about to take on to [https://science.gc.ca/site/science/en/safeguarding-your-research safeguard your research] and the shared cloud infrastructure.
* If you do not have an account with us, create one with [https://alliancecan.ca/en/services/advanced-research-computing these instructions].
* If you do not have an account with us, create one with [https://alliancecan.ca/en/services/advanced-research-computing/account-management/apply-account these instructions].
* A [[Managing_your_cloud_resources_with_OpenStack#Projects|project]] is an allocation of resources for creating VMs within a cloud.
* A [[Managing_your_cloud_resources_with_OpenStack#Projects|project]] is an allocation of resources for creating VMs within a cloud.
* If you are a primary investigator (PI) with an active cloud resource allocation (see [https://alliancecan.ca/en/services/advanced-research-computing/research-portal/accessing-resources/resource-allocation-competitions RAC]) you should already have a project. See the sections below on using the cloud to get started. If not or if you are not sure please contact [[Technical support|technical support]].
* If you are a primary investigator (PI) with an active cloud resource allocation (see [https://alliancecan.ca/en/services/advanced-research-computing/research-portal/accessing-resources/resource-allocation-competitions RAC]) you should already have a project. See the sections below on using the cloud to get started. If not or if you are not sure please contact [[Technical support|technical support]].
* Otherwise go [https://docs.google.com/forms/d/e/1FAIpQLSeU_BoRk5cEz3AvVLf3e9yZJq-OvcFCQ-mg7p4AWXmUkd5rTw/viewform here] to
* Otherwise go to the [https://docs.google.com/forms/d/e/1FAIpQLSeU_BoRk5cEz3AvVLf3e9yZJq-OvcFCQ-mg7p4AWXmUkd5rTw/viewform Alliance cloud project and RAS request form] to
** request access to an existing project (see the section below for information you will need to supply)
** request access to an existing project (see the section below for information you will need to supply)
** and if you are a PI you may also
** and if you are a PI you may also
Line 28: Line 28:
* Requests are typically processed within two business days.
* Requests are typically processed within two business days.


==Preparing your request== <!--T:12-->
===Preparing your request=== <!--T:12-->
* When requesting access to an existing project, you will need to know the project name and which cloud it is on. See the section on [[Managing_your_cloud_resources_with_OpenStack#Projects|projects]] for guidance on how to find the project name and the section about [[Cloud#Cloud_systems|cloud systems]] for a list of our clouds. Requests for access must be confirmed by the PI owning the project.
* When requesting access to an existing project, you will need to know the project name and which cloud it is on. See the section on [[Managing_your_cloud_resources_with_OpenStack#Projects|projects]] for guidance on how to find the project name and the section about [[Cloud#Cloud_systems|cloud systems]] for a list of our clouds. Requests for access must be confirmed by the PI owning the project.
* When requesting either a new project or an increase in quota for an existing project some justification, in the form of a few sentences, is required:
* When requesting either a new project or an increase in quota for an existing project some justification, in the form of a few sentences, is required:
Line 37: Line 37:
*A PI may own up to 3 projects, but the sum of all project quotas must be within the [[Cloud_RAS_Allocations|RAS]] allocation limits. A PI may have both compute and persistent cloud RAS allocations.
*A PI may own up to 3 projects, but the sum of all project quotas must be within the [[Cloud_RAS_Allocations|RAS]] allocation limits. A PI may have both compute and persistent cloud RAS allocations.


=Creating a virtual machine on the cloud infrastructure= <!--T:13-->
==Creating a virtual machine on the cloud infrastructure== <!--T:13-->
* The [[Cloud Quick Start|cloud quick start guide]] describes how to manually create your first VM.
* The [[Cloud Quick Start|cloud quick start guide]] describes how to manually create your first VM.
* Review the [[Cloud_Technical_Glossary|glossary]] to learn definitions of common topics.
* Review the [[Cloud_Technical_Glossary|glossary]] to learn definitions of common topics.
Line 43: Line 43:
* See the [[Cloud troubleshooting guide|troubleshooting guide]] for steps to deal with common issues in cloud computing.
* See the [[Cloud troubleshooting guide|troubleshooting guide]] for steps to deal with common issues in cloud computing.


=User responsibilities= <!--T:14-->
==User responsibilities== <!--T:14-->
For each cloud project, you are responsible for
For each cloud project, you are responsible for
*[[Cloud_shared_security_responsibility_model|Securing and patching software on your VM]],
*[[Managing_your_cloud_resources_with_OpenStack | Creating and managing your virtual machines ]]
*[[Managing_your_cloud_resources_with_OpenStack#Security_Groups|Defining security groups to allow access to your network]],
*[[Cloud_shared_security_responsibility_model|Securing and patching software on your VM]]
*[[Managing_your_Linux_VM|Creating user accounts]],
*[[Managing_your_cloud_resources_with_OpenStack#Security_groups|Defining security groups to allow access to your network]]
*[[VM Best Practices|Following best practices]],
*[[Managing_your_Linux_VM|Creating user accounts]]
*[[Security_considerations_when_running_a_VM|Considering security issues]],
*[[VM Best Practices|Following best practices]]
*[[Backing_up_your_VM|Backing up your VMs]].
*[[Security_considerations_when_running_a_VM|Considering security issues]]
*[[Backing_up_your_VM|Backing up your VMs]]


=Advanced topics= <!--T:15-->
==Advanced topics== <!--T:15-->
More experienced users can:
More experienced users can
*[[Automating_VM_creation|Automatically create VMs]],
*[[Automating_VM_creation|Automatically create VMs]],
*Describe your VM infrastructure as code using [[Terraform]].
*Describe your VM infrastructure as code using [[Terraform]].
=Use cases=  
==Use cases==
More detailed instructions are available for some of the common cloud use cases, including:
More detailed instructions are available for some of the common cloud use cases, including
*[[Configuring a data or web server|Configure a data or web server]],
*[[Configuring a data or web server|Configure a data or web server]]
*[[Using cloud vGPUs|Using vGPUs (standard shared GPU allocation) in the cloud]],
*[[Using cloud vGPUs|Using vGPUs (standard shared GPU allocation) in the cloud]]
*[[Using cloud gpu|Using PCI-e passthrough GPUs in the cloud]],
*[[Using cloud gpu|Using PCI-e passthrough GPUs in the cloud]]
*[[Setting up GUI Desktop on a VM]],
*[[Setting up GUI Desktop on a VM]]
*[[Using ipv6 in cloud|Using IPv6 in Arbutus cloud]],
*[[Using ipv6 in cloud|Using IPv6 in Arbutus cloud]]


= Cloud systems = <!--T:8-->
==Cloud systems== <!--T:8-->
Your project will be on one of the following clouds:
Your project will be on one of the following clouds:
* [https://beluga.cloud.computecanada.ca Béluga cloud]
* [https://beluga.cloud.computecanada.ca Béluga]
* [https://arbutus.cloud.computecanada.ca Arbutus]
* [https://arbutus.cloud.computecanada.ca Arbutus]
* [https://graham.cloud.computecanada.ca Graham]
* [https://graham.cloud.computecanada.ca Graham]
Line 72: Line 73:
The details of the underlying hardware and OpenStack versions are described on the [[cloud resources]] page. The [[System status]] wiki page contains information about the current cloud status and future planned maintenance and upgrade activities.
The details of the underlying hardware and OpenStack versions are described on the [[cloud resources]] page. The [[System status]] wiki page contains information about the current cloud status and future planned maintenance and upgrade activities.


= Support = <!--T:5-->
==Support== <!--T:5-->
For questions about our cloud service, contact [[Technical support|technical support]].
For questions about our cloud service, contact [[Technical support|technical support]].


</translate>
</translate>
[[Category:Cloud]]
[[Category:Cloud]]

Latest revision as of 14:37, 14 December 2023

Other languages:

We offer Infrastructure as a Service that supports virtualization.

A user of the cloud will typically create or spin up one or more virtual machines (VMs or instances). He or she then logs into the VM with administrative privileges, installs any desired software, and runs the software applications needed. These applications could be as diverse as a CPU-intensive analysis of particle physics data, or a web service directed towards scholars of literature and the humanities. The advantage is that the user has complete control over the collection of installed software (the software stack). The disadvantage is that the user must have some degree of experience in installing software and otherwise managing a computer.

Virtual machines can be easily replicated. One can take a snapshot of a VM which can then be started again elsewhere. This makes it easy to replicate or scale up a service, and to recover from (for example) a power interruption.

If you can fit your work easily into the HPC batch submission workflow and environment (see What is a scheduler?) it is preferable to work outside the cloud, as there are more resources available for HPC and software is already configured and installed for many common needs. There are also tools like Apptainer to run custom software stacks inside containers within our HPC clusters. If your need isn't served by Apptainer or HPC batch, then the cloud is your solution.

Getting a cloud project[edit]

  • Review and understand the important role you are about to take on to safeguard your research and the shared cloud infrastructure.
  • If you do not have an account with us, create one with these instructions.
  • A project is an allocation of resources for creating VMs within a cloud.
  • If you are a primary investigator (PI) with an active cloud resource allocation (see RAC) you should already have a project. See the sections below on using the cloud to get started. If not or if you are not sure please contact technical support.
  • Otherwise go to the Alliance cloud project and RAS request form to
    • request access to an existing project (see the section below for information you will need to supply)
    • and if you are a PI you may also
      • request a new project with our Rapid Access Service (RAS),
      • or request an increase in quota of an existing project.
  • Requests are typically processed within two business days.

Preparing your request[edit]

  • When requesting access to an existing project, you will need to know the project name and which cloud it is on. See the section on projects for guidance on how to find the project name and the section about cloud systems for a list of our clouds. Requests for access must be confirmed by the PI owning the project.
  • When requesting either a new project or an increase in quota for an existing project some justification, in the form of a few sentences, is required:
    • why you need cloud resources,
    • why an HPC cluster is not suitable,
    • your plans for efficient usage of your resources,
    • your plans for maintenance and security (refer to this page).
  • A PI may own up to 3 projects, but the sum of all project quotas must be within the RAS allocation limits. A PI may have both compute and persistent cloud RAS allocations.

Creating a virtual machine on the cloud infrastructure[edit]

User responsibilities[edit]

For each cloud project, you are responsible for

Advanced topics[edit]

More experienced users can

Use cases[edit]

More detailed instructions are available for some of the common cloud use cases, including

Cloud systems[edit]

Your project will be on one of the following clouds:

The details of the underlying hardware and OpenStack versions are described on the cloud resources page. The System status wiki page contains information about the current cloud status and future planned maintenance and upgrade activities.

Support[edit]

For questions about our cloud service, contact technical support.