Bureaucrats, cc_docs_admin, cc_staff
2,879
edits
(Marked this version for translation) |
(spelling and simplification) |
||
Line 2: | Line 2: | ||
<translate> | <translate> | ||
<!--T:2--> | <!--T:2--> | ||
This guide describes how to allocate vGPU resources to a virtual machine (VM), | This guide describes how to | ||
* allocate vGPU resources to a virtual machine (VM), | |||
* install the necessary drivers and | |||
* check whether the vGPU can be used. | |||
Access to repositories as well as to the vGPUs is currently only available within [https://arbutus.cloud.computecanada.ca Arbutus Cloud]. Please note that the documentation below only covers the vGPU driver installation; the CUDA toolkit is not pre-installed. | |||
The [https://developer.nvidia.com/cuda-toolkit-archive CUDA toolkit] can be installed directly from Nvidia or used from [[Accessing_CVMFS|the CVMFS software stack]]. | The [https://developer.nvidia.com/cuda-toolkit-archive CUDA toolkit] can be installed directly from Nvidia or used from [[Accessing_CVMFS|the CVMFS software stack]]. | ||
Line 8: | Line 12: | ||
<!--T:3--> | <!--T:3--> | ||
To use a vGPU within a VM, the instance needs to be deployed on one of the flavors listed below. The vGPU will be available to the operating system via the PCI bus. | To use a vGPU within a VM, the instance needs to be deployed on one of the flavors listed below. The vGPU will be available to the operating system via the PCI bus. | ||
<!--T:4--> | <!--T:4--> | ||
Line 16: | Line 20: | ||
<!--T:24--> | <!--T:24--> | ||
Once the VM is available, make sure to update the OS to the latest available software, including the kernel | Once the VM is available, make sure to update the OS to the latest available software, including the kernel. Then reboot the VM to have the latest kernel running. | ||
<pre> | <pre> | ||
[root@centos7]# yum -y update && reboot | [root@centos7]# yum -y update && reboot | ||
Line 22: | Line 26: | ||
<!--T:25--> | <!--T:25--> | ||
Since the proprietary | Since the proprietary Nvidia drivers need to be compiled against the running kernel, the package '''dkms''' is required from the [https://dl.fedoraproject.org/pub/epel/epel-release-latest-7.noarch.rpm EPEL Repository] | ||
<!--T:26--> | <!--T:26--> | ||
Line 30: | Line 34: | ||
<!--T:27--> | <!--T:27--> | ||
Install the '''Arbutus Cloud''' [http://repo.arbutus.cloud.computecanada.ca/pulp/repos/centos/arbutus-cloud-vgpu-repo.el7.noarch.rpm repository] | Install the '''Arbutus Cloud''' [http://repo.arbutus.cloud.computecanada.ca/pulp/repos/centos/arbutus-cloud-vgpu-repo.el7.noarch.rpm repository]. | ||
This also installs the public key the packages are signed with to ensure their authenticity | |||
These drivers and user-space tools are carefully tested against the infrastructure before they are made available. | |||
<pre> | <pre> | ||
[root@centos7]# yum -y install http://repo.arbutus.cloud.computecanada.ca/pulp/repos/centos/arbutus-cloud-vgpu-repo.el7.noarch.rpm | [root@centos7]# yum -y install http://repo.arbutus.cloud.computecanada.ca/pulp/repos/centos/arbutus-cloud-vgpu-repo.el7.noarch.rpm | ||
Line 37: | Line 42: | ||
<!--T:28--> | <!--T:28--> | ||
The last step is to install the '''nvidia vGPU packages'''. The kernel module package 'nvidia-vgpu-kmod' | The last step is to install the '''nvidia vGPU packages'''. | ||
The kernel module package 'nvidia-vgpu-kmod' will take a few minutes as it compiles the required kernel modules in the background. | |||
<pre> | <pre> | ||
[root@centos7]# yum -y install nvidia-vgpu-kmod nvidia-vgpu-gridd nvidia-vgpu-tools | [root@centos7]# yum -y install nvidia-vgpu-kmod nvidia-vgpu-gridd nvidia-vgpu-tools | ||
Line 43: | Line 49: | ||
<!--T:29--> | <!--T:29--> | ||
If your installation was successful, the vGPU will be accessible and licensed. | |||
Test by running <code>nvidia-smi</code>: | |||
<pre> | <pre> | ||
[root@centos7]# nvidia-smi | [root@centos7]# nvidia-smi | ||
Line 71: | Line 78: | ||
<!--T:30--> | <!--T:30--> | ||
To check for the license status as well as other information | To check for the license status as well as other information about the vGPU: | ||
<pre> | <pre> | ||
Line 130: | Line 137: | ||
== Preparation of a VM running CentOS8 == <!--T:6--> | == Preparation of a VM running CentOS8 == <!--T:6--> | ||
Once the VM is available, make sure to update the OS to the latest available software, including the kernel | Once the VM is available, make sure to update the OS to the latest available software, including the kernel. Then reboot the VM to have the latest kernel running. | ||
<pre> | <pre> | ||
[root@centos8]# dnf -y update && reboot | [root@centos8]# dnf -y update && reboot | ||
Line 136: | Line 143: | ||
<!--T:35--> | <!--T:35--> | ||
Since the | Since the proprietary Nvidia drivers need to be compiled against the running kernel, the package '''dkms''' is required from the [https://dl.fedoraproject.org/pub/epel/epel-release-latest-8.noarch.rpm EPEL Repository] | ||
<!--T:36--> | <!--T:36--> | ||
Line 144: | Line 151: | ||
<!--T:37--> | <!--T:37--> | ||
Install the '''Arbutus Cloud''' [http://repo.arbutus.cloud.computecanada.ca/pulp/repos/centos/arbutus-cloud-vgpu-repo.el8.noarch.rpm repository] | Install the '''Arbutus Cloud''' [http://repo.arbutus.cloud.computecanada.ca/pulp/repos/centos/arbutus-cloud-vgpu-repo.el8.noarch.rpm repository]. | ||
It also installs the public key the packages are signed with to ensure their authenticity. | |||
These drivers and user-space tools are carefully tested against the infrastructure before they are made available. | |||
<pre> | <pre> | ||
[root@centos8]# dnf -y install http://repo.arbutus.cloud.computecanada.ca/pulp/repos/centos/arbutus-cloud-vgpu-repo.el8.noarch.rpm | [root@centos8]# dnf -y install http://repo.arbutus.cloud.computecanada.ca/pulp/repos/centos/arbutus-cloud-vgpu-repo.el8.noarch.rpm | ||
Line 151: | Line 159: | ||
<!--T:38--> | <!--T:38--> | ||
The last step is to install the '''nvidia vGPU packages'''. The kernel module package 'nvidia-vgpu-kmod' | The last step is to install the '''nvidia vGPU packages'''. | ||
The kernel module package 'nvidia-vgpu-kmod' will take a few minutes as it compiles the required kernel modules in the background. | |||
<pre> | <pre> | ||
[root@centos8]# dnf -y install nvidia-vgpu-kmod nvidia-vgpu-gridd nvidia-vgpu-tools | [root@centos8]# dnf -y install nvidia-vgpu-kmod nvidia-vgpu-gridd nvidia-vgpu-tools | ||
Line 157: | Line 166: | ||
<!--T:39--> | <!--T:39--> | ||
If your installation was successful, the vGPU will be accessible and licensed. | |||
Test by running <code>nvidia-smi</code> as shown above for Centos7. | |||
== Preparation of a VM running Debian10 == <!--T:7--> | == Preparation of a VM running Debian10 == <!--T:7--> | ||
Ensure that the latest packages are installed and the system has been booted the latest stable kernel, as dkms will request the latest one available from the | Ensure that the latest packages are installed and the system has been booted with the latest stable kernel, as '''dkms''' will request the latest one available from the Debian repositories. | ||
<!--T:40--> | <!--T:40--> | ||
Line 169: | Line 178: | ||
<!--T:41--> | <!--T:41--> | ||
After a successful reboot, the system should have the latest available kernel running and the repository can be installed, by installing the repo package. | After a successful reboot, the system should have the latest available kernel running and the repository can be installed, by installing the <code>arbutus-cloud-repo</code> package. | ||
This package also contains the gpg key all packages are signed with. | This package also contains the gpg key all packages are signed with. | ||
Line 180: | Line 189: | ||
<!--T:43--> | <!--T:43--> | ||
The installation of the package will display a warning, since the key is directly imported (for convenience) via the package's post installation procedure. | The installation of the package will display a warning, since the key is directly imported (for convenience) via the package's post-installation procedure. | ||
<!--T:44--> | <!--T:44--> | ||
Line 190: | Line 199: | ||
<!--T:45--> | <!--T:45--> | ||
Update | Update the local apt cache and install the vGPU packages: | ||
<!--T:46--> | <!--T:46--> | ||
Line 198: | Line 207: | ||
<!--T:47--> | <!--T:47--> | ||
If your installation was successful, the vGPU will be accessible and licensed. | |||
Test by running <code>nvidia-smi</code> as shown above for Centos7. | |||
== Preparation of a VM running Ubuntu20 == <!--T:8--> | == Preparation of a VM running Ubuntu20 == <!--T:8--> | ||
Line 209: | Line 219: | ||
<!--T:49--> | <!--T:49--> | ||
After a successful reboot, the system should have the latest | After a successful reboot, the system should have the latest available kernel running. | ||
Now the repository can be installed by installing the <code>arbutus-cloud-repo</code> package. | |||
This package also contains the gpg key all packages are signed with. | |||
<!--T:50--> | <!--T:50--> | ||
Line 218: | Line 230: | ||
<!--T:51--> | <!--T:51--> | ||
A warning will be displayed since the signature key is added in the post-install stage. The warning can be ignored. | |||
Update | Update the local apt cache and install the vGPU packages: | ||
<pre> | <pre> | ||
root@ubuntu20:~# apt-get update && apt-get -y install nvidia-vgpu-kmod nvidia-vgpu-tools nvidia-vgpu-gridd | root@ubuntu20:~# apt-get update && apt-get -y install nvidia-vgpu-kmod nvidia-vgpu-tools nvidia-vgpu-gridd | ||
Line 225: | Line 237: | ||
<!--T:52--> | <!--T:52--> | ||
If your installation was successful, the vGPU will be accessible and licensed. | |||
Test by running <code>nvidia-smi</code> as shown above for Centos7. | |||
== Preparation of a VM running Ubuntu18 == <!--T:57--> | == Preparation of a VM running Ubuntu18 == <!--T:57--> | ||
Line 236: | Line 249: | ||
<!--T:59--> | <!--T:59--> | ||
After a successful reboot, the system should have the latest | After a successful reboot, the system should have the latest available kernel running. | ||
Now the repository can be installed by installing the <code>arbutus-cloud-repo</code> package. | |||
This package also contains the gpg key all packages are signed with. | |||
<!--T:60--> | <!--T:60--> | ||
Line 245: | Line 260: | ||
<!--T:61--> | <!--T:61--> | ||
A warning will be displayed since the signature key is added in the post-install stage. The warning can be ignored. | |||
Update | Update the local apt cache and install the vGPU packages: | ||
<pre> | <pre> | ||
root@ubuntu18:~# apt-get update && apt-get -y install nvidia-vgpu-kmod nvidia-vgpu-tools nvidia-vgpu-gridd | root@ubuntu18:~# apt-get update && apt-get -y install nvidia-vgpu-kmod nvidia-vgpu-tools nvidia-vgpu-gridd | ||
Line 252: | Line 267: | ||
<!--T:62--> | <!--T:62--> | ||
If your installation was successful, the vGPU will be accessible and licensed. | |||
Test by running <code>nvidia-smi</code> as shown above for Centos7. | |||
</translate> | </translate> |