Multi-Instance GPU: Difference between revisions

Jump to navigation Jump to search
no edit summary
No edit summary
No edit summary
Line 47: Line 47:
<translate>  
<translate>  
<!--T:12-->
<!--T:12-->
Request 1 MIG of power 4/8 for a 24-hour batch script using the maximum recommended number of cores and system memory.
Request 1 MIG of power 4/8 and size 20GB for a 24-hour batch script using the maximum recommended number of cores and system memory.
</translate>  
</translate>  


Line 70: Line 70:


<!--T:14-->
<!--T:14-->
There are currently two ways to monitor the resource usage of a GPU job. One can find information on current and past jobs by looking at the Narval usage [https://docs.alliancecan.ca/wiki/Portail portal], under the <code>Job stats</code> tab.  
You can find information on current and past jobs on the [https://docs.alliancecan.ca/wiki/Portail Narval usage portal], under the <code>Job stats</code> tab.  


<!--T:15-->
<!--T:15-->
Electric power consumption is a good indicator of the total computing power requested from the GPU. For instance, the following job requested one A100 GPU with a maximum TDP of 400W, but only used 100W on average, which is only 50W more than the idle electric consumption:
Power consumption is a good indicator of the total computing power requested from the GPU. For instance, the following job requested one A100 GPU with a maximum TDP of 400W, but only used 100W on average, which is only 50W more than the idle electric consumption:
   
   
[[File:ExampleGPUPower.png|400px|frame|left|Example GPU Power usage of a job on a A100 GPU]]  
[[File:ExampleGPUPower.png|400px|frame|left|Example GPU Power usage of a job on a A100 GPU]]  
rsnt_translations
56,430

edits

Navigation menu