CUDA tutorial: Difference between revisions

(Marked this version for translation)
Line 55:
* How do you make such kernel run in a massively parallel fashion?
Here is the execution recipe that will answer the above questions:
* each GPU core (streaming processor) executes a sequential '''thread''', where a '''thread''' is a smallest set of instructions handled by the operating system's schedule.
* each GPU core (streaming processor) executes a sequential '''thread''', where a '''thread''' is the smallest sequence of instructions that a scheduler can manage independently.
* all GPU cores execute the kernel in a SIMT fashion (Single Instruction, Multiple Threads)
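
The recipe above can be illustrated with a minimal sketch. The kernel name <code>add_one</code>, the host helper <code>run_add_one</code>, the array size and the block size of 256 are assumptions chosen for illustration; they do not come from this tutorial. Every thread runs the same sequential kernel body and uses its own index to pick the element it works on. Error checking is omitted for brevity.

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

// Kernel: every thread executes this same sequential code (SIMT),
// but each thread computes a different index and touches a different element.
__global__ void add_one(float *data, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;  // global index of this thread
    if (i < n)                                      // the last block may have spare threads
        data[i] += 1.0f;
}

// Hypothetical host helper (not part of any library): copies the input to
// the GPU, launches one thread per element, and copies the result back.
std::vector<float> run_add_one(std::vector<float> host)
{
    int n = static_cast<int>(host.size());
    if (n == 0) return host;

    size_t bytes = n * sizeof(float);
    float *dev = nullptr;
    cudaMalloc(&dev, bytes);
    cudaMemcpy(dev, host.data(), bytes, cudaMemcpyHostToDevice);

    int threads = 256;                          // threads per block (assumed value)
    int blocks = (n + threads - 1) / threads;   // enough blocks to cover all n elements
    add_one<<<blocks, threads>>>(dev, n);

    // This copy waits for the kernel to finish before reading the results.
    cudaMemcpy(host.data(), dev, bytes, cudaMemcpyDeviceToHost);
    cudaFree(dev);
    return host;
}

int main()
{
    std::vector<float> out = run_add_one(std::vector<float>(1000, 1.0f));
    printf("out[0] = %.1f, out[999] = %.1f\n", out[0], out[999]);
    return 0;
}
```

Assuming the file is saved as <code>add_one.cu</code>, it can be compiled with <code>nvcc add_one.cu -o add_one</code>. The kernel contains no loop over the array: the parallelism comes entirely from launching many threads that each handle one element.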
Usually the following procedure is recommended when it comes to executing on GPU: