CUDA tutorial: Difference between revisions

CUDA tutorial (view source)

226 bytes added , 7 years ago

Bureaucrats, cc_docs_admin, cc_staff

337

edits

@@ Line 55: / Line 55: @@
 * each GPU core (streaming processor) execute a sequential '''Thread''', where '''Thread''' is a smallest set of instructions handled by the operating system's schedule.
 * all GPU cores execute the kernel in a SIMT fashion (Single Instruction Multiple Threads)
+Usually the following procedure is recommended when it comes to executing on GPU:
+. Copy input data from CPU memory to GPU memory
+. Load GPU program (Kernel) and execute it
+. Copy results from GPU memory back to CPU memory
 = First CUDA C Program=