Bureaucrats, cc_docs_admin, cc_staff
337
edits
Line 61: | Line 61: | ||
= CUDA Block-Threading Model = | = CUDA Block-Threading Model = | ||
Given very large number of threads (and in order to achieve massive parallelism one has to use all the threads possible) in CUDA kernel, one needs to organize them somehow. in CUDA, all the threads are structured in threading blocks, the blocks are further organized into grids, as shown on FIg. In dividing the threads we make sure that the following is satisfied: | |||
* threads within a block cooperate via the shared memory | |||
* threads in different blocks can not cooperate | |||
[[File:Cuda-threads-blocks.png|thumbnail|CUDA block-threading model where threads are organized into blocks while blocks are further organized into grid. ]] | [[File:Cuda-threads-blocks.png|thumbnail|CUDA block-threading model where threads are organized into blocks while blocks are further organized into grid. ]] | ||