rsnt_translations
56,430
edits
No edit summary |
(Marked this version for translation) |
||
Line 170: | Line 170: | ||
<translate> | <translate> | ||
==== Building with OpenACC ==== | ==== Building with OpenACC ==== <!--T:49--> | ||
<!--T:22--> | |||
{{Callout | {{Callout | ||
|title= | |title= | ||
Which compiler ?</translate> | Which compiler ?</translate> | ||
|content= | |content= | ||
Line 184: | Line 185: | ||
}} | }} | ||
<!--T:50--> | |||
The NVidia compilers use the <tt>-ta</tt> (target accelerator) option to enable compilation for an accelerator. We use the sub-option <tt>tesla:managed</tt>, to tell the compiler that we want it compiled for Tesla GPUs, and we want to use managed memory. Managed memory simplifies the process of transferring data to and from the device. We will remove this option in a later example. We also use the option <tt>-fast</tt>, which is an optimization option. | The NVidia compilers use the <tt>-ta</tt> (target accelerator) option to enable compilation for an accelerator. We use the sub-option <tt>tesla:managed</tt>, to tell the compiler that we want it compiled for Tesla GPUs, and we want to use managed memory. Managed memory simplifies the process of transferring data to and from the device. We will remove this option in a later example. We also use the option <tt>-fast</tt>, which is an optimization option. | ||
</translate> | </translate> | ||
Line 207: | Line 209: | ||
<translate> | <translate> | ||
<!--T:51--> | |||
As we can see in the compiler output, the compiler could not parallelize the two loops. We will see in the following sections how to deal with this. | As we can see in the compiler output, the compiler could not parallelize the two loops. We will see in the following sections how to deal with this. | ||