rsnt_translations
56,430
edits
No edit summary |
|||
Line 23: | Line 23: | ||
The Amdahl's law says that "Parallelizing the most time-consuming (i.e. the hotspots) routines will have the most impact". | The Amdahl's law says that "Parallelizing the most time-consuming (i.e. the hotspots) routines will have the most impact". | ||
== Build the Sample Code | == Build the Sample Code == | ||
For this example we will use a code from the [https://github.com/calculquebec/cq-formation-openacc repositories]. Download the package and change to the '''cpp''' or '''f90''' directory. The point of this exercise is to compile&link the code, obtain executable, and then profile them. | For this example we will use a code from the [https://github.com/calculquebec/cq-formation-openacc repositories]. Download the package and change to the '''cpp''' or '''f90''' directory. The point of this exercise is to compile&link the code, obtain executable, and then profile them. | ||
{{Callout | {{Callout | ||
Line 53: | Line 53: | ||
{{Callout | {{Callout | ||
|title=<translate><!--T:6--> | |title=<translate><!--T:6--> | ||
Which | Which profiler ?</translate> | ||
|content= | |content= | ||
<translate> | <translate> | ||
Line 66: | Line 66: | ||
=== PGPROF | === PGPROF Profiler === | ||
[[File:Pgprof new0.png|thumbnail|300px|Starting new session|left ]] | [[File:Pgprof new0.png|thumbnail|300px|Starting new session|left ]] | ||
Bellow are several snapshots demonstrating how to start with the PGPROF profiler. First step is to initiate a new session. | Bellow are several snapshots demonstrating how to start with the PGPROF profiler. First step is to initiate a new session. | ||
Line 72: | Line 72: | ||
Then specify the profiling options. For example, if you need to profile CPU activity then set the "Profile execution of the CPU" box. | Then specify the profiling options. For example, if you need to profile CPU activity then set the "Profile execution of the CPU" box. | ||
=== NVIDIA Visual | === NVIDIA Visual Profiler === | ||
Another profiler available for OpenACC applications is NVIDIA Visual Profiler. It's a cross-platform analyzing tool for the codes written with OpenACC and CUDA C/C++ instructions. | Another profiler available for OpenACC applications is NVIDIA Visual Profiler. It's a cross-platform analyzing tool for the codes written with OpenACC and CUDA C/C++ instructions. |