OpenACC Tutorial - Profiling: Difference between revisions

Jump to navigation Jump to search
no edit summary
No edit summary
Line 23: Line 23:
The Amdahl's law says that "Parallelizing the most time-consuming (i.e. the hotspots) routines will have the most impact".
The Amdahl's law says that "Parallelizing the most time-consuming (i.e. the hotspots) routines will have the most impact".


== Build the Sample Code ? ==
== Build the Sample Code ==
For this example we will use a code from the [https://github.com/calculquebec/cq-formation-openacc repositories]. Download the package and change to the '''cpp''' or '''f90''' directory. The point of this exercise is to compile&link the code, obtain executable, and then profile them.
For this example we will use a code from the [https://github.com/calculquebec/cq-formation-openacc repositories]. Download the package and change to the '''cpp''' or '''f90''' directory. The point of this exercise is to compile&link the code, obtain executable, and then profile them.
{{Callout
{{Callout
Line 53: Line 53:
{{Callout
{{Callout
|title=<translate><!--T:6-->
|title=<translate><!--T:6-->
Which profiller ?</translate>
Which profiler ?</translate>
|content=
|content=
<translate>
<translate>
Line 66: Line 66:




=== PGPROF Profiller ===
=== PGPROF Profiler ===
[[File:Pgprof new0.png|thumbnail|300px|Starting new session|left  ]]
[[File:Pgprof new0.png|thumbnail|300px|Starting new session|left  ]]
Bellow are several snapshots demonstrating how to start with the PGPROF profiler. First step is to initiate a new session.  
Bellow are several snapshots demonstrating how to start with the PGPROF profiler. First step is to initiate a new session.  
Line 72: Line 72:
Then specify the profiling options. For example, if you need to profile CPU activity then set the "Profile execution of the CPU" box.
Then specify the profiling options. For example, if you need to profile CPU activity then set the "Profile execution of the CPU" box.


=== NVIDIA Visual Profiller ===
=== NVIDIA Visual Profiler ===


Another profiler available for OpenACC applications is NVIDIA Visual Profiler. It's a cross-platform analyzing tool for the codes written with OpenACC and CUDA C/C++ instructions.
Another profiler available for OpenACC applications is NVIDIA Visual Profiler. It's a cross-platform analyzing tool for the codes written with OpenACC and CUDA C/C++ instructions.
rsnt_translations
56,430

edits

Navigation menu