OpenACC Tutorial - Profiling: Difference between revisions

[[File:Nvvp-pic0.png|thumbnail|300px|The NVVP profiler|right  ]]
[[File:Nvvp-pic1.png|thumbnail|300px|Browse for executable you want to profile|right  ]]
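== NVIDIA NVPROF Command Line Profiler ==
NVIDIA also provides a command-line profiler called NVPROF, similar to gprof. With the <code>--cpu-profiling on</code> option it samples the CPU side of the program and, after the program's own output, prints a bottom-up report: each function is listed with its share of the samples, followed by its callers.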
{{Command
|nvprof --cpu-profiling on ./cgi.x
|result=
<Program output >
======== CPU profiling result (bottom up):
84.25% matvec(matrix const &, vector const &, vector const &)
  84.25% main
9.50% waxpby(double, vector const &, double, vector const &, vector const &)
3.37% dot(vector const &, vector const &)
2.76% allocate_3d_poisson_matrix(matrix&, int)
  2.76% main
0.11% __c_mset8
0.03% munmap
  0.03% free_matrix(matrix&)
    0.03% main
======== Data collected at 100Hz frequency
}}
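A report like this can be ranked automatically when it gets long. The following is a minimal Python sketch, not part of NVPROF: the function name <code>top_level_hotspots</code> and the embedded sample text are illustrative assumptions. It assumes that unindented lines are the profiled functions and that indented lines are their callers.

```python
import re

# Sample text modeled on the bottom-up report above.
# Assumption: caller lines are indented, top-level functions are not.
REPORT = """\
======== CPU profiling result (bottom up):
84.25% matvec(matrix const &, vector const &, vector const &)
  84.25% main
9.50% waxpby(double, vector const &, double, vector const &, vector const &)
3.37% dot(vector const &, vector const &)
2.76% allocate_3d_poisson_matrix(matrix&, int)
  2.76% main
0.11% __c_mset8
0.03% munmap
  0.03% free_matrix(matrix&)
    0.03% main
======== Data collected at 100Hz frequency
"""

# Leading whitespace, a percentage, then the function signature.
LINE = re.compile(r"^(\s*)(\d+(?:\.\d+)?)%\s+(.*)$")


def top_level_hotspots(report):
    """Return (percent, name) pairs for unindented entries, largest first."""
    hotspots = []
    for line in report.splitlines():
        match = LINE.match(line)
        # Skip banner lines and indented caller lines.
        if match and match.group(1) == "":
            hotspots.append((float(match.group(2)), match.group(3)))
    return sorted(hotspots, reverse=True)


if __name__ == "__main__":
    for percent, name in top_level_hotspots(REPORT):
        print(f"{percent:6.2f}%  {name}")
```

In this sample, <code>matvec</code> accounts for most of the CPU samples, which suggests it is the first routine to examine when porting to the GPU.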