Bureaucrats, cc_docs_admin, cc_staff
337
edits
No edit summary |
No edit summary |
||
Line 74: | Line 74: | ||
[[File:Nvvp-pic0.png|thumbnail|300px|The NVVP profiler|right ]] | [[File:Nvvp-pic0.png|thumbnail|300px|The NVVP profiler|right ]] | ||
[[File:Nvvp-pic1.png|thumbnail|300px|Browse for executable you want to profile|right ]] | [[File:Nvvp-pic1.png|thumbnail|300px|Browse for executable you want to profile|right ]] | ||
== NVIDIA NVPROF Command Line Profiler == | |||
NVIDIA also provides a command line version called NVPROF, similar to GPU prof | |||
{{Command | |||
|nvprof --cpu-profiling on ./cgi.x | |||
|result= | |||
<Program output > | |||
======== CPU profiling result (bottom up): | |||
84.25% matvec(matrix const &, vector const &, vector const &) | |||
84.25% main | |||
9.50% waxpby(double, vector const &, double, vector const &, vector const &) | |||
3.37% dot(vector const &, vector const &) | |||
2.76% allocate_3d_poisson_matrix(matrix&, int) | |||
2.76% main | |||
0.11% __c_mset8 | |||
0.03% munmap | |||
0.03% free_matrix(matrix&) | |||
0.03% main | |||
======== Data collected at 100Hz frequency | |||
}} |