Nvprof: Difference between revisions

=== Summary mode ===
This is the default operating mode for Nvprof. It outputs a single result line for each kernel function and for each type of CUDA memory copy/set performed by the application. For each kernel function or type of memory copy, Nvprof reports the total time across all instances, as well as the average, minimum, and maximum time.
In this example, the application is <code>a.out</code>, and we run Nvprof on it to collect the profile:
{{Command|nvprof ./a.out}}
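To illustrate what a summary line represents, here is a minimal Python sketch of the aggregation described above: per-instance timings are grouped by kernel or memory-copy name and reduced to a total, average, minimum, and maximum. The record names and durations are made-up values for illustration, and <code>summarize</code> is a hypothetical helper; this is not Nvprof's implementation or its output format.

```python
from collections import defaultdict

# Hypothetical per-instance durations in microseconds.
# These are made-up values for illustration, not real Nvprof output.
records = [
    ("my_kernel", 120.0),
    ("my_kernel", 100.0),
    ("[CUDA memcpy HtoD]", 40.0),
    ("my_kernel", 140.0),
    ("[CUDA memcpy HtoD]", 60.0),
]


def summarize(records):
    """Group durations by name and return one summary row per name."""
    grouped = defaultdict(list)
    for name, duration in records:
        grouped[name].append(duration)

    summary = {}
    for name, durations in grouped.items():
        total = sum(durations)
        summary[name] = {
            "calls": len(durations),
            "total": total,
            "avg": total / len(durations),
            "min": min(durations),
            "max": max(durations),
        }
    return summary


summary = summarize(records)
for name, row in summary.items():
    print(name, row)
```

Each key of <code>summary</code> corresponds to one result line in summary mode: however many times a kernel is launched, it contributes a single row.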