38,760
edits
(Updating to match new version of source page) |
(Updating to match new version of source page) |
||
Line 46: | Line 46: | ||
|content= | |content= | ||
For the purpose of this tutorial, we use several profilers as described below: | For the purpose of this tutorial, we use several profilers as described below: | ||
* PGPROF - a powerful and simple analyzer for parallel programs written with OpenMP or OpenACC directives, or with CUDA | * PGPROF - a powerful and simple analyzer for parallel programs written with OpenMP or OpenACC directives, or with [https://en.wikipedia.org/wiki/CUDA CUDA]. | ||
We note that [http://www.pgroup.com/support/download_pgi2016.php?view=current Portland Group Profiler] is free for academic usage. | We note that [http://www.pgroup.com/support/download_pgi2016.php?view=current Portland Group Profiler] is free for academic usage. | ||
* NVIDIA Visual Profiler NVVP - a cross-platform analyzing tool for the codes written with OpenACC and CUDA C/C++ instructions | * NVIDIA Visual Profiler NVVP - a cross-platform analyzing tool for the codes written with OpenACC and CUDA C/C++ instructions. | ||
* NVPROF - a command line text-based version of the NVIDIA Visual Profiler | * NVPROF - a command line text-based version of the NVIDIA Visual Profiler. | ||
}} | }} | ||