OpenACC Tutorial - Adding directives/fr: Difference between revisions

Created page with "Cliquez pour agrandir."
(Created page with "Voici le résultat pour la version OpenACC : {{Command |./challenge |result=Rows: 8120601, nnz: 218535025 Iteration: 0, Tolerance: 4.0067e+08 Iteration: 10, Tolerance: 1...")
(Created page with "Cliquez pour agrandir.")
Line 259:
Total Iterations: 100 Total Time: 115.068931s
}}
}}
[[File:Openacc profiling1.png|thumbnail|Click to enlarge]]
[[File:Openacc profiling1.png|thumbnail|Cliquez pour agrandir.]]
The results are correct. However, not only do we get no speedup, we actually get a slowdown by a factor of almost 4! Let's profile the code again using NVIDIA's Visual Profiler (<tt>nvvp</tt>). This can be done with the following steps:
# Start <tt>nvvp</tt> with the command <tt>nvvp &</tt> (the trailing <tt>&</tt> starts it in the background)