Here is the result for the OpenACC version:
{{Command
|./challenge
|result=Rows: 8120601, nnz: 218535025
Iteration: 0, Tolerance: 4.0067e+08
Iteration: 10, Tolerance: 1...
Total Iterations: 100 Total Time: 115.068931s
}}
[[File:Openacc profiling1.png|thumbnail|Click to enlarge.]]
The results are correct. However, instead of a speedup, we get a slowdown by a factor of almost 4! Let's profile the code again using NVIDIA's Visual Profiler (<tt>nvvp</tt>). This can be done with the following steps:
# Start <tt>nvvp</tt> with the command <tt>nvvp &</tt> (the <tt>&</tt> sign starts it in the background)