Translations:OpenACC Tutorial - Adding directives/37/en

From Alliance Doc
Jump to navigation Jump to search

This will run the program and generate a timeline of the execution. The resulting timeline is illustrated on the image on the right side. As we can see, almost all of the run time is being spent transferring data between the host and the device. This is very often the case when one ports a code from CPU to GPU. We will look at how to optimize this in the next part of the tutorial.