OpenACC Tutorial: Difference between revisions
(Added empty line after paragraph - Updated links) |
(Marked this version for translation) |
||
(One intermediate revision by one other user not shown) | |||
Line 7: | Line 7: | ||
<!--T:2--> | <!--T:2--> | ||
OpenACC is an application programming interface (API) for porting code onto accelerators such as GPU and coprocessors. It has been developed by Cray, CAPS, NVidia and PGI. Like in [[OpenMP]], the programmer annotates C, C++ or Fortran code to identify portions that should be parallelized by the compiler. | OpenACC is an application programming interface (API) for porting code onto accelerators such as GPU and coprocessors. It has been developed by Cray, CAPS, NVidia and PGI. Like in [[OpenMP]], the programmer annotates C, C++ or Fortran code to identify portions that should be parallelized by the compiler. | ||
<!--T:10--> | |||
A self-paced course on this topic is available from SHARCNET: [https://training.sharcnet.ca/courses/enrol/index.php?id=173 Introduction to GPU Programming]. | |||
</translate> | </translate> | ||
Latest revision as of 16:56, 16 October 2024
This tutorial is strongly inspired from the OpenACC Bootcamp session presented at GPU Technology Conference 2016.
OpenACC is an application programming interface (API) for porting code onto accelerators such as GPU and coprocessors. It has been developed by Cray, CAPS, NVidia and PGI. Like in OpenMP, the programmer annotates C, C++ or Fortran code to identify portions that should be parallelized by the compiler.
A self-paced course on this topic is available from SHARCNET: Introduction to GPU Programming.
This tutorial uses OpenACC to accelerate C, C++ or Fortran code. A working knowledge of one of these languages is therefore required to gain the most benefit out of it.
This tutorial is based on examples. You can download all of the examples in this Github repository.
Lesson plan
- Introduction
- Gathering a profile and getting compiler information
- Expressing parallelism with OpenACC directives
- Expressing data movement
- Optimizing loops
External references
Here are some useful external references: