Fundamentals of Accelerated Computing with CUDA C/C++ (FACCC) – Outline

Detailed Course Outline

Introduction

  • Meet the instructor.
  • Create an account at courses.nvidia.com/join

Accelerating Applications with CUDA C/C++

  • Learn the essential syntax and concepts to be able to write GPU-enabled C/C++ applications with CUDA:
  • Write, compile, and run GPU code.
  • Control parallel thread hierarchy.
  • Allocate and free memory for the GPU.

Managing Accelerated Application Memory with CUDA C/C++

  • Learn the command-line profiler and CUDA-managed memory, focusing on observation-driven application improvements and a deep understanding of managed memory behavior:
  • Profile CUDA code with the command-line profiler.
  • Go deep on unified memory.
  • Optimize unified memory management.

Asynchronous Streaming and Visual Profiling for Accelerated Applications with CUDA C/C++

  • Identify opportunities for improved memory management and instruction-level parallelism:
  • Profile CUDA code with NVIDIA Nsight Systems.
  • Use concurrent CUDA streams.

Final Review

  • Review key learnings and wrap up questions.
  • Complete the assessment to earn a certificate.
  • Take the workshop survey.