Skip to content

Latest commit

 

History

History
41 lines (34 loc) · 4.43 KB

File metadata and controls

41 lines (34 loc) · 4.43 KB

Introduction to Parallel Programming for GPUs with CUDA

The goals for this CUDA tutorial are to gain a comprehensive understanding of the CUDA programming model, including SIMT threading, kernel structure, and GPU memory hierarchy, as well as practical knowledge of memory optimization techniques like global memory coalescing and shared memory bank conflicts.

Topics

  1. Chapter 1: CPU vs GPU Architecture and Performance

  2. Chapter 2: CUDA Programming In Practice

  3. Chapter 3: GPU Memory Hierarchy and Optimization

References:

  • Nvidia Cuda Programming Guide
  • CS 4230/6230 (Parallel and High-Performance Computing) Lectures by Professor P. Sadayappan, The University of Utah

By Omid Asudeh