This course focuses on parallelization of programs using CUDA C on NVIDIA GPUs. Includes:
-Machine Problems (mini-labs)
- [x] MP0 - Parallel vector addition
- [x] MP1 - Matrix multiplication
- [x] MP2 - Tiled Matrix multiplication
- [x] MP3 - 2D Convolution
- [x] MP4-1 - Data Parallel Reduction
- [x] MP4-2 - Parallel Prefix Sum (Scan)
- [x] MP5 - Histogramming
- [x] MP6 - Sparse Matrix-Vector Multiplication
-Final Project
- [x] Groupby with value operations