Coursework for UMN EE5351 Applied Parallel Programming

This course focuses on parallelizing programs with CUDA C on NVIDIA GPUs. The repository includes:

- Machine Problems (mini-labs)
  - [x] MP0 - Parallel vector addition
  - [x] MP1 - Matrix multiplication
  - [x] MP2 - Tiled matrix multiplication
  - [x] MP3 - 2D Convolution
  - [x] MP4-1 - Data Parallel Reduction
  - [x] MP4-2 - Parallel Prefix Sum (Scan)
  - [x] MP5 - Histogramming
  - [x] MP6 - Sparse Matrix-Vector Multiplication

- Final Project
  - [x] Groupby with value operations