This repository contains reference materials for the course and instructions on how to set up the tutorial environment.
- The folder `lecture_examples` contains code snippets that were shown during the lecture. These are not part of the tutorial and are stored here for reference only, so that you can try them out at home.
- The folder `tutorial` contains the tutorial exercises. Please follow the steps below for the setup.
Click on the banner below:
- This connects you to the CERN SWAN service, where you will be asked to configure the work environment: in the first dropdown select `102b Cuda 11.7.1 (GPU)` and press "Start my session" at the bottom. You do not have to change the other settings.
- The repository will be automatically cloned to your CERNBox SWAN directory. Navigate to the `tutorial/exercises` folder & have fun!
The goal of the tutorial is to demonstrate the basic use cases of the Python libraries CFFI, CuPy & PyOpenCL. There are 3 exercises; a minimal illustrative sketch of each approach follows the list:
- Exercise 1: This exercise is about speeding up a Python function that computes the n-th Fibonacci number. We will use CFFI to create a C extension module, load it in Python, and test it. Then we compare the execution times of the pure Python and the CFFI implementations.
- Exercise 2: This exercise is a demo of CuPy (`exercise_2a`) and PyOpenCL (`exercise_2b`). We will create and execute a simple elementwise kernel on the GPU and use the math API of the libraries to perform the same operation purely in Python. We then do a simple profiling of the execution time on the GPU.
- Exercise 3: This exercise is about templating, i.e. we create a Python wrapper around a C function. The wrapper takes as input the desired context (`cpu`, `gpu_cuda` or `gpu_opencl`); the script then parses the C function and turns it into a CPU or GPU kernel by adding the relevant qualifiers, loads the relevant library, and executes it on the requested platform. This exercise is meant to show that we can write our performance-critical C code only once and use Python's string-parsing methods to fine-tune it for a specific context.
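
To give a flavour of the Exercise 1 workflow, here is a minimal CFFI (API-mode) sketch. The module name `_fib_cffi` and the iterative C implementation are illustrative choices, not the exercise's actual code.

```python
# build_fib.py -- minimal CFFI sketch (illustrative, not the exercise's code).
from cffi import FFI

ffibuilder = FFI()

# Declare the C signature that will be callable from Python.
ffibuilder.cdef("long fib(int n);")

# Provide the C implementation; cffi compiles it into an extension module.
ffibuilder.set_source(
    "_fib_cffi",  # hypothetical module name
    """
    long fib(int n)
    {
        long a = 0, b = 1;
        for (int i = 0; i < n; ++i) {
            long tmp = a + b;
            a = b;
            b = tmp;
        }
        return a;
    }
    """,
)

if __name__ == "__main__":
    ffibuilder.compile(verbose=True)  # builds the _fib_cffi extension in place
```

Once built, the extension is imported like any other module, which makes a `timeit` comparison against a pure Python implementation straightforward:

```python
import timeit
from _fib_cffi import lib  # the compiled extension from the sketch above

print(lib.fib(40))
print(timeit.timeit(lambda: lib.fib(40), number=100_000))
```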
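
In the same spirit as `exercise_2a`, the sketch below defines a CuPy elementwise kernel, cross-checks it against CuPy's array math API, and times it with CUDA events. The kernel name and the operation are made up for illustration and assume a session with a GPU attached.

```python
# Minimal CuPy sketch (illustrative; not the exercise's exact kernel).
import cupy as cp

# An elementwise kernel written in CUDA C: z = x*x + y for each element.
squared_add = cp.ElementwiseKernel(
    "float32 x, float32 y",  # input arguments
    "float32 z",             # output argument
    "z = x * x + y",         # per-element operation
    "squared_add",           # kernel name
)

x = cp.arange(1_000_000, dtype=cp.float32)
y = cp.ones_like(x)

# Same operation via the custom kernel and via CuPy's array math API.
# This first call also compiles the kernel, so later timings are warm.
z_kernel = squared_add(x, y)
z_api = x * x + y
assert cp.allclose(z_kernel, z_api)

# Simple GPU timing with CUDA events (kernel launches are asynchronous).
start, stop = cp.cuda.Event(), cp.cuda.Event()
start.record()
squared_add(x, y)
stop.record()
stop.synchronize()
print("kernel time: %.3f ms" % cp.cuda.get_elapsed_time(start, stop))
```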
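
Finally, the core idea of Exercise 3 can be sketched with plain string templating. The placeholder names, the `scale` function and the grid-stride loop below are illustrative only, not the exercise's actual wrapper; they just show how one C body can be decorated with the qualifiers each context needs before being handed to CFFI, CuPy or PyOpenCL.

```python
# Sketch of the templating idea (illustrative, not the exercise's wrapper).
from string import Template

C_TEMPLATE = """
${FUNC} void scale(${MEM} float *out, ${MEM} const float *in, float factor, int n)
{
    for (int i = ${START}; i < n; i += ${STEP}) {
        out[i] = factor * in[i];
    }
}
"""

QUALIFIERS = {
    # plain C function, sequential loop, built with CFFI
    "cpu":        {"FUNC": "",           "MEM": "",
                   "START": "0", "STEP": "1"},
    # CUDA kernel with a grid-stride loop, built with CuPy
    "gpu_cuda":   {"FUNC": "__global__", "MEM": "",
                   "START": "blockIdx.x * blockDim.x + threadIdx.x",
                   "STEP": "blockDim.x * gridDim.x"},
    # OpenCL kernel iterating over work-items, built with PyOpenCL
    "gpu_opencl": {"FUNC": "__kernel",   "MEM": "__global",
                   "START": "get_global_id(0)",
                   "STEP": "get_global_size(0)"},
}

def render(context: str) -> str:
    """Return the C source specialised for the requested context."""
    return Template(C_TEMPLATE).substitute(QUALIFIERS[context])

print(render("gpu_opencl"))
```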
Exercises 1 and 3 contain parts where you are asked to fill in some code. For these you may find the solutions in the `tutorial/exercise_solutions` folder.
[1] Programming Massively Parallel Processors, David B. Kirk, Wen-mei W. Hwu
https://safari.ethz.ch/architecture/fall2019/lib/exe/fetch.php?media=2013_programming_massively_parallel_processors_a_hands-on_approach_2nd.pdf
[2] CUDA Programming guide
https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html
[3] OpenCL API Specification
https://registry.khronos.org/OpenCL/
[4] Easy, Effective, Efficient GPU Programming with PyOpenCL and PyCUDA, Andreas Klöckner
https://www.bu.edu/pasi/courses/gpu-programming-with-pyopencl-and-pycuda/
[5] CFFI documentation
https://cffi.readthedocs.io/en/latest/index.html
[6] CuPy documentation
https://docs.cupy.dev/en/stable/index.html
[7] PyOpenCL documentation
https://documen.tician.de/pyopencl/index.html
[8] Tool used to create code snippet images for the lecture
https://carbon.now.sh/