Parallelization in Python

Summary

This repository provides a template for parallelizing tasks in Python that can be easily adapted to your application. All methods are based on the multiprocessing module.

How to parallelize tasks in Python

The multiprocessing module lets you parallelize tasks in Python with

from multiprocessing import Pool

# run_subtask is applied to each element of arguments by a pool of worker processes
with Pool() as pool:
    output = pool.map(run_subtask, arguments)

and for tasks with multiple input arguments, where each element of arguments is a tuple of parameters, you can use

output = pool.starmap(run_subtask, arguments)

instead.
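To make the two calls above concrete, here is a minimal self-contained sketch. The subtasks square and scale, and the wrappers run_map and run_starmap, are hypothetical names chosen for illustration and are not taken from this repository's main.py; the pool size of 2 is likewise an arbitrary assumption.

```python
from multiprocessing import Pool


def square(value):
    """Hypothetical single-argument subtask."""
    return value * value


def scale(value, factor):
    """Hypothetical two-argument subtask."""
    return value * factor


def run_map(values):
    # One call to square per element, distributed over the worker processes.
    with Pool(processes=2) as pool:
        return pool.map(square, values)


def run_starmap(pairs):
    # Each tuple in pairs is unpacked into the positional arguments of scale.
    with Pool(processes=2) as pool:
        return pool.starmap(scale, pairs)


if __name__ == "__main__":
    print(run_map([1, 2, 3, 4]))                     # [1, 4, 9, 16]
    print(run_starmap([(1, 10), (2, 10), (3, 10)]))  # [10, 20, 30]
```

The subtasks are defined at module level so that they can be pickled and sent to the workers, and the `if __name__ == "__main__":` guard keeps the pool from being re-created when worker processes import the module, which matters on platforms that start workers with the spawn method.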

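There are also alternatives to the pool.map method, such as pool.map_async, pool.imap, and pool.imap_unordered, which may be more suitable in terms of speed and memory usage depending on your application. pool.map_async returns a result object immediately instead of blocking, while pool.imap and pool.imap_unordered return iterators that yield results as they become available, the latter in completion order rather than input order. For more details about these methods, see this Stack Overflow post or the multiprocessing documentation.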
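The differences between these alternatives can be sketched as follows. This is an illustrative example only: run_subtask here is a hypothetical stand-in for real work, and the helper names, pool size, chunk size, and timeout are assumptions, not part of the repository.

```python
from multiprocessing import Pool


def run_subtask(x):
    """Hypothetical subtask standing in for real work."""
    return x + 1


def collect_unordered(values):
    # imap_unordered yields each result as soon as a worker finishes it,
    # so the order is not guaranteed; sort afterwards if the order matters.
    with Pool(processes=2) as pool:
        return sorted(pool.imap_unordered(run_subtask, values))


def collect_in_order(values):
    # imap yields results one at a time in input order, so the full
    # result list does not have to be held in memory at once.
    with Pool(processes=2) as pool:
        return [result for result in pool.imap(run_subtask, values, chunksize=2)]


def collect_async(values):
    # map_async returns immediately; get() blocks until the results are ready.
    with Pool(processes=2) as pool:
        async_result = pool.map_async(run_subtask, values)
        # Other work could be done here while the workers are busy.
        return async_result.get(timeout=30)


if __name__ == "__main__":
    print(collect_unordered(range(5)))
    print(collect_in_order(range(5)))
    print(collect_async(range(5)))
```

Note that the iterators are consumed inside the `with` block, because leaving the block terminates the pool.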

Warning - Shared databases or data structures

When parallelizing tasks, you have to be especially careful with subtasks that need access to the same data structures (e.g. tasks that write to the same database). In this case, it may be necessary to adjust these methods further to avoid collisions between the parallel processes. For more information, see the multiprocessing documentation. However, if you are parallelizing subtasks that do not need access to shared data structures (e.g. Monte Carlo simulations), you can simply use the methods above to speed up your computations.
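One possible adjustment, shown here only as a sketch, is to guard the shared resource with a multiprocessing.Lock that is handed to each worker through the pool initializer. The shared resource in this example is a plain text file standing in for a database, and init_worker, append_result, and run_with_shared_file are hypothetical names; a real database would typically need its own connection handling and transaction logic.

```python
import os
import tempfile
from multiprocessing import Lock, Pool

_lock = None  # set in every worker process by init_worker


def init_worker(lock):
    """Store the shared lock in a module-level variable of the worker."""
    global _lock
    _lock = lock


def append_result(task):
    """Hypothetical subtask: compute a value and append it to a shared file."""
    path, value = task
    result = value * value
    # Only one process at a time may write to the shared file.
    with _lock:
        with open(path, "a") as handle:
            handle.write(f"{result}\n")
    return result


def run_with_shared_file(path, values):
    lock = Lock()
    # The lock is passed at worker start-up, not as a task argument.
    with Pool(processes=2, initializer=init_worker, initargs=(lock,)) as pool:
        pool.map(append_result, [(path, value) for value in values])
    with open(path) as handle:
        return sorted(int(line) for line in handle)


if __name__ == "__main__":
    fd, path = tempfile.mkstemp()
    os.close(fd)
    print(run_with_shared_file(path, range(4)))  # [0, 1, 4, 9]
    os.remove(path)
```

The lock is passed through initargs because lock objects cannot be sent to pool workers as ordinary task arguments. The lines in the file may appear in any order, which is why the sketch sorts them when reading the file back.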


JurajZelman/py-parallelization

Folders and files

Latest commit

History

Repository files navigation

Parallelization in Python

Summary

How to parallelize tasks in Python

Warning - Shared databases or data structures

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages