Multi-GPU support with dask #179

Intron7 · 2024-04-25T13:07:19Z

This adds dask support

Functions to add:

for more information, see https://pre-commit.ci

ilan-gold

From what I can tell, also CSC is just not really mentioned? It's not supported but maybe we should throw an error or something?

ilan-gold · 2024-11-12T16:58:18Z

pyproject.toml

@@ -37,6 +37,7 @@ doc = [
    "scanpydoc[typehints,theme]>=0.9.4",
    "readthedocs-sphinx-ext",
    "sphinx_copybutton",
+    "dask",


These should be added to the dependencies.

ilan-gold · 2024-11-12T17:00:41Z

src/rapids_singlecell/preprocessing/_hvg.py

+        adata_subset = adata[adata.obs[batch_key] == batch].copy()

        calculate_qc_metrics(adata_subset, layer=layer)
        filt = adata_subset.var["n_cells_by_counts"].to_numpy() > 0
-        adata_subset = adata_subset[:, filt]
+        adata_subset = adata_subset[:, filt].copy()


So this is unrelated to this PR?

ilan-gold · 2024-11-12T17:00:57Z

src/rapids_singlecell/preprocessing/_hvg.py

+        adata_subset = adata[adata.obs[batch_key] == batch].copy()

        calculate_qc_metrics(adata_subset, layer=layer)
        filt = adata_subset.var["n_cells_by_counts"].to_numpy() > 0
-        adata_subset = adata_subset[:, filt]
+        adata_subset = adata_subset[:, filt].copy()


Can we add a test for the "weird stuff?"

ilan-gold · 2024-11-12T17:02:44Z

src/rapids_singlecell/preprocessing/_kernels/_qc_kernels_dask.py

+"""
+
+
+def _sparse_qc_csr_dask_cells(dtype):


I 100% agree with this. If it is a separate PR, that's fine by me. But we need to be able to maintain this library too if a fix needs to be made or something. This sort of thing goes a long way towards making things easier for the next person.

ilan-gold · 2024-11-12T17:03:28Z

src/rapids_singlecell/preprocessing/_normalize.py


+def _normalize_total(X: ArrayTypesDask, target_sum: int):


Are we just completely punting on CSC matrices?

For csc we transform to csr in normalize and thats always done.

ilan-gold · 2024-11-12T17:10:43Z

src/rapids_singlecell/preprocessing/_normalize.py

+        from ._kernels._norm_kernel import _mul_csr
+
+        mul_kernel = _mul_csr(X.dtype)
+        mul_kernel.compile()


I agree with Phil here. Calling compile should be universally applied on first-access as neeeded instead of manually having to remember to do it. We should be trying to make sure that if someone wants to add a new feature to RSC and forgets to call compile, that can't happen if you're out-of-commission.

ilan-gold · 2024-11-12T17:21:10Z

src/rapids_singlecell/preprocessing/_normalize.py

+def _get_target_sum_csr(X: sparse.csr_matrix) -> int:
+    from ._kernels._norm_kernel import _get_sparse_sum_major
+
+    counts_per_cell = cp.zeros(X.shape[0], dtype=X.dtype)
+    sum_kernel = _get_sparse_sum_major(X.dtype)
+    sum_kernel(
+        (X.shape[0],),
+        (64,),
+        (X.indptr, X.data, counts_per_cell, X.shape[0]),
+    )


Please refactor these lines into their own function that can then be applied to each block of the dask array and then do the masking + median at the end after either you do map_blocks or _get_target_sum_csr. We do this "recursive" map_blocks in scanpy a lot and it works very well and makes things very easy to reason about. These functions are basically identical. Especially if compiling really costs nothing extra as you say, then this should be doable. Maybe I'm missing something though. This seems like a good point in favor of "add compile to some always-called function" as Phil was saying

I see where you are coming from with this one. However I feel the compile might make this challenging. I have an idea on how to maybe fix this but this needs some testing and investigations. I could wrap the cuda kernel factory to call compile on the returned kernel. But I don't know if this than executed on the worker or host.

ilan-gold · 2024-11-12T17:21:46Z

src/rapids_singlecell/preprocessing/_normalize.py

+        chunks=(X.chunksize[0],),
+        drop_axis=1,
+    )
+    counts_per_cell = target_sum_chunk_matrices.compute()


Why? Too much for this PR? Make an issue maybe?

ilan-gold · 2024-11-12T17:22:45Z

src/rapids_singlecell/preprocessing/_pca.py

+                svd_solver = "jacobi"
+            pca_func = PCA(n_components=n_comps, svd_solver=svd_solver, whiten=False)
+            X_pca = pca_func.fit_transform(X)
+            X_pca = X_pca.compute_chunk_sizes()


Why do we need to compute chunk sizes here?

Please add a comment

I added a comment for PCA X_pca = X_pca.compute_chunk_sizes().

For the copying in HVG. Since I use calculate_qc for to find where to subset. I wanted to be sure that the original data doesn't get overwritten. I addition to that the dask gpu views are sometimes a bit weird. So copy makes this more solid in general. That applies most to slicing against the minor axis.

I also dont want dask to be a dependency because cuml handels that. I dont want get into problems there.

src/rapids_singlecell/preprocessing/_scale.py

Co-authored-by: Ilan Gold <[email protected]>

Intron7 · 2024-11-13T11:49:38Z

I renamed the functions for QC and renamed some of the variables so its a bit clearer whats happening.

ilan-gold

https://github.com/scverse/rapids_singlecell/pull/179/files#r1838498091 is not done and from what I can tell #179 (review) has not been addressed. What happens if you pass a csc dask array to pca?

Intron7 · 2024-11-14T11:13:14Z

https://github.com/scverse/rapids_singlecell/pull/179/files#r1838498091 is not done and from what I can tell #179 (review) has not been addressed. What happens if you pass a csc dask array to pca?

That will just error. And tell the user to please give me dense or csr as meta. I updated _check_gpu_X to reflect that.

The median I'll test today

ilan-gold · 2024-11-14T17:00:14Z

We should look into the cost of allocating ahead of time for all operations that are currently in-place

Intron7 · 2024-11-21T12:05:21Z

Median out of core is a bad choice. Uses way more memory and is slower. Loose Loose

for more information, see https://pre-commit.ci

add first functions

17df571

Intron7 marked this pull request as draft April 25, 2024 13:08

add hvg part1

40167ca

Intron7 changed the title ~~add first functions~~ Multi-GPU support with dask Apr 30, 2024

Intron7 and others added 10 commits April 30, 2024 12:01

Merge branch 'main' into dask_mg_support

f4db387

Merge branch 'main' into dask_mg_support

6526b42

[pre-commit.ci] auto fixes from pre-commit.com hooks

0cdb85d

for more information, see https://pre-commit.ci

reset to main for hvg

48b68f6

add support for hvg

886cafa

first pass pca

d7bf01e

pca update

b216890

fix bug with csc matrix

cdffd33

add dask to docs

177afa1

add tests

dd1377c

Intron7 added the run-gpu-ci runs GPU CI label May 3, 2024

Intron7 and others added 14 commits May 3, 2024 13:50

update names

e254800

get docs to work

77b3c34

remove client from sparse calc

36bebf9

need dask for docs

82cc22c

Merge branch 'main' into dask_mg_support

7ddde9b

add scale

e33821f

int64 updates

e1e6c19

For main branch

7da41e0

Merge branch 'main' into dask_mg_support

e676dbe

test docs

b6f436f

Merge branch 'main' into dask_mg_support

ef00052

[pre-commit.ci] auto fixes from pre-commit.com hooks

4b22562

for more information, see https://pre-commit.ci

fix import

5ed8e68

fix rebase

b879ea4

Intron7 marked this pull request as ready for review May 13, 2024 14:27

Intron7 added invalid This doesn't seem right run-gpu-ci runs GPU CI labels Nov 12, 2024

github-actions bot removed the run-gpu-ci runs GPU CI label Nov 12, 2024

ilan-gold requested changes Nov 12, 2024

View reviewed changes

Intron7 removed the invalid This doesn't seem right label Nov 13, 2024

Intron7 and others added 4 commits November 13, 2024 12:02

Update src/rapids_singlecell/preprocessing/_scale.py

bb10cda

Co-authored-by: Ilan Gold <[email protected]>

add note

1ae74d7

dask import

38e4ad0

update qc names

9755641

Intron7 requested a review from ilan-gold November 13, 2024 11:48

Merge branch 'main' into dask_mg_support

fe2aa20

Intron7 added the run-gpu-ci runs GPU CI label Nov 13, 2024

github-actions bot removed the run-gpu-ci runs GPU CI label Nov 13, 2024

update

5fd5a97

ilan-gold requested changes Nov 14, 2024

View reviewed changes

update _check_gpu_X

af1faf5

update docs

7366200

Intron7 and others added 2 commits November 15, 2024 12:14

docs update

d1a6344

Merge branch 'main' into dask_mg_support

fb8c825

Intron7 and others added 3 commits November 25, 2024 11:49

make sure dtype is correct PCA

c65585d

Merge branch 'main' into dask_mg_support

b7974f9

[pre-commit.ci] auto fixes from pre-commit.com hooks

e7a1118

for more information, see https://pre-commit.ci

Intron7 added the run-gpu-ci runs GPU CI label Dec 5, 2024

github-actions bot removed the run-gpu-ci runs GPU CI label Dec 5, 2024

Intron7 and others added 2 commits December 6, 2024 14:17

Merge branch 'main' into dask_mg_support

a2107ff

[pre-commit.ci] auto fixes from pre-commit.com hooks

03e601a

for more information, see https://pre-commit.ci

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multi-GPU support with dask #179

Multi-GPU support with dask #179

Intron7 commented Apr 25, 2024 •

edited

Loading

ilan-gold left a comment •

edited

Loading

ilan-gold Nov 12, 2024

ilan-gold Nov 12, 2024

ilan-gold Nov 12, 2024

ilan-gold Nov 12, 2024

ilan-gold Nov 12, 2024

Intron7 Nov 13, 2024

ilan-gold Nov 12, 2024

ilan-gold Nov 12, 2024

Intron7 Nov 13, 2024

ilan-gold Nov 12, 2024

ilan-gold Nov 12, 2024

ilan-gold Nov 12, 2024

Intron7 Nov 13, 2024

Intron7 Nov 13, 2024

Intron7 commented Nov 13, 2024

ilan-gold left a comment

Intron7 commented Nov 14, 2024

ilan-gold commented Nov 14, 2024

Intron7 commented Nov 21, 2024

		"""


		def _sparse_qc_csr_dask_cells(dtype):

Multi-GPU support with dask #179

Are you sure you want to change the base?

Multi-GPU support with dask #179

Conversation

Intron7 commented Apr 25, 2024 • edited Loading

ilan-gold left a comment • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Intron7 commented Nov 13, 2024

ilan-gold left a comment

Choose a reason for hiding this comment

Intron7 commented Nov 14, 2024

ilan-gold commented Nov 14, 2024

Intron7 commented Nov 21, 2024

Intron7 commented Apr 25, 2024 •

edited

Loading

ilan-gold left a comment •

edited

Loading