[BUG] <numBlocks in Y dimension is larger than needed for FetchOnDemand_no_fusion> #323

yokosyun · 2024-08-12T01:40:40Z

Is there an existing issue for this?

I have searched the existing issues

Current Behavior

fetch_on_demand_gemm_no_fusion have wrong numBlocks in Y dim.
Thus there is unnecessary Block execution.

cur_nnz is divided by 16(BLOCK_SIZE)

fetch_on_demand_gemm_no_fusion_fp32_1<16, 4, 8>
            <<<dim3(DIV_UP(out_channel, 16), DIV_UP(cur_nnz, 16), 1),
               dim3(16, 16, 1)>>>

Expected Behavior

it must be divided by (16(BLOCK_SIZE)*4(N_LOOP)) to be correct numBlocks in Y dim

fetch_on_demand_gemm_no_fusion_fp32_1<16, 4, 8>
            <<<dim3(DIV_UP(out_channel, 16), DIV_UP(cur_nnz, 16 * N_LOOP), 1),
               dim3(16, 16, 1)>>>

Environment

- GCC:
- NVCC:
- PyTorch:
- PyTorch CUDA:
- TorchSparse:

Anything else?

We can't make a bugfix PR?

The text was updated successfully, but these errors were encountered:

zhijian-liu assigned ys-2020 Aug 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] <numBlocks in Y dimension is larger than needed for FetchOnDemand_no_fusion> #323

[BUG] <numBlocks in Y dimension is larger than needed for FetchOnDemand_no_fusion> #323

yokosyun commented Aug 12, 2024 •

edited

Loading

[BUG] <numBlocks in Y dimension is larger than needed for FetchOnDemand_no_fusion> #323

[BUG] <numBlocks in Y dimension is larger than needed for FetchOnDemand_no_fusion> #323

Comments

yokosyun commented Aug 12, 2024 • edited Loading

Is there an existing issue for this?

Current Behavior

Expected Behavior

Environment

Anything else?

yokosyun commented Aug 12, 2024 •

edited

Loading