UCT/CUDA: Detect sys_dev for async allocations by brminich · Pull Request #10607 · openucx/ucx

brminich · 2025-04-04T16:31:28Z

What?

Currently, asynchronous memory can be detected as CUDA-managed, with its sys_dev set to unknown. However, for async VMM memory, knowing the sys_dev is crucial to identify the correct CUDA context for executing cuMemcpyAsync.

Why?

Test from #10601 fails when sending an eager message from legacy pinned memory to asynchronous VMM memory (on isr1). On the receiver side, the destination buffer is detected as CUDA-managed with an unknown sys_device, preventing the cuda_copy transport from selecting the correct context for the VMM allocation.

Currently, asynchronous memory can be detected as CUDA-managed, with its sys_dev set to unknown. However, for async VMM memory, knowing the sys_dev is crucial to identify the correct CUDA context for executing cuMemcpyAsync.

brminich changed the title ~~UCT/CUDA: Detect sys_dev for sync allocations~~ UCT/CUDA: Detect sys_dev for async allocations Apr 4, 2025

UCT/CUDA: Detect sys_dev for async allocations

88de9de

Currently, asynchronous memory can be detected as CUDA-managed, with its sys_dev set to unknown. However, for async VMM memory, knowing the sys_dev is crucial to identify the correct CUDA context for executing cuMemcpyAsync.

brminich force-pushed the uct/fix_async_mem_detection branch from d8ebb65 to 88de9de Compare April 4, 2025 16:32

rakhmets approved these changes Apr 4, 2025

View reviewed changes

yosefe approved these changes Apr 7, 2025

View reviewed changes

yosefe merged commit 194aec9 into openucx:master Apr 7, 2025
151 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

UCT/CUDA: Detect sys_dev for async allocations#10607

UCT/CUDA: Detect sys_dev for async allocations#10607
yosefe merged 1 commit intoopenucx:masterfrom
brminich:uct/fix_async_mem_detection

brminich commented Apr 4, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

brminich commented Apr 4, 2025

What?

Why?

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants