CudaDevice::htod_copy_into is unsound #314

Open
CatsAreFluffy opened this issue Jan 20, 2025 · 3 comments

Comments

@CatsAreFluffy
Contributor

If two htod_copy_into calls to the same CudaSlice are issued in quick succession, the source buffer for the first copy may be freed before that copy completes. I haven't been able to find an example that actually misbehaves because of this, though.
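A sketch of the pattern in question, assuming roughly the current driver API (CudaDevice::new, alloc_zeros, and an htod_copy_into that takes the source Vec by value and a &mut CudaSlice; check the cudarc docs for the exact signatures):

use cudarc::driver::CudaDevice;

fn main() -> Result<(), cudarc::driver::DriverError> {
    let dev = CudaDevice::new(0)?;
    let mut dst = dev.alloc_zeros::<f32>(1024)?;

    // First copy: the Vec is moved in and stashed in the CudaSlice's host_buf
    // so it stays alive while the (potentially asynchronous) copy runs.
    dev.htod_copy_into(vec![1.0f32; 1024], &mut dst)?;

    // Second copy, issued immediately: it replaces host_buf, dropping the first
    // Vec even though the first copy may not have completed yet.
    dev.htod_copy_into(vec![2.0f32; 1024], &mut dst)?;

    dev.synchronize()?;
    Ok(())
}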

@coreylowman
Owner

Ah, are you saying this because we overwrite the first owned vec with the second in dst.host_buf = Some(Pin::new(src));?

I suppose that's true. Perhaps we should add:

if self.host_buf.is_some() {
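    // A previous copy's host buffer is still queued; wait for the device
    // before overwriting (and freeing) it.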
    self.synchronize()?;
}

@CatsAreFluffy
Contributor Author

You could also do it by creating an event when you copy, storing it in the CudaSlice, and synchronizing on it whenever you drop the host memory. This would block execution less if more work is queued between the copies. Synchronization also needs to happen for anything else that can free the host memory (e.g. leak and drop), so I think the easiest way to do that would be a struct containing an event and a Pin<Vec<_>> that synchronizes on the event when it's dropped, and to use that for storing the host memory.
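A minimal sketch of that idea; CudaEvent and DriverError here are hypothetical stand-ins, not the actual cudarc types:

use std::pin::Pin;

// Hypothetical stand-ins for illustration only; the real cudarc event API may differ.
struct CudaEvent;
#[derive(Debug)]
struct DriverError;

impl CudaEvent {
    // Would wrap cuEventSynchronize: block the host until the event has completed.
    fn synchronize(&self) -> Result<(), DriverError> {
        Ok(())
    }
}

// Host buffer that cannot be freed before the copy reading from it has finished.
struct HostBuf<T> {
    #[allow(dead_code)]
    buf: Pin<Vec<T>>,
    // Event recorded on the stream right after the async htod copy was issued.
    copy_done: CudaEvent,
}

impl<T> Drop for HostBuf<T> {
    fn drop(&mut self) {
        // Whether the buffer is dropped by a second htod_copy_into, by leak(),
        // or by dropping the CudaSlice, the Vec's memory is only released after
        // the device has finished reading from it.
        let _ = self.copy_done.synchronize();
    }
}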

Also, now that I think about it, I probably wasn't able to cause issues with this because copies from unpinned memory (in the CUDA sense) are always synchronous, and Vecs are always allocated in unpinned memory, so you can't actually get any use-after-frees. I'm not sure it's a good idea to rely on that in general, though.

@coreylowman
Owner

coreylowman commented Jan 21, 2025

Oh yeah, good points. For future reference: while we are using Rust-side pins, this is not the same as CUDA-side pinned memory (which requires using a specific API to allocate the memory; see #80).
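For instance (a sketch; it only shows the Rust-side Pin, since page-locked allocation goes through a separate CUDA driver/runtime API):

use std::pin::Pin;

fn main() {
    // Rust-side pinning: the Vec can't be moved out through the Pin, but its
    // pages are still ordinary pageable host memory as far as CUDA is concerned.
    let host: Pin<Vec<f32>> = Pin::new(vec![0.0f32; 1024]);

    // CUDA-side pinned (page-locked) memory is different: it has to be allocated
    // or registered through a dedicated API (cudaMallocHost / cudaHostRegister),
    // and only then can htod copies run truly asynchronously w.r.t. the host.
    drop(host);
}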
