[WIP] Win32 Handle extension #442

Agrael1 · 2024-08-25T22:27:16Z

No description provided.

Agrael1 · 2024-08-26T10:45:34Z

Question: Do I add vkGetMemoryWin32HandleKHR into allocator functions, or do I add it as a function argument?
First approach is clean wrt argument count, but if I don't have Win32 platform enabled this is kinda pointless. Second is less appealing, but safe, because it relies solely on user providing function, being static or dynamic

adam-sawicki-a · 2024-08-26T10:50:46Z

I would add it to the struct with functions, just like all other Vulkan functions.

adam-sawicki-a · 2024-08-26T11:33:00Z

Is it safe to use an atomic? What if 2 threads want to fetch the handle at the same time? Shouldn't you use a mutex around the whole fetching logic?

Agrael1 · 2024-08-26T11:44:32Z

Is it safe to use an atomic? What if 2 threads want to fetch the handle at the same time? Shouldn't you use a mutex around the whole fetching logic?

No, atomics are safe. I also did the relax ordering, because we have no memory to make barrier onto.
If 2 threads are accessing there are 2 situations that may happen:

already assigned -> easy duplicate
not assigned: -> creates handle (maybe twice, but who cares)
then there is compare_exchange, which is atomic, and if the underlying vlue changes it returns false.
One compare_exchange always succeeds, others will return false, and handle will not be reassigned. If there is a driver that shouts error (didn't find any major one) this is also OK, since that means there is handle somewhere already. Any excessive HANDLES are removed. We also provide handles for other processes, that's why there is that AMD fix there.

Actually I could tinker with a lot of atomics in this lib and make it lock free. There are lock free queues there, so maybe one of the next steps :)

Agrael1 · 2024-08-26T11:46:59Z

I actually asked Vulkan if that's OK to call on vkGetMemoryWin32HandleKHR multiple times. They pointed out that this is UB and not an error, meaning there is no harm in it, just we have no clue what's gonna be returned.

adam-sawicki-a · 2024-08-26T11:51:38Z

Isn't this whole effort about making sure we don't call vkGetMemoryWin32HandleKHR twice for a device memory object? As you posted the validation message in the other ticket:

VUID-VkMemoryGetWin32HandleInfoKHR-handleType-00663
If handleType is defined as an NT handle, vkGetMemoryWin32HandleKHR must be called no more than once for each valid unique combination of memory and handleType

If so, isn't your current code unsafe in this regard when called from multiple threads?

When talking about an UB, we typically understand that anything can happen, including entire app or system crash, so is there really "no harm" when we trigger an an UB?

Agrael1 · 2024-08-26T11:56:24Z

Hmm, you're right, though there was a limitation for me, hence why I chose to use atomics, and that being absence of mutex on dedicated memory. Should I add one?

adam-sawicki-a · 2024-08-26T11:58:34Z

Yes please, I think you need a mutex for this to be safe.

Another topic to think about: What if an allocation ends up as a dedicated allocation?

Agrael1 · 2024-08-26T12:01:05Z

If it has VkDeviceMemory it is not a problem. There is no mutex in this structure, so I think I will add one

Agrael1 · 2024-08-26T12:12:30Z

Added a mutex. There is a really low chance in current setup that it will be used, so it imposes little overhead

adam-sawicki-a · 2024-08-26T12:21:21Z

It seems you reformatted the entire file with some automatic formatting tool! Please don't do this. It makes it difficult to see what are the real differences you introduce to the code.

Can you please surround all your code with an #ifdef telling about the availability of the VK_KHR_external_memory_win32 extension, similar to e.g. VMA_EXTERNAL_MEMORY?

Agrael1 · 2024-08-26T12:21:49Z

crap, reflexes

Agrael1 · 2024-08-26T12:46:42Z

Done

include/vk_mem_alloc.h

Agrael1 · 2024-08-27T08:41:07Z

Hopefully I defined correct one. Also, I have a linker error with static function, seems like it is only possible to get the function dynamically

adam-sawicki-a · 2024-08-27T08:43:30Z

Yes, functions from extensions can only be fetched dynamically.

Agrael1 · 2024-08-27T08:44:25Z

do I set it to null, or try to fetch with dynamic code?

adam-sawicki-a · 2024-08-27T08:47:56Z

Please do like the other extensions do - for Static don't do anything, for Dynamic try to fetch the function pointer, for ValidateVulkanFunctions - assert if not null if the extension is enabled.

adam-sawicki-a · 2024-08-27T08:50:51Z

Are you willing to develop some simple test for this new functionality?

Agrael1 · 2024-08-27T08:52:07Z

Sure

Agrael1 · 2024-08-27T09:31:35Z

I wonder, why don't we allow for dedicated allocations to have custom pNext?

adam-sawicki-a · 2024-08-27T09:34:33Z

I wonder, why don't we allow for dedicated allocations to have custom pNext?

To avoid feature creep and not extend the VmaAllocationCreateInfo further. Please note that dedicated allocations can also be created in a custom pool.

Agrael1 · 2024-08-27T10:05:58Z

What do I specify at memoryTypeIndex on pool?

adam-sawicki-a · 2024-08-27T10:08:06Z

To create a custom pool you need to find the right memory type and choose it explicitly:
https://gpuopen-librariesandsdks.github.io/VulkanMemoryAllocator/html/custom_memory_pools.html#custom_memory_pools_MemTypeIndex

Agrael1 · 2024-08-27T10:34:45Z

I added code for fetching the extension and creating a pool. Can't launch test, however, it throws at some random point

Agrael1 · 2024-08-27T10:44:30Z

Fixed, working like a charm

adam-sawicki-a · 2024-08-27T12:11:08Z

Thank you for all this code! I need to analyze it some more, but it looks good overall.

There is one more thing I would like to have. I don't want the VmaAllocation_T structure to grow in size. As we need to add a HANDLE to it, I'm thinking about defining some new structure like VmaAllocationExtraData that would hold the new HANDLE and the old m_pMappedData, while an allocation object (m_DedicatedAllocation member) would contain a pointer to it. For most allocations, the pointer would be null. Only when a dedicated allocation needs to store a non-null mapped pointer and/or a Win32 handle, it would dynamically allocate the structure.

Would you like to implement such thing?

Agrael1 · 2024-08-27T12:43:03Z

Well, I'd take it to the new PR.
I'd also like to do several things:

Slice the library into different files for easier development, and creating a generator to concatenate them to a single file.
Make use of inline and make lib pure header only
Make 2 targets, 1 is static and 2 is header only cmake target
Formatting bot + .clang-format

Tell me if you are interested in some of them :)

Agrael1 · 2024-08-27T12:50:30Z

Also VmaDeviceMemoryBlock has the same m_pMappedData and handle, should that also be replaced?

adam-sawicki-a · 2024-08-27T12:53:17Z

OK, I'll analyze your code soon and hopefully I merge it. I agree the optimization to extract new structure VmaAllocationExtraData is better done as a separate task.

I am sorry but I am not interested in any of the big changes you proposed. The library is now mature and used by many developers. I don't want to do a revolution that could break anyone's workflow. If you want to introduce such big changes to the code, please feel free to maintain your own fork. I am happy to link to it from README.

About the Cmake script especially, I don't want to make more major changes, because this is a never-ending story of pull requests from users with various visions about how it should look like. The more such requests I get, the closer I am to removing the Cmake script completely. I regret adding it in the first place. I agree with the arguments of @ocornut about the value of not having it, as mentioned in ocornut/imgui#7892, section "3.6. Lack Of A Build System Is A Feature". Please also note that VMA ships with the Vulkan SDK as "vk_mem_alloc.h" file only, not the entire repository and no Cmake script.

adam-sawicki-a · 2024-08-27T12:53:53Z

VmaDeviceMemoryBlock doesn't need to have the mapped pointer extracted to a separate structure because I am not concerned about adding a new member to this class. Only the allocation object should stay lightweight.

Agrael1 · 2024-08-27T12:56:34Z

Ah, ok. I thought about making Wisdom Memory allocator, but that would require me to also get D3D12 MA shredded, which is a ton of work.

Maybe one day.

adam-sawicki-a · 2024-08-27T12:59:56Z

Do you use D3D12MA library as well? Extracting common code (the core TLSF allocation algorithm) from both libraries is another major refactoring (after splitting the library into multiple source files) that is possible but I don't want to do. If you do it on your fork, that would be great.

Agrael1 · 2024-08-27T13:04:30Z

I have Wisdom library, which is super thin compared to others, but super demanding in terms of Vulkan (works no less than 1.3 and requires heavy extensions). I stumbled upon both allocators, and now they are part of its standard. If there is a wish, I could do the Wisdom memory allocator and do the extraction for both.

adam-sawicki-a · 2024-08-27T13:06:27Z

It is up to you. From my perspective, it is good if we just have the change to VMA discussed in this PR.

Agrael1 · 2024-08-27T13:08:12Z

Will do. I'd also like to have Wisdom mentioned :)
It is alpha, but soon it will be finished

adam-sawicki-a · 2024-08-28T11:40:01Z

Thank you very much for this extensive and high quality contribution.

You didn't test dedicated allocations. Because of that, you didn't meet the bug that happens because VmaAllocation objects are created using a custom CPU memory pool and don't get their constructors and destructors called, so the handle of a dedicated allocation could have garbage data. This is another argument to keep the handle, together with the mapped pointer, in a new dynamically allocated structure VmaAllocationExtraData. I added it.

I also made some other fixes and written documentation for the new feature. Please let me know if it looks good to you.

Agrael1 · 2024-08-28T11:52:45Z

I need to finish one project, then I will continue. Where can I find new documentation?

Glad I could help!

adam-sawicki-a · 2024-08-28T11:54:46Z

It is already part of the documentation available online, most notably:
https://gpuopen-librariesandsdks.github.io/VulkanMemoryAllocator/html/vk_khr_external_memory_win32.html
https://gpuopen-librariesandsdks.github.io/VulkanMemoryAllocator/html/group__group__alloc.html#ga8d327b7458d8cf426b84b5efba9bb9bf

Agrael1 · 2024-08-28T11:55:54Z

Ah, I see, you have already implemented the data pointer, well, then if something comes, I could make a feature or two

Agrael1 · 2024-08-28T11:57:49Z

Wow, documentation looks awesome!

Agrael1 added 2 commits August 26, 2024 00:25

basic outline of the Win32 handle extension

746c651

made handle atomic

33bd6c6

Agrael1 marked this pull request as ready for review August 26, 2024 10:42

added vkGetMemoryWin32HandleKHR to functions

8c665c4

extra safety

c41e3fb

Agrael1 force-pushed the master branch from a069b40 to c41e3fb Compare August 26, 2024 12:14

New guard, fixed ABI issues

9402a6b

adam-sawicki-a requested changes Aug 27, 2024

View reviewed changes

documentation

65afd9e

Agrael1 force-pushed the master branch from a933d79 to 65afd9e Compare August 27, 2024 08:42

dynamic fetching of function

0c8feb2

Tests, documentation and fix

c9b2a6a

Agrael1 force-pushed the master branch from 0fb2980 to c9b2a6a Compare August 27, 2024 09:28

extension enabled

2683cfe

Fixed test not compiling shaders

e962c8c

Agrael1 force-pushed the master branch from 81d3192 to e962c8c Compare August 27, 2024 10:52

adam-sawicki-a merged commit 0d55cf5 into GPUOpen-LibrariesAndSDKs:master Aug 28, 2024

adam-sawicki-a mentioned this pull request Aug 28, 2024

Exporting Memory handle #440

Closed

adam-sawicki-a added feature Adding new feature next release To be done as soon as possible labels Aug 28, 2024

[WIP] Win32 Handle extension #442

[WIP] Win32 Handle extension #442

Conversation

Agrael1 commented Aug 25, 2024

Agrael1 commented Aug 26, 2024

adam-sawicki-a commented Aug 26, 2024

adam-sawicki-a commented Aug 26, 2024

Agrael1 commented Aug 26, 2024

Agrael1 commented Aug 26, 2024

adam-sawicki-a commented Aug 26, 2024

Agrael1 commented Aug 26, 2024

adam-sawicki-a commented Aug 26, 2024

Agrael1 commented Aug 26, 2024

Agrael1 commented Aug 26, 2024

adam-sawicki-a commented Aug 26, 2024

Agrael1 commented Aug 26, 2024

Agrael1 commented Aug 26, 2024

Agrael1 commented Aug 27, 2024

adam-sawicki-a commented Aug 27, 2024

Agrael1 commented Aug 27, 2024

adam-sawicki-a commented Aug 27, 2024

adam-sawicki-a commented Aug 27, 2024

Agrael1 commented Aug 27, 2024

Agrael1 commented Aug 27, 2024

adam-sawicki-a commented Aug 27, 2024

Agrael1 commented Aug 27, 2024 • edited Loading

adam-sawicki-a commented Aug 27, 2024

Agrael1 commented Aug 27, 2024

Agrael1 commented Aug 27, 2024

adam-sawicki-a commented Aug 27, 2024

Agrael1 commented Aug 27, 2024

Agrael1 commented Aug 27, 2024

adam-sawicki-a commented Aug 27, 2024

adam-sawicki-a commented Aug 27, 2024

Agrael1 commented Aug 27, 2024 • edited Loading

adam-sawicki-a commented Aug 27, 2024

Agrael1 commented Aug 27, 2024

adam-sawicki-a commented Aug 27, 2024

Agrael1 commented Aug 27, 2024

adam-sawicki-a commented Aug 28, 2024

Agrael1 commented Aug 28, 2024 • edited Loading

adam-sawicki-a commented Aug 28, 2024

Agrael1 commented Aug 28, 2024

Agrael1 commented Aug 28, 2024

Agrael1 commented Aug 27, 2024 •

edited

Loading

Agrael1 commented Aug 27, 2024 •

edited

Loading

Agrael1 commented Aug 28, 2024 •

edited

Loading