Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

vkcube should use the first available physical device #1054

Open
kvark opened this issue Nov 28, 2024 · 4 comments
Open

vkcube should use the first available physical device #1054

kvark opened this issue Nov 28, 2024 · 4 comments

Comments

@kvark
Copy link

kvark commented Nov 28, 2024

Highly related to #153

Code in question -

/* Try to auto select most suitable device */

As a result of this logic, vkcube always picks a discrete GPU on multi-gpu systems. This is in conflict with the principle of vkEnumeratePhysicalDevice returning the adapters in a "preferred" order by the system (see KhronosGroup/Vulkan-Loader#153 (comment) for context). If Vulkan loader is indeed trying to return the first adapter as the preferred, and/or there are applications expecting them, then I think vkcube should just pick the first (available) physical device instead of doing any kind of sorting. If you want to demonstrate a more comprehensive use of Vulkan, you could keep the sorting functionality behind an option.

The context here is - some systems returning Intel GPU as the first but actually failing to present with it. See https://gitlab.freedesktop.org/mesa/mesa/-/issues/4688 and https://gitlab.freedesktop.org/mesa/mesa/-/issues/12212 for examples. vkcube is often used to simply check "does Vulkan work on your system", and always returning the discrete GPU isn't actually providing this signal, which causes confusion.

Examples of confusion:

@water-chika
Copy link
Contributor

If you want to demonstrate a more comprehensive use of Vulkan, you could keep the sorting functionality behind an option.

vkcube --gpu_number 0 could help you always select first GPU.

vkcube is often used to simply check "does Vulkan work on your system", and always returning the discrete GPU isn't actually providing this signal, which causes confusion.

If you want to check all physical devices could work on your system, you could get total number GPU number and run vkcube --gpu_number <index> in a loop.

@jjulianoatnv
Copy link
Contributor

jjulianoatnv commented Dec 17, 2024

I'm opposed to the premise of this issue's problem statement.

We intentionally added sorting to bring Vulkan on par with APIs like D3D12 on Windows. On Windows, there is OS arbitration of physical device sorting. This sorting spans devices from separate IHVs (which is this issue's configuration.) The OS provides centralized user control that overrides the default heuristics. Users do use these capabilities. Matching D3D12 behavior in Vulkan is desired.

Linux lacks centralized control that spans IHVs. In part, there is an allergic reaction from linux maintainers to the concept of centralized arbitration that spans drivers. Recent discussion within the Vulkan working group has seemed to come to realize that centralized arbitration might in fact be good, and there might be an "education campaign" in the coming months. The future is TBD.

In the absence of centralized arbitration, the Vulkan loader for linux has implemented its own heuristics. The topic of the current github issue is really about the heuristics that were implemented inside the Vulkan loader for linux. This is a narrower scope than how the issue topic is currently written.

My words do not debate the quality and efficacy of those heuristics. I do not oppose change to the Linux heuristics. What I'm doing in this comment is clarifying the problem space and providing some background that is relevant to the discussion.

@kvark
Copy link
Author

kvark commented Dec 19, 2024

@jjulianoatnv thank you for a well-thought response!
If there is a gap in Linux support for Vulkan with regards to supporting multiple IHV, I don't think vkcube is the right place to solve it. Introducing adapter sorting in vkcube is not going to magically bring Vulkan on par with D3D12 for everybody.
The proper solution would lie somewhere in writing the Vulkan layers (like the MESA device select), working with linux maintainers (like you mentioned), etc. vkcube should assume a healthy system, not try to fix it.

@jjulianoatnv
Copy link
Contributor

Oops, in my previous comment I misspoke. I had incorrectly read this issue as requesting that the Vulkan Loader use the first available physical device. I see now that the issue is about the vkcube application defaulting to something other than the first physical device that is enumerated by Vulkan. My previous comment was attempting to explain that Vulkan (the loader or supporting layers) attempts to enumerate a good default physical device as the first enumerated physical device.

I agree that vkcube, and most other applications, should default to using the first enumerated physical device.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants