Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

roadmap: Jan has Hardware Controls and System Monitor and Prioritization #3908

Open
1 of 11 tasks
Tracked by #4485
dan-menlo opened this issue Oct 30, 2024 · 4 comments
Open
1 of 11 tasks
Tracked by #4485
Assignees
Milestone

Comments

@dan-menlo
Copy link
Contributor

dan-menlo commented Oct 30, 2024

@dan-menlo dan-menlo added this to Menlo Oct 30, 2024
@dan-menlo dan-menlo converted this from a draft issue Oct 30, 2024
@dan-menlo dan-menlo changed the title planning: Migrate Jan System Monitor and Hardware Compatibility to Hardware API planning: Migrate System Monitor and Hardware Compatibility to Hardware API Oct 30, 2024
@imtuyethan imtuyethan moved this from Investigating to Planning in Menlo Nov 27, 2024
@dan-menlo dan-menlo changed the title planning: Migrate System Monitor and Hardware Compatibility to Hardware API roadmap: Jan has Hardware Controls and System Monitor for Nvidia GPUs Nov 28, 2024
@dan-menlo dan-menlo changed the title roadmap: Jan has Hardware Controls and System Monitor for Nvidia GPUs roadmap: Jan uses Cortex.cpp for Hardware Controls and System Monitor for Nvidia GPUs Nov 28, 2024
@dan-menlo dan-menlo changed the title roadmap: Jan uses Cortex.cpp for Hardware Controls and System Monitor for Nvidia GPUs roadmap: Jan has Hardware Controls and System Monitor for Nvidia GPUs Nov 28, 2024
@imtuyethan imtuyethan moved this from Planning to In Progress in Menlo Nov 29, 2024
@dan-menlo
Copy link
Contributor Author

dan-menlo commented Dec 12, 2024

12 Dec

Hardware UI

Image

Image

Open Questions

  • How do we prioritize across both Nvidia and AMD GPUs? @vansangpfiev
  • UX for Vulkan (as it can run on both Nvidia and AMD)
  • Should we have separate settings for llama.cpp Engines and Hardware (eg. for Vulkan)

@vansangpfiev
Copy link

How do we prioritize across both Nvidia and AMD GPUs?

AMD and Nvidia have uuid field, we can use this field to map between GPU and the index we pass to CUDA_VISIBLE_DEVICES and GGML_VULKAN_DEVICE env variable.

@dan-menlo dan-menlo changed the title roadmap: Jan has Hardware Controls and System Monitor for Nvidia GPUs roadmap: Jan has Hardware Controls and System Monitor and Prioritization Dec 13, 2024
@dan-menlo
Copy link
Contributor Author

@vansangpfiev Please add in architecture diagrams and Engineering Spec

@vansangpfiev
Copy link

  • Diagram to configure global GPUs prioritization
  • Vulkan can detect both AMD GPUs and Nvidia GPUs
  • Nvidia-smi can only detect Nvidia GPUs
  • We have a mapping function to map uuid with cuda index and vulkan index

image

cc: @dan-homebrew @louis-jan

@imtuyethan imtuyethan modified the milestones: v0.5.15, v0.5.14 Jan 20, 2025
@urmauur urmauur moved this from Eng Review to QA in Menlo Feb 3, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: QA
Development

No branches or pull requests

5 participants