-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
build(deps): bump the pip group across 31 directories with 1 update #68
Closed
dependabot
wants to merge
1
commit into
main
from
dependabot/pip/bentoml/bentos/codestral/22b-v0.1-fp16-7231/src/pip-a80ba3bf45
Closed
build(deps): bump the pip group across 31 directories with 1 update #68
dependabot
wants to merge
1
commit into
main
from
dependabot/pip/bentoml/bentos/codestral/22b-v0.1-fp16-7231/src/pip-a80ba3bf45
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Bumps the pip group with 1 update in the /bentoml/bentos/codestral/22b-v0.1-fp16-7231/src directory: [vllm](https://github.com/vllm-project/vllm). Bumps the pip group with 1 update in the /bentoml/bentos/deepseek-r1-distill/llama3.1-8b-fp16-f208/src directory: [vllm](https://github.com/vllm-project/vllm). Bumps the pip group with 1 update in the /bentoml/bentos/deepseek-r1-distill/llama3.3-70b-instruct-fp16-5b46/src directory: [vllm](https://github.com/vllm-project/vllm). Bumps the pip group with 1 update in the /bentoml/bentos/deepseek-r1-distill/qwen2.5-1.5b-math-fp16-5e2f/src directory: [vllm](https://github.com/vllm-project/vllm). Bumps the pip group with 1 update in the /bentoml/bentos/deepseek-r1-distill/qwen2.5-14b-fp16-44c7/src directory: [vllm](https://github.com/vllm-project/vllm). Bumps the pip group with 1 update in the /bentoml/bentos/deepseek-r1-distill/qwen2.5-32b-fp16-29c6/src directory: [vllm](https://github.com/vllm-project/vllm). Bumps the pip group with 1 update in the /bentoml/bentos/deepseek-r1-distill/qwen2.5-7b-math-fp16-761e/src directory: [vllm](https://github.com/vllm-project/vllm). Bumps the pip group with 1 update in the /bentoml/bentos/deepseek-v3/671b-instruct-fp8-70d7/src directory: [vllm](https://github.com/vllm-project/vllm). Bumps the pip group with 1 update in the /bentoml/bentos/gemma/2b-instruct-fp16-1320/src directory: [vllm](https://github.com/vllm-project/vllm). Bumps the pip group with 1 update in the /bentoml/bentos/gemma/7b-instruct-awq-4bit-a9cb/src directory: [vllm](https://github.com/vllm-project/vllm). Bumps the pip group with 1 update in the /bentoml/bentos/gemma/7b-instruct-fp16-10bb/src directory: [vllm](https://github.com/vllm-project/vllm). Bumps the pip group with 1 update in the /bentoml/bentos/gemma2/27b-instruct-fp16-c1e5/src directory: [vllm](https://github.com/vllm-project/vllm). Bumps the pip group with 1 update in the /bentoml/bentos/gemma2/9b-instruct-fp16-fdaa/src directory: [vllm](https://github.com/vllm-project/vllm). Bumps the pip group with 1 update in the /bentoml/bentos/jamba1.5/mini-fp16-3615/src directory: [vllm](https://github.com/vllm-project/vllm). Bumps the pip group with 1 update in the /bentoml/bentos/llama2/13b-chat-fp16-49e4/src directory: [vllm](https://github.com/vllm-project/vllm). Bumps the pip group with 1 update in the /bentoml/bentos/llama2/70b-chat-fp16-cc77/src directory: [vllm](https://github.com/vllm-project/vllm). Bumps the pip group with 1 update in the /bentoml/bentos/llama2/7b-chat-awq-4bit-cc6f/src directory: [vllm](https://github.com/vllm-project/vllm). Bumps the pip group with 1 update in the /bentoml/bentos/llama2/7b-chat-fp16-81cf/src directory: [vllm](https://github.com/vllm-project/vllm). Bumps the pip group with 1 update in the /bentoml/bentos/llama3.1-nemotron/70b-instruct-fp16-8d09/src directory: [vllm](https://github.com/vllm-project/vllm). Bumps the pip group with 1 update in the /bentoml/bentos/llama3.1/405b-instruct-awq-4bit-bbd0/src directory: [vllm](https://github.com/vllm-project/vllm). Bumps the pip group with 1 update in the /bentoml/bentos/llama3.1/70b-instruct-awq-4bit-e86e/src directory: [vllm](https://github.com/vllm-project/vllm). Bumps the pip group with 1 update in the /bentoml/bentos/llama3.1/70b-instruct-fp16-d198/src directory: [vllm](https://github.com/vllm-project/vllm). Bumps the pip group with 1 update in the /bentoml/bentos/llama3.1/8b-instruct-awq-4bit-b149/src directory: [vllm](https://github.com/vllm-project/vllm). Bumps the pip group with 1 update in the /bentoml/bentos/llama3.1/8b-instruct-fp16-cbdd/src directory: [vllm](https://github.com/vllm-project/vllm). Bumps the pip group with 1 update in the /bentoml/bentos/llama3.2/11b-vision-instruct-714f/src directory: [vllm](https://github.com/vllm-project/vllm). Bumps the pip group with 1 update in the /bentoml/bentos/llama3.2/1b-instruct-fp16-ce2d/src directory: [vllm](https://github.com/vllm-project/vllm). Bumps the pip group with 1 update in the /bentoml/bentos/llama3.2/3b-instruct-fp16-be73/src directory: [vllm](https://github.com/vllm-project/vllm). Bumps the pip group with 1 update in the /bentoml/bentos/llama3.3/70b-instruct-fp16-419e/src directory: [vllm](https://github.com/vllm-project/vllm). Bumps the pip group with 1 update in the /bentoml/bentos/llama3/70b-instruct-awq-4bit-f693/src directory: [vllm](https://github.com/vllm-project/vllm). Bumps the pip group with 1 update in the /bentoml/bentos/llama3/70b-instruct-fp16-7265/src directory: [vllm](https://github.com/vllm-project/vllm). Bumps the pip group with 1 update in the /bentoml/bentos/mistral-large/123b-instruct-awq-4bit-13a5/src directory: [vllm](https://github.com/vllm-project/vllm). Updates `vllm` from 0.6.6post1 to 0.7.0 - [Release notes](https://github.com/vllm-project/vllm/releases) - [Commits](vllm-project/vllm@v0.6.6.post1...v0.7.0) Updates `vllm` from 0.6.6post1 to 0.7.0 - [Release notes](https://github.com/vllm-project/vllm/releases) - [Commits](vllm-project/vllm@v0.6.6.post1...v0.7.0) Updates `vllm` from 0.6.6post1 to 0.7.0 - [Release notes](https://github.com/vllm-project/vllm/releases) - [Commits](vllm-project/vllm@v0.6.6.post1...v0.7.0) Updates `vllm` from 0.6.6post1 to 0.7.0 - [Release notes](https://github.com/vllm-project/vllm/releases) - [Commits](vllm-project/vllm@v0.6.6.post1...v0.7.0) Updates `vllm` from 0.6.6post1 to 0.7.0 - [Release notes](https://github.com/vllm-project/vllm/releases) - [Commits](vllm-project/vllm@v0.6.6.post1...v0.7.0) Updates `vllm` from 0.6.6post1 to 0.7.0 - [Release notes](https://github.com/vllm-project/vllm/releases) - [Commits](vllm-project/vllm@v0.6.6.post1...v0.7.0) Updates `vllm` from 0.6.6post1 to 0.7.0 - [Release notes](https://github.com/vllm-project/vllm/releases) - [Commits](vllm-project/vllm@v0.6.6.post1...v0.7.0) Updates `vllm` from 0.6.6post1 to 0.7.0 - [Release notes](https://github.com/vllm-project/vllm/releases) - [Commits](vllm-project/vllm@v0.6.6.post1...v0.7.0) Updates `vllm` from 0.6.6post1 to 0.7.0 - [Release notes](https://github.com/vllm-project/vllm/releases) - [Commits](vllm-project/vllm@v0.6.6.post1...v0.7.0) Updates `vllm` from 0.6.6post1 to 0.7.0 - [Release notes](https://github.com/vllm-project/vllm/releases) - [Commits](vllm-project/vllm@v0.6.6.post1...v0.7.0) Updates `vllm` from 0.6.6post1 to 0.7.0 - [Release notes](https://github.com/vllm-project/vllm/releases) - [Commits](vllm-project/vllm@v0.6.6.post1...v0.7.0) Updates `vllm` from 0.6.6post1 to 0.7.0 - [Release notes](https://github.com/vllm-project/vllm/releases) - [Commits](vllm-project/vllm@v0.6.6.post1...v0.7.0) Updates `vllm` from 0.6.6post1 to 0.7.0 - [Release notes](https://github.com/vllm-project/vllm/releases) - [Commits](vllm-project/vllm@v0.6.6.post1...v0.7.0) Updates `vllm` from 0.6.6post1 to 0.7.0 - [Release notes](https://github.com/vllm-project/vllm/releases) - [Commits](vllm-project/vllm@v0.6.6.post1...v0.7.0) Updates `vllm` from 0.6.6post1 to 0.7.0 - [Release notes](https://github.com/vllm-project/vllm/releases) - [Commits](vllm-project/vllm@v0.6.6.post1...v0.7.0) Updates `vllm` from 0.6.6post1 to 0.7.0 - [Release notes](https://github.com/vllm-project/vllm/releases) - [Commits](vllm-project/vllm@v0.6.6.post1...v0.7.0) Updates `vllm` from 0.6.6post1 to 0.7.0 - [Release notes](https://github.com/vllm-project/vllm/releases) - [Commits](vllm-project/vllm@v0.6.6.post1...v0.7.0) Updates `vllm` from 0.6.6post1 to 0.7.0 - [Release notes](https://github.com/vllm-project/vllm/releases) - [Commits](vllm-project/vllm@v0.6.6.post1...v0.7.0) Updates `vllm` from 0.6.6post1 to 0.7.0 - [Release notes](https://github.com/vllm-project/vllm/releases) - [Commits](vllm-project/vllm@v0.6.6.post1...v0.7.0) Updates `vllm` from 0.6.6post1 to 0.7.0 - [Release notes](https://github.com/vllm-project/vllm/releases) - [Commits](vllm-project/vllm@v0.6.6.post1...v0.7.0) Updates `vllm` from 0.6.6post1 to 0.7.0 - [Release notes](https://github.com/vllm-project/vllm/releases) - [Commits](vllm-project/vllm@v0.6.6.post1...v0.7.0) Updates `vllm` from 0.6.6post1 to 0.7.0 - [Release notes](https://github.com/vllm-project/vllm/releases) - [Commits](vllm-project/vllm@v0.6.6.post1...v0.7.0) Updates `vllm` from 0.6.6post1 to 0.7.0 - [Release notes](https://github.com/vllm-project/vllm/releases) - [Commits](vllm-project/vllm@v0.6.6.post1...v0.7.0) Updates `vllm` from 0.6.6post1 to 0.7.0 - [Release notes](https://github.com/vllm-project/vllm/releases) - [Commits](vllm-project/vllm@v0.6.6.post1...v0.7.0) Updates `vllm` from 0.6.6post1 to 0.7.0 - [Release notes](https://github.com/vllm-project/vllm/releases) - [Commits](vllm-project/vllm@v0.6.6.post1...v0.7.0) Updates `vllm` from 0.6.6post1 to 0.7.0 - [Release notes](https://github.com/vllm-project/vllm/releases) - [Commits](vllm-project/vllm@v0.6.6.post1...v0.7.0) Updates `vllm` from 0.6.6post1 to 0.7.0 - [Release notes](https://github.com/vllm-project/vllm/releases) - [Commits](vllm-project/vllm@v0.6.6.post1...v0.7.0) Updates `vllm` from 0.6.6post1 to 0.7.0 - [Release notes](https://github.com/vllm-project/vllm/releases) - [Commits](vllm-project/vllm@v0.6.6.post1...v0.7.0) Updates `vllm` from 0.6.6post1 to 0.7.0 - [Release notes](https://github.com/vllm-project/vllm/releases) - [Commits](vllm-project/vllm@v0.6.6.post1...v0.7.0) Updates `vllm` from 0.6.6post1 to 0.7.0 - [Release notes](https://github.com/vllm-project/vllm/releases) - [Commits](vllm-project/vllm@v0.6.6.post1...v0.7.0) Updates `vllm` from 0.6.6post1 to 0.7.0 - [Release notes](https://github.com/vllm-project/vllm/releases) - [Commits](vllm-project/vllm@v0.6.6.post1...v0.7.0) --- updated-dependencies: - dependency-name: vllm dependency-type: direct:production dependency-group: pip - dependency-name: vllm dependency-type: direct:production dependency-group: pip - dependency-name: vllm dependency-type: direct:production dependency-group: pip - dependency-name: vllm dependency-type: direct:production dependency-group: pip - dependency-name: vllm dependency-type: direct:production dependency-group: pip - dependency-name: vllm dependency-type: direct:production dependency-group: pip - dependency-name: vllm dependency-type: direct:production dependency-group: pip - dependency-name: vllm dependency-type: direct:production dependency-group: pip - dependency-name: vllm dependency-type: direct:production dependency-group: pip - dependency-name: vllm dependency-type: direct:production dependency-group: pip - dependency-name: vllm dependency-type: direct:production dependency-group: pip - dependency-name: vllm dependency-type: direct:production dependency-group: pip - dependency-name: vllm dependency-type: direct:production dependency-group: pip - dependency-name: vllm dependency-type: direct:production dependency-group: pip - dependency-name: vllm dependency-type: direct:production dependency-group: pip - dependency-name: vllm dependency-type: direct:production dependency-group: pip - dependency-name: vllm dependency-type: direct:production dependency-group: pip - dependency-name: vllm dependency-type: direct:production dependency-group: pip - dependency-name: vllm dependency-type: direct:production dependency-group: pip - dependency-name: vllm dependency-type: direct:production dependency-group: pip - dependency-name: vllm dependency-type: direct:production dependency-group: pip - dependency-name: vllm dependency-type: direct:production dependency-group: pip - dependency-name: vllm dependency-type: direct:production dependency-group: pip - dependency-name: vllm dependency-type: direct:production dependency-group: pip - dependency-name: vllm dependency-type: direct:production dependency-group: pip - dependency-name: vllm dependency-type: direct:production dependency-group: pip - dependency-name: vllm dependency-type: direct:production dependency-group: pip - dependency-name: vllm dependency-type: direct:production dependency-group: pip - dependency-name: vllm dependency-type: direct:production dependency-group: pip - dependency-name: vllm dependency-type: direct:production dependency-group: pip - dependency-name: vllm dependency-type: direct:production dependency-group: pip ... Signed-off-by: dependabot[bot] <[email protected]>
dependabot
bot
added
dependencies
Pull requests that update a dependency file
python
Pull requests that update Python code
labels
Jan 27, 2025
This pull request was built based on a group rule. Closing it will not ignore any of these versions in future pull requests. To ignore these dependencies, configure ignore rules in dependabot.yml |
dependabot
bot
deleted the
dependabot/pip/bentoml/bentos/codestral/22b-v0.1-fp16-7231/src/pip-a80ba3bf45
branch
February 6, 2025 23:21
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
dependencies
Pull requests that update a dependency file
python
Pull requests that update Python code
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Bumps the pip group with 1 update in the /bentoml/bentos/codestral/22b-v0.1-fp16-7231/src directory: vllm.
Bumps the pip group with 1 update in the /bentoml/bentos/deepseek-r1-distill/llama3.1-8b-fp16-f208/src directory: vllm.
Bumps the pip group with 1 update in the /bentoml/bentos/deepseek-r1-distill/llama3.3-70b-instruct-fp16-5b46/src directory: vllm.
Bumps the pip group with 1 update in the /bentoml/bentos/deepseek-r1-distill/qwen2.5-1.5b-math-fp16-5e2f/src directory: vllm.
Bumps the pip group with 1 update in the /bentoml/bentos/deepseek-r1-distill/qwen2.5-14b-fp16-44c7/src directory: vllm.
Bumps the pip group with 1 update in the /bentoml/bentos/deepseek-r1-distill/qwen2.5-32b-fp16-29c6/src directory: vllm.
Bumps the pip group with 1 update in the /bentoml/bentos/deepseek-r1-distill/qwen2.5-7b-math-fp16-761e/src directory: vllm.
Bumps the pip group with 1 update in the /bentoml/bentos/deepseek-v3/671b-instruct-fp8-70d7/src directory: vllm.
Bumps the pip group with 1 update in the /bentoml/bentos/gemma/2b-instruct-fp16-1320/src directory: vllm.
Bumps the pip group with 1 update in the /bentoml/bentos/gemma/7b-instruct-awq-4bit-a9cb/src directory: vllm.
Bumps the pip group with 1 update in the /bentoml/bentos/gemma/7b-instruct-fp16-10bb/src directory: vllm.
Bumps the pip group with 1 update in the /bentoml/bentos/gemma2/27b-instruct-fp16-c1e5/src directory: vllm.
Bumps the pip group with 1 update in the /bentoml/bentos/gemma2/9b-instruct-fp16-fdaa/src directory: vllm.
Bumps the pip group with 1 update in the /bentoml/bentos/jamba1.5/mini-fp16-3615/src directory: vllm.
Bumps the pip group with 1 update in the /bentoml/bentos/llama2/13b-chat-fp16-49e4/src directory: vllm.
Bumps the pip group with 1 update in the /bentoml/bentos/llama2/70b-chat-fp16-cc77/src directory: vllm.
Bumps the pip group with 1 update in the /bentoml/bentos/llama2/7b-chat-awq-4bit-cc6f/src directory: vllm.
Bumps the pip group with 1 update in the /bentoml/bentos/llama2/7b-chat-fp16-81cf/src directory: vllm.
Bumps the pip group with 1 update in the /bentoml/bentos/llama3.1-nemotron/70b-instruct-fp16-8d09/src directory: vllm.
Bumps the pip group with 1 update in the /bentoml/bentos/llama3.1/405b-instruct-awq-4bit-bbd0/src directory: vllm.
Bumps the pip group with 1 update in the /bentoml/bentos/llama3.1/70b-instruct-awq-4bit-e86e/src directory: vllm.
Bumps the pip group with 1 update in the /bentoml/bentos/llama3.1/70b-instruct-fp16-d198/src directory: vllm.
Bumps the pip group with 1 update in the /bentoml/bentos/llama3.1/8b-instruct-awq-4bit-b149/src directory: vllm.
Bumps the pip group with 1 update in the /bentoml/bentos/llama3.1/8b-instruct-fp16-cbdd/src directory: vllm.
Bumps the pip group with 1 update in the /bentoml/bentos/llama3.2/11b-vision-instruct-714f/src directory: vllm.
Bumps the pip group with 1 update in the /bentoml/bentos/llama3.2/1b-instruct-fp16-ce2d/src directory: vllm.
Bumps the pip group with 1 update in the /bentoml/bentos/llama3.2/3b-instruct-fp16-be73/src directory: vllm.
Bumps the pip group with 1 update in the /bentoml/bentos/llama3.3/70b-instruct-fp16-419e/src directory: vllm.
Bumps the pip group with 1 update in the /bentoml/bentos/llama3/70b-instruct-awq-4bit-f693/src directory: vllm.
Bumps the pip group with 1 update in the /bentoml/bentos/llama3/70b-instruct-fp16-7265/src directory: vllm.
Bumps the pip group with 1 update in the /bentoml/bentos/mistral-large/123b-instruct-awq-4bit-13a5/src directory: vllm.
Updates
vllm
from 0.6.6post1 to 0.7.0Release notes
Sourced from vllm's releases.
... (truncated)
Commits
5204ff5
[Bugfix] Fix Granite 3.0 MoE model loading (#12446)0cc6b38
[Frontend] Support scores endpoint in run_batch (#12430)28e0750
[V1] Avoid list creation in input preparation (#12457)582cf78
[DOC] Add link to vLLM blog (#12460)0034b09
[Frontend] Rerank API (Jina- and Cohere-compatible API) (#12376)72bac73
[Build/CI] Fix libcuda.so linkage (#12424)68f1114
[Bugfix][Kernel] Fix perf regression caused by PR #12405 (#12434)72f4880
[Bugfix/CI] Fix broken kernels/test_mha.py (#12450)aa2cd2c
[Bugfix] Disable w16a16 2of4 sparse CompressedTensors24 (#12417)9ddc352
[Frontend] generation_config.json for maximum tokens(#12242)Updates
vllm
from 0.6.6post1 to 0.7.0Release notes
Sourced from vllm's releases.
... (truncated)
Commits
5204ff5
[Bugfix] Fix Granite 3.0 MoE model loading (#12446)0cc6b38
[Frontend] Support scores endpoint in run_batch (#12430)28e0750
[V1] Avoid list creation in input preparation (#12457)582cf78
[DOC] Add link to vLLM blog (#12460)0034b09
[Frontend] Rerank API (Jina- and Cohere-compatible API) (#12376)72bac73
[Build/CI] Fix libcuda.so linkage (#12424)68f1114
[Bugfix][Kernel] Fix perf regression caused by PR #12405 (#12434)72f4880
[Bugfix/CI] Fix broken kernels/test_mha.py (#12450)aa2cd2c
[Bugfix] Disable w16a16 2of4 sparse CompressedTensors24 (#12417)9ddc352
[Frontend] generation_config.json for maximum tokens(#12242)Updates
vllm
from 0.6.6post1 to 0.7.0Release notes
Sourced from vllm's releases.
... (truncated)
Commits
5204ff5
[Bugfix] Fix Granite 3.0 MoE model loading (#12446)0cc6b38
[Frontend] Support scores endpoint in run_batch (#12430)28e0750
[V1] Avoid list creation in input preparation (#12457)582cf78
[DOC] Add link to vLLM blog (#12460)0034b09
[Frontend] Rerank API (Jina- and Cohere-compatible API) (#12376)72bac73
[Build/CI] Fix libcuda.so linkage (#12424)68f1114
[Bugfix][Kernel] Fix perf regression caused by PR #12405 (#12434)72f4880
[Bugfix/CI] Fix broken kernels/test_mha.py (#12450)aa2cd2c
[Bugfix] Disable w16a16 2of4 sparse CompressedTensors24 (#12417)9ddc352
[Frontend] generation_config.json for maximum tokens(#12242)Updates
vllm
from 0.6.6post1 to 0.7.0Release notes
Sourced from vllm's releases.
... (truncated)
Commits
5204ff5
[Bugfix] Fix Granite 3.0 MoE model loading (#12446)0cc6b38
[Frontend] Support scores endpoint in run_batch (#12430)28e0750
[V1] Avoid list creation in input preparation (#12457)582cf78
[DOC] Add link to vLLM blog (#12460)0034b09
[Frontend] Rerank API (Jina- and Cohere-compatible API) (#12376)72bac73
[Build/CI] Fix libcuda.so linkage (#12424)68f1114
[Bugfix][Kernel] Fix perf regression caused by PR #12405 (#12434)72f4880
[Bugfix/CI] Fix broken kernels/test_mha.py (#12450)aa2cd2c
[Bugfix] Disable w16a16 2of4 sparse CompressedTensors24 (#12417)9ddc352
[Frontend] generation_config.json for maximum tokens(#12242)Updates
vllm
from 0.6.6post1 to 0.7.0Release notes
Sourced from vllm's releases.
... (truncated)
Commits
5204ff5
[Bugfix] Fix Granite 3.0 MoE model loading (#12446)0cc6b38
[Frontend] Support scores endpoint in run_batch (#12430)28e0750
[V1] Avoid list creation in input preparation (#12457)582cf78
[DOC] Add link to vLLM blog (#12460)0034b09
[Frontend] Rerank API (Jina- and Cohere-compatible API) (#12376)72bac73
[Build/CI] Fix libcuda.so linkage (#12424)68f1114
[Bugfix][Kernel] Fix perf regression caused by PR