Feat: reimplement vllm backend beam search using logprobs #316

Workflow file for this run

.github/workflows/test_cli_cuda_vllm.yaml at 9f4303f

	name: CLI CUDA vLLM Tests

	on:
	workflow_dispatch:
	push:
	branches:
	- main
	pull_request:
	branches:
	- main
	types:
	- opened
	- reopened
	- synchronize
	- labeled
	- unlabeled

	concurrency:
	cancel-in-progress: true
	group: ${{ github.workflow }}-${{ github.event.pull_request.number \|\| github.ref }}

	jobs:
	run_cli_cuda_vllm_single_gpu_tests:
	if: ${{
	(github.event_name == 'push') \|\|
	(github.event_name == 'workflow_dispatch') \|\|
	contains( github.event.pull_request.labels.*.name, 'cli') \|\|
	contains( github.event.pull_request.labels.*.name, 'cuda') \|\|
	contains( github.event.pull_request.labels.*.name, 'vllm') \|\|
	contains( github.event.pull_request.labels.*.name, 'single_gpu') \|\|
	contains( github.event.pull_request.labels.*.name, 'cli_cuda_vllm_single_gpu')
	}}

	runs-on:
	group: aws-g5-4xlarge-plus

	container:
	image: vllm/vllm-openai:latest
	options: --ipc host --gpus all --entrypoint /bin/bash

	steps:
	- name: Checkout
	uses: actions/checkout@v4

	- name: Install dependencies
	run: \|
	pip install -e .[testing]

	- name: Run tests (sequential)
	run: \|
	FORCE_SEQUENTIAL=1 pytest tests/test_cli.py -x -s -k "cli and cuda and vllm and not (tp or pp)"

	run_cli_cuda_vllm_multi_gpu_tests:
	if: ${{
	(github.event_name == 'push') \|\|
	(github.event_name == 'workflow_dispatch') \|\|
	contains( github.event.pull_request.labels.*.name, 'cli') \|\|
	contains( github.event.pull_request.labels.*.name, 'cuda') \|\|
	contains( github.event.pull_request.labels.*.name, 'vllm') \|\|
	contains( github.event.pull_request.labels.*.name, 'multi_gpu') \|\|
	contains( github.event.pull_request.labels.*.name, 'cli_cuda_vllm_multi_gpu')
	}}

	runs-on:
	group: aws-g5-12xlarge-plus

	container:
	image: vllm/vllm-openai:latest
	options: --ipc host --gpus all --entrypoint /bin/bash

	steps:
	- name: Checkout
	uses: actions/checkout@v4

	- name: Install dependencies
	run: \|
	pip install -e .[testing]

	- name: Run tests (sequential)
	run: \|
	FORCE_SEQUENTIAL=1 pytest tests/test_cli.py -x -s -k "cli and cuda and vllm and (tp or pp)"

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat: reimplement vllm backend beam search using logprobs #316

Workflow file

Feat: reimplement vllm backend beam search using logprobs #316

Jobs

Run details

Workflow file for this run