feat: expose max_num_tokens as configurable #93
e2e-nvidia-l4-x1.yml
on: pull_request_target
start-medium-ec2-runner
e2e-medium-workflow-complete
stop-medium-ec2-runner
Annotations
1 error
E2E (NVIDIA L4 x1)
Canceling since a higher priority waiting request for 'E2E (NVIDIA L4 x1)-340' exists