-
Notifications
You must be signed in to change notification settings - Fork 95
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ClearML Agent fails scheduling tasks in Kubernetes after updating to v1.9.3 #223
Comments
Seeing the same issue using Helm deployment (charts "clearml-7.14.1" and "clearml-agent-5.3.1") |
Hi @dberkerdem and @@mb-ii Can you share the exact server version/build from the UI's profile page? |
WebApp: 2.0.0-613 • Server: 2.0.0-613 • API: 2.31 |
1 similar comment
WebApp: 2.0.0-613 • Server: 2.0.0-613 • API: 2.31 |
@jkhenning here's full Helm configuration for easier debugging: values.yaml
Chart.yaml
|
May I ask what's the last Helm chart version this was working for you @dberkerdem ? |
I use 5.3.1 but it is not related with Chart version. To solve this problem you need to expose following environment variable agents environment. The problem is caused by dynamic installation of latest ClearML version on ClearML Agent startup.See behavior agentk8sglue:
extraEnvs:
- name: CLEARML_AGENT_UPDATE_VERSION
value: "==1.9.2" |
Thanks, it helped! @dberkerdem |
I also saw this issue in clearml-agent 1.9.3. Downgrade to 1.9.2 helped. It seems a bug in 1.9.3 version :/ |
After updating ClearML Agent that run in Kubernetes to v1.9.3, we start encountering following error on new scheduled tasks.
ERROR: Could not push back task [2667c614e10a46df90882ae3aa3ca7c8] to k8s pending queue k8s_scheduler [45bfd703d59245acbe3f3982fdb5f2d2], error: Validation error (Cannot skip setting execution queue for a task that is not enqueued or does not have execution queue set)
The text was updated successfully, but these errors were encountered: