From ad0392f982ea899d5f65a3bf98a89bec91e82727 Mon Sep 17 00:00:00 2001 From: Andrew Aikawa Date: Tue, 13 Aug 2024 10:39:44 -0700 Subject: [PATCH] Update controller.rst patch controller docs --- docs/source/admin/controller.rst | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/docs/source/admin/controller.rst b/docs/source/admin/controller.rst index 249cf21..3df8f2c 100644 --- a/docs/source/admin/controller.rst +++ b/docs/source/admin/controller.rst @@ -83,7 +83,7 @@ After which you should see in your controller logs [I 07-09 05:37:45 node.py:98] Node gke-a3-cluster-gpu-pool-2d164072-zz64 tainted. # in a separate terminal you can verify - $ kubectl describe pod gke-a3-cluster-gpu-pool-2d164072-zz64 | grep trainy + $ kubectl describe node gke-a3-cluster-gpu-pool-2d164072-zz64 | grep trainy trainy.konduktor.ai/faulty=true:NoSchedule You can remove all the taints in the cluster with :code:`konduktor reset` @@ -105,4 +105,4 @@ Features and Roadmap - In-cluster deployment of controller - **Available** ✅ - Pod log error detection - **Available** ✅ - Health Checks (Taint Removal) - In progress 🚧 -- Node Resolution Hooks (Reboot, Power Cycle) - In progress 🚧 \ No newline at end of file +- Node Resolution Hooks (Reboot, Power Cycle) - In progress 🚧