You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Currently when we do a "rolling restart" of the cluster the operator is restarting the pods independent of whether the starrocks cluster is in a clean state or not.
This leads to the problem we are facing WRITE errors with "under-replicated" tablets during rolling restarts because cluster ist still syncing tablets while operator is removing next BE pod
Currently we do manual DELETE POD instead of rolling restart and watch out for "pending tablets" on the cluster. We go with next pod if "pending tablets = 0" -> See example below
Operator should consider the "health/balance" state of the cluster and should only go on with removing of PODs if cluster is in sync.
Not sure if "pending_tablets" ist the best approach but should definitely avoid tablets which are not writable during restarts
The text was updated successfully, but these errors were encountered:
Describe the current behavior
Currently when we do a "rolling restart" of the cluster the operator is restarting the pods independent of whether the starrocks cluster is in a clean state or not.
This leads to the problem we are facing WRITE errors with "under-replicated" tablets during rolling restarts because cluster ist still syncing tablets while operator is removing next BE pod
Currently we do manual DELETE POD instead of rolling restart and watch out for "pending tablets" on the cluster. We go with next pod if "pending tablets = 0" -> See example below
Describe the enhancement
Operator should consider the "health/balance" state of the cluster and should only go on with removing of PODs if cluster is in sync.
Not sure if "pending_tablets" ist the best approach but should definitely avoid tablets which are not writable during restarts
The text was updated successfully, but these errors were encountered: