You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When running kedro-mlflow on Databricks, occasionally a new run of the experiment might be triggered when running parallelized code. This is because Databricks enables autologging (at least in recent runtimes), and the new runs might be due to an mlflow bug.
Proposed solution: Add a new hook to disable autolog, or include it in the current hook.
class DisableMLFlowAutoLogger:
@hook_impl(tryfirst=True)
def after_context_created(self, context) -> None:
mlflow.autolog(disable=True)
Although I encountered this because of Databricks, I can't imagine a context where you'd like to enable autolog together with the plugin. Could be a parameter of mlflow.yml if you want to be flexible.
Description
When running kedro-mlflow on Databricks, occasionally a new run of the experiment might be triggered when running parallelized code. This is because Databricks enables autologging (at least in recent runtimes), and the new runs might be due to an mlflow bug.
Proposed solution: Add a new hook to disable autolog, or include it in the current hook.
Although I encountered this because of Databricks, I can't imagine a context where you'd like to enable autolog together with the plugin. Could be a parameter of mlflow.yml if you want to be flexible.
Context
See conversation for context:
https://kedro-org.slack.com/archives/C03RKP2LW64/p1732141412790889
Steps to Reproduce
This will trigger maybe 4-6 new runs when using LightGBM in your objective.
Expected Result
Results should be in the run started by kedro-mlflow.
Actual Result
New runs are triggered.
Your Environment
Databricks Runtime 15.4 ML
Kedro 19.9
kedro-mlflow 0.13.3
Does the bug also happen with the last version on master?
Yes
The text was updated successfully, but these errors were encountered: