Start untangling orchestrator #1739

mnonnenmacher · 2025-01-07T07:34:39Z

The few classes in the orchestrator currently have a very high degree of coupling and no clear separation of concerns. For example:

The logic to determine the next jobs is partly implemented in WorkerScheduleInfo and partly in WorkerScheduleContext.
Messaging is handled by Orchestrator, WorkerScheduleContext, and WorkerScheduleInfo.
Orchestrator, WorkerJobRepositories, and WorkerScheduleInfo all interact with the repositories to create jobs or update their status.

Besides some minor cleanups, this PR moves the logic that determines the next jobs into a new class that has no dependencies on infrastructure. It is the first in a series of changes to untangle the orchestrator's responsibilities so that they can be tested in isolation and the code becomes easier to follow.

Signed-off-by: Martin Nonnenmacher <[email protected]>

This makes the code more explicit and helps with upcoming refactorings. Also, the function name was confusing because it was not only responsible for scheduling jobs but also for executing the `onFailure` handler from the caller. Signed-off-by: Martin Nonnenmacher <[email protected]>

The word "current" does not carry any meaning in the context of the function. Signed-off-by: Martin Nonnenmacher <[email protected]>

Use the `WorkerScheduleInfo` enum instead of the `Endpoint` class to define the dependencies between jobs. This slightly simplifies the code and improves type safety. It also makes the companion object of `WorkerScheduleInfo` obsolete. Signed-off-by: Martin Nonnenmacher <[email protected]>

Extract the logic to determine which jobs should run next into a new `OrtRunInfo` class to be able to test it independently. The class will be taken into use in a follow-up commit. Signed-off-by: Martin Nonnenmacher <[email protected]>

If the analyzer did not run, it makes no sense to run the reporter worker as there is no data to report. Signed-off-by: Martin Nonnenmacher <[email protected]>

Add a new function `scheduleNextJobs` that uses the previously introduced `OrtRunInfo` to determine which jobs need to be scheduled and delete all now unused functions from the previous implementation. Signed-off-by: Martin Nonnenmacher <[email protected]>

With the introduction of the `JobStatus.final` property in 4d23673 the `WorkerJob.isCompleted()` helper function is not required anymore. Signed-off-by: Martin Nonnenmacher <[email protected]>
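
To illustrate the last commit message, here is a minimal sketch of why a helper such as `isCompleted()` becomes redundant once the status carries a `final` flag. The `Status` enum, its entries, and the `Job` class are hypothetical stand-ins, not the real `JobStatus` or `WorkerJob` model.

```kotlin
/** Hypothetical stand-in for the real `JobStatus`; only the `final` flag matters here. */
enum class Status(val final: Boolean) {
    CREATED(false),
    RUNNING(false),
    FINISHED(true),
    FAILED(true)
}

/** Hypothetical job model holding just a status. */
data class Job(val status: Status)

// A helper like this repeats knowledge that the status already has...
fun Job.isCompleted(): Boolean = status == Status.FINISHED || status == Status.FAILED

// ...so call sites can ask the status directly instead.
fun allDone(jobs: List<Job>): Boolean = jobs.all { it.status.final }
```

In this sketch both checks always agree, so the helper can be dropped without changing behavior.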

oheger-bosch · 2025-01-08T13:56:09Z

orchestrator/src/main/kotlin/OrtRunInfo.kt

+import org.eclipse.apoapsis.ortserver.model.JobStatus
+
+/** A class to store the required information to determine which jobs can be run. */
+internal class OrtRunInfo(


While I like the idea of extracting the scheduling logic to a dedicated class, I have some problems with the current implementation:

IIUC, this class now contains the scheduling logic and is responsible for determining the next jobs that should run. The class name should reflect this. OrtRunInfo says nothing about scheduling and rather suggests a data model class.

The relation between this class and WorkerScheduleContext is unclear. Orchestrator now creates a WorkerScheduleContext and then uses it to create an OrtRunInfo, because the latter keeps its own state derived from the context (this is not really untangling). It would be better if OrtRunInfo were stateless and only implemented the scheduling strategy. The getNextJobs() function could be passed a WorkerScheduleContext object and obtain all required information from there.
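
The second point could look roughly like the sketch below: a stateless object whose only function receives the run state as an argument. All names (`JobScheduler`, `RunSnapshot`, `Step`, `State`) are hypothetical; `RunSnapshot` merely stands in for the information held by WorkerScheduleContext, and the dependency rules are simplified assumptions, not the orchestrator's actual logic.

```kotlin
/** Hypothetical, simplified job states. */
enum class State { RUNNING, FINISHED, FAILED }

/** Hypothetical steps; each one lists the steps it has to wait for. */
enum class Step(val dependsOn: Set<Step> = emptySet()) {
    ANALYZER,
    ADVISOR(setOf(ANALYZER)),
    SCANNER(setOf(ANALYZER)),
    EVALUATOR(setOf(ADVISOR, SCANNER))
}

/** Hypothetical stand-in for the data a schedule context would provide. */
data class RunSnapshot(
    val configured: Set<Step>,
    val jobs: Map<Step, State>
)

/** Stateless scheduling strategy: everything it needs arrives as an argument. */
object JobScheduler {
    fun getNextJobs(run: RunSnapshot): Set<Step> =
        run.configured.filter { step ->
            // Skip steps that already have a job, whatever its state.
            step !in run.jobs && step.dependsOn.all { dep ->
                // Assumption: an unconfigured dependency is ignored,
                // a configured one must have finished successfully.
                dep !in run.configured || run.jobs[dep] == State.FINISHED
            }
        }.toSet()
}
```

Because the scheduler holds no state, a test only needs to build a `RunSnapshot` and compare the returned set, without touching repositories or messaging.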

oheger-bosch · 2025-01-08T14:02:23Z

orchestrator/src/main/kotlin/WorkerScheduleInfo.kt

@@ -125,7 +125,7 @@ internal enum class WorkerScheduleInfo(
            configs.evaluator != null
    },

-    REPORTER(ReporterEndpoint, runsAfter = listOf(EVALUATOR), runAfterFailure = true) {
+    REPORTER(ReporterEndpoint, dependsOn = listOf(ANALYZER), runsAfter = listOf(EVALUATOR), runAfterFailure = true) {


It was a deliberate decision that the Reporter step should always be executed, to enable use cases like an "ORT run report" that covers the whole run with its successful and failing steps. So I would be reluctant to say that it generally makes no sense to run the reporter after a failed analyzer step. In any case, the type "fix" is not correct for this commit: the previous behavior was not a bug but by design.
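
The effect of the added `dependsOn` can be made concrete with a small model. This is only a sketch under assumed semantics: `dependsOn` is taken to mean "must have succeeded", `runsAfter` to mean "must no longer be pending", and `runAfterFailure` to mean "may start although an earlier step failed". The `Rule` and `Outcome` types and the string step names are hypothetical and do not mirror the real `WorkerScheduleInfo`.

```kotlin
/** Hypothetical outcome of a step within one run. */
enum class Outcome { PENDING, SUCCEEDED, FAILED, SKIPPED }

/** Hypothetical, simplified scheduling rule for a single worker. */
data class Rule(
    val dependsOn: Set<String> = emptySet(),
    val runsAfter: Set<String> = emptySet(),
    val runAfterFailure: Boolean = false
) {
    fun canRun(outcomes: Map<String, Outcome>): Boolean {
        // Steps missing from the map are treated as still pending.
        fun outcome(step: String) = outcomes[step] ?: Outcome.PENDING

        val dependenciesMet = dependsOn.all { outcome(it) == Outcome.SUCCEEDED }
        val predecessorsDone = runsAfter.all { outcome(it) != Outcome.PENDING }
        val noFailure = outcomes.values.none { it == Outcome.FAILED }
        return dependenciesMet && predecessorsDone && (runAfterFailure || noFailure)
    }
}

// The reporter rule before and after the change in this diff (as modeled here).
val reporterBefore = Rule(runsAfter = setOf("EVALUATOR"), runAfterFailure = true)
val reporterAfter = reporterBefore.copy(dependsOn = setOf("ANALYZER"))
```

Under these assumptions, a run whose analyzer failed still reaches the reporter with `reporterBefore` but not with `reporterAfter`, which is exactly the "run report" use case the change would remove.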

mnonnenmacher added 10 commits January 6, 2025 20:09

chore(orchestrator): Fix a function name

18acb53

Signed-off-by: Martin Nonnenmacher <[email protected]>

chore(orchestrator): Remove unnecessary whitespace

f5b0f6d

Signed-off-by: Martin Nonnenmacher <[email protected]>

chore(orchestrator): Remove an unnecessary suppression

be8a3cb

Signed-off-by: Martin Nonnenmacher <[email protected]>

refactor(orchestrator): Rename getCurrentOrtRun to getOrtRun

391a733

The word "current" does not carry any meaning in the context of the function. Signed-off-by: Martin Nonnenmacher <[email protected]>

refactor(orchestrator): Extract scheduling logic

8bb979a

Extract the logic to determine which jobs should run next into a new `OrtRunInfo` class to be able to test it independently. The class will be taken into use in a follow-up commit. Signed-off-by: Martin Nonnenmacher <[email protected]>

fix(orchestrator): Make the reporter depend on the analyzer

5c10474

If the analyzer did not run, it makes no sense to run the reporter worker as there is no data to report. Signed-off-by: Martin Nonnenmacher <[email protected]>

chore(orchestrator): Remove an unnecessary helper function

3af763d

With the introduction of the `JobStatus.final` property in 4d23673 the `WorkerJob.isCompleted()` helper function is not required anymore. Signed-off-by: Martin Nonnenmacher <[email protected]>

oheger-bosch reviewed Jan 8, 2025
