Unify execute API of JobQueue, refactor websocket connection #6558

berland · 2023-11-13T13:03:57Z

Put websocket connection handling into only one (async) function, that will run until it is finished.
Reduce the execute functions to only one, letting both legacy ensemble and simulation context use it.

The simulation_context now has to call it in an async loop, and it is also doing slightly more work (logging changes) than before.

Pre review checklist

Read through the code changes carefully after finishing work
Make sure tests pass locally (after every commit!)
Prepare changes in small commits for more convenient review (optional)
PR title captures the intent of the changes, and is fitting for release notes.
Updated documentation
Ensured that unit tests are added for all new behavior (See
Ground Rules),
and changes to existing code have good test coverage.

Pre merge checklist

Added appropriate release note label
Commit history is consistent and clean, in line with the contribution guidelines.

codecov-commenter · 2023-11-14T11:12:04Z

Codecov Report

Attention: 5 lines in your changes are missing coverage. Please review.

Comparison is base (cd4bc85) 83.51% compared to head (ceb0b47) 83.51%.

Files	Patch %	Lines
src/ert/job_queue/queue.py	93.42%	5 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #6558      +/-   ##
==========================================
- Coverage   83.51%   83.51%   -0.01%     
==========================================
  Files         346      346              
  Lines       20764    20778      +14     
  Branches      948      948              
==========================================
+ Hits        17341    17352      +11     
- Misses       3129     3132       +3     
  Partials      294      294

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

berland · 2023-11-14T13:01:14Z

Tested manually with GUI for poly_example, bigpoly and also with everest's egg model.

berland · 2023-11-14T13:07:41Z

Up for discussion: Should this be squashed?
(there are some minor details to be fixed if not squashed, some print-statements and maybe one mypy issue)

jonathan-eq

Great Job!

jonathan-eq · 2023-11-14T13:55:31Z

src/ert/job_queue/queue.py

    ) -> None:
+        assert self.ens_id is not None  # mypy
        events = deque(


Why do we use deque here instead of a normal queue?

probably a local and minor optimization thing to have a double-ended queue.

jonathan-eq · 2023-11-14T14:32:47Z

src/ert/job_queue/queue.py

+        if self._ee_uri is None:
+            # If no ensemble evaluator present, we will publish to the log
+            while (
+                change := await self._changes_to_publish.get()


I did not know about this (walrus operator :=). Thanks🎉

src/ert/ensemble_evaluator/_builder/_legacy.py

xjules · 2023-11-15T11:51:01Z

src/ert/job_queue/queue.py

        experiment_id: Optional[str] = None,
    ) -> None:
        for q_index, q_node in enumerate(self.job_list):
            cert_path = f"{q_node.run_path}/{CERT_FILE}"
-            if cert is not None:
+            if self._ee_cert is not None:


Was it not clearer when conninfo was passed directly into the function as a parameter?

It was more explicit, but that explicitness would f.ex make it possible to mixup the ee connection for the execute function with what is given in this jobs file. I think it is cleaner to set the connection info just one place in the Queue object.

The only thing is that Queue does not need to know about connection setup between ee and job_runner; hence my comment on passing it rather as a function params 🤷

That is correct, I am removing that "feature" to have different connection info.

src/ert/simulator/simulation_context.py

xjules · 2023-11-15T14:14:32Z

Up for discussion: Should this be squashed? (there are some minor details to be fixed if not squashed, some print-statements and maybe one mypy issue)

I would squash the fixup commits at least :)

xjules

I think it looks good! If there are troubles with mypy I'd squash it as it sort of addresses the same thing.

The websocket connection from the JobQueue is refactored to use a single asyncio task for fetching changes to be published, publish them, and also keep the connection open. The queue execute function had two variants, one that included websocket communication (for "legacy" ert, and one for Everest, the "simulation context"). Now there is only one, which is async. The changes that are published to the websocket channel is in the simulation context (Everest) logged to the debug log. Previously that information was not propagated. The connection to the ensemble evaluator is now configured once, and from now on it is not possible to have the job_runner use a different connection than the JobQueue uses for publishing its realization state changes.

berland force-pushed the jobqueue_unify_execute_api branch 4 times, most recently from d805142 to 41868ba Compare November 14, 2023 11:00

berland force-pushed the jobqueue_unify_execute_api branch from 0c76269 to ceb0b47 Compare November 14, 2023 12:09

berland marked this pull request as ready for review November 14, 2023 12:18

berland changed the title ~~Jobqueue unify execute api~~ Unify execute API of JobQueue, refactor websocket connection Nov 14, 2023

berland self-assigned this Nov 14, 2023

berland added the release-notes:maintenance Automatically categorise as maintenance change in release notes label Nov 14, 2023

jonathan-eq self-requested a review November 14, 2023 14:52

jonathan-eq approved these changes Nov 14, 2023

View reviewed changes

xjules reviewed Nov 15, 2023

View reviewed changes

src/ert/ensemble_evaluator/_builder/_legacy.py Show resolved Hide resolved

xjules reviewed Nov 15, 2023

View reviewed changes

src/ert/simulator/simulation_context.py Show resolved Hide resolved

xjules approved these changes Nov 15, 2023

View reviewed changes

berland force-pushed the jobqueue_unify_execute_api branch from c327ea1 to 1ddc0c8 Compare November 15, 2023 15:04

berland enabled auto-merge (rebase) November 15, 2023 15:05

berland merged commit 0569a3d into equinor:main Nov 15, 2023
41 checks passed

berland deleted the jobqueue_unify_execute_api branch November 16, 2023 14:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unify execute API of JobQueue, refactor websocket connection #6558

Unify execute API of JobQueue, refactor websocket connection #6558

berland commented Nov 13, 2023 •

edited

Loading

codecov-commenter commented Nov 14, 2023 •

edited

Loading

berland commented Nov 14, 2023

berland commented Nov 14, 2023 •

edited

Loading

jonathan-eq left a comment

jonathan-eq Nov 14, 2023

berland Nov 15, 2023

jonathan-eq Nov 14, 2023

xjules Nov 15, 2023 •

edited

Loading

berland Nov 15, 2023

xjules Nov 15, 2023

berland Nov 15, 2023

xjules commented Nov 15, 2023 •

edited

Loading

xjules left a comment

Unify execute API of JobQueue, refactor websocket connection #6558

Unify execute API of JobQueue, refactor websocket connection #6558

Conversation

berland commented Nov 13, 2023 • edited Loading

Pre review checklist

Pre merge checklist

codecov-commenter commented Nov 14, 2023 • edited Loading

Codecov Report

berland commented Nov 14, 2023

berland commented Nov 14, 2023 • edited Loading

jonathan-eq left a comment

Choose a reason for hiding this comment

jonathan-eq Nov 14, 2023

Choose a reason for hiding this comment

berland Nov 15, 2023

Choose a reason for hiding this comment

jonathan-eq Nov 14, 2023

Choose a reason for hiding this comment

xjules Nov 15, 2023 • edited Loading

Choose a reason for hiding this comment

berland Nov 15, 2023

Choose a reason for hiding this comment

xjules Nov 15, 2023

Choose a reason for hiding this comment

berland Nov 15, 2023

Choose a reason for hiding this comment

xjules commented Nov 15, 2023 • edited Loading

xjules left a comment

Choose a reason for hiding this comment

berland commented Nov 13, 2023 •

edited

Loading

codecov-commenter commented Nov 14, 2023 •

edited

Loading

berland commented Nov 14, 2023 •

edited

Loading

xjules Nov 15, 2023 •

edited

Loading

xjules commented Nov 15, 2023 •

edited

Loading