Feature: data letter queue #73

Merged: 16 commits into main from feat/data-letter-queue on Aug 13, 2024
Conversation

@heemankv (Contributor) commented on Aug 8, 2024:

This PR resolves issue #55.

  • Implements a Dead Letter Queue (DLQ) handler (a rough sketch follows this list).
  • Adds last_job_status to job metadata.
  • Updates runs and tests to spin up the appropriate queues.
  • Adds a test for handle_job_failure.
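A rough sketch of the handler described above, for orientation only; the exact signatures, the `config()` accessor, and the `JobError` type are assumptions rather than this repo's API:

```rust
use uuid::Uuid;

// Hypothetical shape of the DLQ handler; names are illustrative.
async fn handle_job_failure(job_id: Uuid) -> Result<(), JobError> {
    let mut job = config().database().get_job(job_id).await?;

    // A Completed job should never be redelivered to the DLQ; log and skip
    // instead of stopping the orchestrator (see the review thread below).
    if job.status == JobStatus::Completed {
        log::error!("Invalid state exists on DL queue: {}", job.status.to_string());
        return Ok(());
    }

    // Preserve the status the job failed from, then mark the job Failed.
    job.metadata.insert("last_job_status".to_string(), job.status.to_string());
    job.status = JobStatus::Failed;
    config().database().update_job(&job).await
}
```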

@heemankv self-assigned this on Aug 8, 2024.
@apoorvsadana (Contributor) left a comment:

Original PR was #56. Created this one because of merge conflicts, as explained on Slack.

@EvolveArt (Contributor) left a comment:

We should probably add some tests for the edge cases where it would fail.
Also, am I correct that in this PR the DLQ is implemented but never actually used by the orchestrator?

@@ -127,6 +127,8 @@ impl TestConfigBuilder {
self.storage.unwrap(),
);

drop_database().await.unwrap();
Reviewer (Contributor):

?

@heemankv (Author):

This would need more explanation. Ideally it should have been a separate PR; I'll keep that in mind.

But if your question is either of these:

  1. Why is drop_database implemented in TestConfigBuilder?
  2. Why are we using unwrap() instead of ??

Then:

  1. This ensures that each test case has a fresh database to work with, so no state overlaps between test cases.
  2. Our assumption is that there is no benefit to a test case returning an Error; since it's a checking procedure, we are fine throwing the error there directly.

Reviewer (Contributor):

Yeah, the question was more: why do we drop in the middle of the code?

@heemankv (Author):

Since this is an initialization function for all the clients, which runs before each test, we can drop the database at any point in this code.
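A condensed sketch of that initialization path; the field names and `drop_database` helper are stand-ins for the repo's own code, and the real builder wires up more clients:

```rust
impl TestConfigBuilder {
    // Runs before every test; fields here are illustrative.
    pub async fn build(self) -> Config {
        let config = Config::new(
            self.starknet_client.unwrap(),
            self.storage.unwrap(),
        );

        // Each test starts from an empty database, so the drop can happen at
        // any point during setup. unwrap() is acceptable in test setup: a
        // failure should abort the test run immediately.
        drop_database().await.unwrap();

        config
    }
}
```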

#[case(JobType::SnosRun, JobStatus::Failed)]
#[tokio::test]
async fn handle_job_failure_with_failed_job_status_works(#[case] job_type: JobType, #[case] job_status: JobStatus) {
TestConfigBuilder::new().build().await;
Reviewer (Contributor):

move this to a fixture

@heemankv (Author):

As mentioned here:

TestConfigBuilder allows customisation of any external client of our choice; moving it to a global fixture is not feasible, since different test cases have different customisation requirements.

We can make a fixture for tests under the same scope if they require the same customised external clients.

For example: if all tests under da_job require the same config customisation, they can implement a fixture just for themselves, and similarly for other scopes (see the sketch below).

We can create a separate issue for this and resolve it there.
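A sketch of such a per-scope fixture using rstest's async fixtures; the fixture name and the assumption that all da_job tests share one default configuration are illustrative, not the repo's actual layout:

```rust
use rstest::{fixture, rstest};

// Hypothetical fixture shared by the tests in the da_job scope.
#[fixture]
async fn da_job_config() {
    TestConfigBuilder::new().build().await;
}

#[rstest]
#[tokio::test]
async fn some_da_job_test(#[future] da_job_config: ()) {
    // Resolve the async fixture before running the test body.
    da_job_config.await;
    // ... test body ...
}
```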

Reviewer (Contributor):

Sounds good to me.


// A Completed job should never reach the DLQ; skip it rather than stop the orchestrator.
if job.status == JobStatus::Completed {
    log::error!("Invalid state exists on DL queue: {}", job.status.to_string());
    return Ok(());
}
Reviewer (Contributor):

Why do we fail silently here?

@heemankv (Author):

The DL queue is supposed to handle actual failed cases. If a JobStatus::Completed job is pushed to the DL queue multiple times by the queuing agent, we prefer failing silently over stopping the orchestrator.

crates/orchestrator/src/jobs/types.rs (resolved; outdated)

#[rstest]
#[case::pending_verification(JobType::SnosRun, JobStatus::PendingVerification)]
#[case::verification_timeout(JobType::SnosRun, JobStatus::VerificationTimeout)]
Reviewer (Contributor):

Why are the other statuses not covered?

@heemankv (Author) commented on Aug 9, 2024:

We initially covered all the statuses, as shown here, but it felt redundant to test them all, so they were removed.

What do you suggest, @EvolveArt?

Reviewer (Contributor):

Well, even if it feels redundant, I think it's important to have them; you want to avoid having jobs in an unexpected state in the DLQ.

@heemankv (Author):

Valid point; implemented.
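For reference, fuller coverage might look like the following; only PendingVerification and VerificationTimeout appear in the excerpt above, so the remaining JobStatus variant names are illustrative assumptions:

```rust
#[rstest]
#[case::created(JobType::SnosRun, JobStatus::Created)]
#[case::locked_for_processing(JobType::SnosRun, JobStatus::LockedForProcessing)]
#[case::pending_verification(JobType::SnosRun, JobStatus::PendingVerification)]
#[case::verification_failed(JobType::SnosRun, JobStatus::VerificationFailed)]
#[case::verification_timeout(JobType::SnosRun, JobStatus::VerificationTimeout)]
#[tokio::test]
async fn handle_job_failure_works(#[case] job_type: JobType, #[case] job_status: JobStatus) {
    TestConfigBuilder::new().build().await;
    // ... create a job with `job_status`, invoke handle_job_failure, then
    // assert the job is Failed and `last_job_status` is preserved in metadata.
}
```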

@heemankv changed the title from "Feat/data letter queue" to "Feature: data letter queue" on Aug 9, 2024.
@heemankv (Author) commented on Aug 9, 2024:

> We should probably add some tests for the edge cases where it would fail. Also, am I correct that in this PR the DLQ is implemented but never actually used by the orchestrator?

Please feel free to mention the edge cases to run the tests on.
And yes, this PR adds support for the DLQ, after which we would set up the DL queue to be used in the future.
@apoorvsadana feel free to add more!
Thanks

@apoorvsadana (Contributor) commented:

@EvolveArt, the DLQ is being used. You need to set it up on SQS/RabbitMQ etc.; when messages fail, they automatically go to the DLQ and the corresponding jobs are moved to the failed state.
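For SQS specifically, that wiring is a redrive policy on the source queue. A minimal sketch with the aws-sdk-sqs crate (assuming a recent SDK version where QueueAttributeName lives in `types`); the queue URL, DLQ ARN, and receive count are placeholders, not this repo's configuration:

```rust
use aws_sdk_sqs::{types::QueueAttributeName, Client};

// Attach a dead-letter queue to an existing source queue. After
// maxReceiveCount failed receives, SQS moves the message to the DLQ.
async fn attach_dlq(client: &Client) -> Result<(), aws_sdk_sqs::Error> {
    let redrive_policy =
        r#"{"deadLetterTargetArn":"arn:aws:sqs:us-east-1:000000000000:job_dlq","maxReceiveCount":"5"}"#;
    client
        .set_queue_attributes()
        .queue_url("https://sqs.us-east-1.amazonaws.com/000000000000/job_queue")
        .attributes(QueueAttributeName::RedrivePolicy, redrive_policy)
        .send()
        .await?;
    Ok(())
}
```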

@EvolveArt (Contributor) left a comment:

LGTM

@heemankv added the enhancement (New feature or request) label on Aug 13, 2024.
@heemankv merged commit 73355ca into main on Aug 13, 2024 (8 checks passed).
@heemankv deleted the feat/data-letter-queue branch on August 13, 2024 at 05:07.
ocdbytes pushed a commit that referenced this pull request on Oct 16, 2024:
* dl-queue: added termination queue

* dl-queue: spawn consumer to a macro_rule

* dl-queue: test for handle_job_failure

* dl-queue: handle_job_failure failed test case

* dl-queue: fixed test cases

* dl-queue: tests fixed

* dl-queue: assert optimised

* dl-queue: DL job rewritten tests

* dl-queue: formatting changes

* dl-queue: update mod.rs

* dl-queue: lint fixes

* dl-queue: using strum for JobStatus Display

* dl-queue: added test cases for handle_job_failure_with_failed_job_status_works

* fix: testcase
Labels: enhancement (New feature or request)
Participants: 3