-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[OPS] [BUG] Preparation job stuck during 15 hours before being generated #1088
Comments
New occurrence on sample processing report |
New occurrence
The behaviour same to be the same for |
New occurrence 10 timeout exection for l0p
4 HK/TM impacted |
Environment:
rs-addon s3-l0p v1.14
Traceability:
Current Behavior:
On sample processing, we forecast to download 2 sessions S3B
All L0 Product have been generated during this period except 4 products generated after more than 15h00:
hereafter we saw for HKTM prouct 2 of these with ingestion_time over 15hours (ingestion_time is difference between start_processing_date and t0_pdgs_date)
We saw these 4 job have been generated at about 08:00:00 by the preparation_worker
Products and jobs impacted:
JobOrder.149473.xml (MW0)
JobOrder.149509.xml (HKTM2)
JobOrder.149472.xml (DOP)
JobOrder.149494.xml (HKTM)
Expected Behavior:
Job should be successfully generated by preparation once all data needed data were available.
Steps To Reproduce:
This issue has been detected on sample processing S3
Test execution artefacts (i.e. logs, screenshots…)
Hereafter preparation log
s3-l0p-part1-preparation-worker-v52-d9dcb799f-9fzpj.log.gz
sample mongo job 149494:
joborder149494.json
sample mongo job 149509:
joborder149509.json
Whenever possible, first analysis of the root cause
On preparation log we saw these 4 jobs seems not detected as completed due to missing TM_0_NAT products.
sample for last job created 149494
Detection search start:
Product detected (we saw NAT products)
But it seems not enought to generate execution:
Once receive new AUX data the following days at 08:00:11, it seems restart job generation processus whhich raised an timeout which force processing
Note 2 NAT product selected ( S3B_TM_0_HKM__G_20240411T142342_20240411T160317_20240411T164017_5975______________SVL_O_NR_OPE.ISIP) for processing
Following tasktable
Herafter TM_NAT covering needed:
So product stuck because generated NAT files (142325 -->160259) not cover all product duration (142342 --> 160317)
So issue can talk about generation NAT which are shorter
All NAT job for this processing;
So we should explain why NAT generated by job 149479 was shorter
On generation log for S3B_TM_0_NAT____20240411T142325_20240411T160259_20240411T165222_5973_091_367______LN3_O_AL_002.SEN3 No error seen:
S3_L0P_S3B_TM_0_NAT____20240411T142325_20240411T160259_20240411T165222_5973_091_367______LN3_O_AL_002.log
Bug Generic Definition of Ready (DoR)
Bug Generic Definition of Done (DoD)
The text was updated successfully, but these errors were encountered: