System.IO.Tests crash in CI (Linux arm64) #100441
Comments
Tagging subscribers to this area: @dotnet/area-system-io
Exit code 137 means the process was killed with SIGKILL, which usually indicates out of memory. The tests started to fail not only in main but also in older branches where we have not touched the code at all: #100558. @dotnet/area-infrastructure-libraries, is it possible that the test VMs simply have less memory available now?
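For reference, shell-style exit statuses above 128 encode 128 plus the number of the signal that terminated the process, so 137 corresponds to signal 9 (SIGKILL). That is what the Linux OOM killer sends, but other things can send it too. A minimal sketch of the decoding, where `describe_exit_code` is a hypothetical helper and not part of the runtime test infrastructure:

```python
import signal


def describe_exit_code(code: int) -> str:
    """Explain a shell-style exit status (hypothetical helper for illustration)."""
    if code > 128:
        # Statuses above 128 mean "terminated by signal (code - 128)".
        signum = code - 128
        try:
            name = signal.Signals(signum).name
        except ValueError:
            # The signal number is not defined on this platform.
            name = f"signal {signum}"
        return f"killed by {name}"
    return f"exited with status {code}"


print(describe_exit_code(137))
```

The exit status alone only says the process received SIGKILL; it does not say which resource ran out.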
I don't think that we have access to that information for a Helix test client. It might make sense to print some diagnostics in the RunTests.sh/cmd script, e.g. available RAM and disk space.
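A rough sketch of the kind of diagnostics meant here, written in Python for illustration only. The real change would live in RunTests.sh/cmd, and `parse_meminfo` and `print_diagnostics` are hypothetical names. It assumes a Linux client where `/proc/meminfo` is readable and falls back gracefully otherwise:

```python
import shutil


def parse_meminfo(text: str) -> dict:
    """Parse /proc/meminfo-style text into {field name: value in KiB}."""
    info = {}
    for line in text.splitlines():
        key, sep, rest = line.partition(":")
        parts = rest.split()
        if sep and parts and parts[0].isdigit():
            info[key.strip()] = int(parts[0])
    return info


def print_diagnostics(path: str = ".") -> None:
    """Print available RAM and free disk space before the tests start."""
    try:
        with open("/proc/meminfo") as f:
            mem = parse_meminfo(f.read())
        print(f"MemAvailable: {mem.get('MemAvailable', 0) // 1024} MiB")
    except OSError:
        print("MemAvailable: unknown (/proc/meminfo not readable)")
    usage = shutil.disk_usage(path)
    print(f"Disk free at {path}: {usage.free // (1024 * 1024)} MiB")


print_diagnostics()
```

Logging these values at the start of every work item would let a failing run be compared against a passing one on the same queue.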
@adamsitnik I'd be surprised if something like that happened, but we can double check: @dotnet/dnceng do you know? The thing is, this OOM failure is only happening in System.IO and System.IO.Net5Compat. I am pretty sure I don't see it anywhere else.
This is an intermittent issue, so maybe widen the date range a bit more? When was the last time a System.IO change happened in servicing before April?
The failure was most likely triggered by a Linux kernel update, a Docker container update, or a test infra update. These updates are rolled out regularly in the background. I do not think it is a good use of time to try to find the exact update that triggered this failure months ago. We won't be able to do much with that information. The failure is likely triggered by a test that consumes too many resources. It does not have to be direct memory use. For example, the test can be creating too many file handles, which then manifests as 137. I think we should try to find the offending test or tests, e.g. by trying to reproduce the failure with verbose logging.
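One way to make such a repro more informative is to log file-handle usage next to each test name. The sketch below is illustrative Python, not the xunit or Helix tooling. `open_fd_count` and `fd_soft_limit` are hypothetical helpers that assume a Linux `/proc` filesystem and return `None` where the information is unavailable:

```python
import os


def open_fd_count():
    """Number of file descriptors open in this process, or None if unknown."""
    try:
        return len(os.listdir("/proc/self/fd"))
    except OSError:
        return None


def fd_soft_limit():
    """Soft limit on open file descriptors, or None on non-Unix platforms."""
    try:
        import resource  # Unix-only standard library module
    except ImportError:
        return None
    soft, _hard = resource.getrlimit(resource.RLIMIT_NOFILE)
    return soft


print(f"open fds: {open_fd_count()}, soft limit: {fd_soft_limit()}")
```

A count that keeps climbing across tests would point at a handle leak rather than plain memory pressure.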
Failed for below leg in runtime-coreclr libraries-pgo/20240810.1
Removing the 'blocking-clean-ci' label as it has not failed in 30 days.
Fixed by #107163
Build Information
Build: https://dev.azure.com/dnceng-public/cbb18261-c48f-4abb-8651-8cdcb5474649/_build/results?buildId=623676
Build error leg or test failing: System.IO.Tests.WorkItemExecution
Pull request: #100433
Error Message
Fill the error message using step by step known issues guidance.
Known issue validation
Build: 🔎 https://dev.azure.com/dnceng-public/public/_build/results?buildId=623676
Error message validated: [arm64 System.IO.Tests Killed -- exit code 137 --]
Result validation: ✅ Known issue matched with the provided build.
Validation performed at: 3/29/2024 2:52:42 PM UTC
Report
Summary