Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

WIP: try generic worker with podman instead of docker for container engine #999

Closed
wants to merge 8 commits into from

Conversation

bhearsum
Copy link
Collaborator

This is a variant of #700 to see if podman works without issues with caches after some recent fixes.

The new image we're upgrading GPU workers to uses Ubuntu 24.04, which makes it incompatible with various parts of the pipeline (mostly due to Python package pinning). As it turns out, the easiest way to fix this is to dockerize the GPU tasks.

We need slight updates to GPU task payloads to accommodate this.

This will fix mozilla#391.
Now that we're running inside a docker image we don't have these available on the filesystem already. I explored the idea of installing them into the Docker image, but it's quite impractical. The host image is Ubuntu 24.04, and the containers Ubuntu 22.04. We need to have a matching toolkit version, and we require version 12 at this point, which isn't available on 22.04.
Without this we end up with these files being inaccessible in subsequent tasks.
This has always been needed, but it was found on the host system on the previous image.
@bhearsum
Copy link
Collaborator Author

bhearsum commented Jan 27, 2025

Podman has had many issues (both here and in other places we use it); we're definitely not going this route for the time being.

@bhearsum bhearsum closed this Jan 27, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant