-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix broken Docker recipe and update dependencies #7
Conversation
seaborn==0.12.2 \ | ||
voila==0.4.1 \ | ||
"colabfold[alphafold] @ git+https://github.com/sokrypton/ColabFold" && \ | ||
&& \ | ||
# As of Nov 2023, colabfold requires 0.3.25 <= jax < 0.4.0, which leads to build errors. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@anuprulez you need to decide if we can remove that.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Strictly speaking, tensorflow errors have been an issue for the past release as well and we now have another IT with a separate container (v. 0.2) that provides the colabfold service.
Option 2: If we want to keep it, I could try to provide colabfold out of a conda env installed in this container so that Tensorflow et al. still work.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Some how I do not get this email even though I am subscribed to this repo. Not even in my spam. Sorry for that!
I will have a look at it today :)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I agree that Colabfold has issues with the latest versions of TensorFlow, maybe with CUDA as well. We can remove it from v0.3 and later versions of this Docker container. We already have it on v0.2 in case it is needed for my defense.
@tuncK @bgruening
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@bgruening, shall we merge this?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sure.
@anuprulez do you want to create a new release? |
@anuprulez is the IT on .eu still broken? Because that is why I had started this. |
@tuncK yes, its broken I think with the following error message when I run a tensorflow notebook:
|
I can create a new release but after verifying all the notebooks (except those using Colabfold) by running this container on my VM. Probably next week. |
The newly released v0.4 version of this tool throws the same error as above as well as the v0.3:
It is not possible to train models using Tensorflow. However, TF recognises GPU but fails while training any model. |
Can you please try this again tomorrow: I tried usegalaxy-eu/infrastructure-playbook#1067 |
I tried it, but unfortunately, it did not work. I tried this solution already directly in the Docker container as well. Still, it does not find libdevice file even though it is present at /opt/conda/nvvm/. I will look into it. |
The last time I was dealing with this, it had something to do with:
|
Jax version dictated by colabfold is too old resulting in a docker build error.
Also updated tensorflow to 2.15 & CUDA to 12.2 along with other packages.