Running problem on Linux server #342
Comments
Hi @looperalt, My guess is something went wrong when trying to preprocess the dataset - did you check your terminal before that last error message to see if preprocessing was actually successful (e.g., the progress bar for preprocessing showed up, was completed, etc)? Remember, you have to preprocess the dataset for the first time and anytime you change key parameters that would affect preprocessing in your config file (refer to the example config files for more details). Also, note the project README and how datasets are expected to be organized (in most cases, the way they are downloaded). If it seems like preprocessing somehow starts and then abruptly stops, you could try seeing if the issue is the default |
Yes, a progress bar appears during preprocessing, but when the progress bar reaches the end, this issue occurs. However, it's strange that when I run the code locally on Windows, there is no problem. When I run it on a Linux remote server, this issue arises. I have followed the process outlined in the README file without any operational errors, and I am puzzled as to why there would be different results when running the code on different systems. I would like to know if you have encountered this issue and how to adjust the parameters when running this code on a Linux remote server |
If you can run it locally on a Windows machine but somehow not on a Linux server, are you sure there isn't something going wrong with how your dataset (whether the dataset itself or the preprocessed folder) is being pointed to in the Linux case? I guess if the Linux remote server has too much CPU usage going on, that could also lead to stuck or dead processes that prevent successful preprocessing, but if you use Feel free to share the config you're trying to run here, perhaps there is something up with the file paths that I can identify. |
When running on the Linux server, the progress bar advances normally, but after the progress bar is full, it displays a |
If it was read and not preprocessed or saved, again, try two things 1) adjust If 1) does not make a difference at all, let me know, and we can dig into other things that may be specific to your situation and causing issues. I should note, the majority of toolbox users (i.e., hundreds of people, myself included) use Linux and the default multi-process setting without any issue, so troubleshooting with respect to your particular remote server is the way to go. |
I have tried the methods you suggested, continuously adjusting the parameters of the multi-process function, but unfortunately, the error still persists. |
Ok, so we've ruled out multi-processing as far as too many processes being the issue. Here's a few more things to try:
All the best, Akshay |
I have confirmed that my preprocessing folder has write permissions. In fact, there is something very strange: when I first tried to run the code, .npy files appeared in the preprocessing folder, but when I closed the program and ran it again, the .npy files never appeared again. This is a very confusing situation for me. |
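A quick way to re-check both points in the comment above (write permission, and whether any .npy files are actually present after a run) is a small standalone script. This is only a sketch: `inspect_preprocessed_dir` is a hypothetical helper, not part of the toolbox, and the folder path you pass in is whatever your config points to.

```python
import os
from pathlib import Path


def inspect_preprocessed_dir(folder):
    """Report whether `folder` exists, is writable, and how many .npy files it holds.

    Hypothetical helper for troubleshooting; not part of the toolbox.
    """
    folder = Path(folder)
    exists = folder.is_dir()
    report = {
        "exists": exists,
        # os.access checks permissions for the user running this script
        "writable": exists and os.access(folder, os.W_OK),
        "npy_count": 0,
    }
    if exists:
        # Search recursively, since preprocessed files may sit in subfolders
        report["npy_count"] = sum(1 for _ in folder.rglob("*.npy"))
    return report


if __name__ == "__main__":
    # Placeholder path: replace with the preprocessed-data folder from your config
    print(inspect_preprocessed_dir("path/to/preprocessed_folder"))
```

Running it before and after a preprocessing attempt, as the same user that launches the toolbox, should show whether files are being written at all or written somewhere other than where they are later looked up.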
I would troubleshoot that a bit more, that does sound strange and it's really hard for me to tell what the issue might be since it sounds quite specific to your remote server / your environment. Is it safe to assume, using |
When I run this code, the usage of all CPU cores exceeds 100%. The server I use has 16 cores and 80 GB of memory. When I run the code directly, the reported CPU usage is more than 1600%, and after a while the program exits and shows this error. Could this be related to my memory size and CPU performance? |
When I run the code on a different server, I get the following error |
The following error is displayed when using RF as the face processing module
|
I suspect this has something to do with your file path for the Haar Cascade model or perhaps related libraries that were installed (e.g., OpenCV).
That, or something similar to check if the detector was initialized properly, should tell us if the issue is with the detector initialization itself. My guess at this point is that somehow the Haar Cascade
|
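The detector check mentioned above can be sketched roughly as follows. This assumes OpenCV's Python bindings (`cv2`) are installed; `check_cascade` is a hypothetical helper name, and the path in the example is a placeholder rather than the toolbox's actual model location. `cv2.CascadeClassifier(...).empty()` returns True when no cascade was loaded.

```python
import os


def check_cascade(path):
    """Return (ok, reason) describing whether a Haar Cascade XML file loads.

    Hypothetical troubleshooting helper; not part of the toolbox.
    """
    # A wrong or relative path is the first thing to rule out
    if not os.path.isfile(path):
        return False, f"file not found: {path}"

    import cv2  # imported here so the path check works even without OpenCV

    detector = cv2.CascadeClassifier(path)
    # empty() is True when the classifier failed to load the cascade
    if detector.empty():
        return False, f"OpenCV could not load a cascade from: {path}"
    return True, "detector initialized"


if __name__ == "__main__":
    # Placeholder path: replace with the cascade file your setup points to
    print(check_cascade("path/to/haarcascade_frontalface_default.xml"))
```

If the file is found but the detector is still empty, the XML file itself or the OpenCV installation is the more likely culprit than the path.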
I'm facing the same issue, but for me it's: ValueError: ('test', 'No files in file list'). The same project runs fine locally, but when I upload it to Drive and run the code from Google Colab, it shows this error. I have triple-checked the paths, and the files do appear to be loaded into memory, since RAM usage goes up. After the progress bar reaches 100%, it throws this error and exits the program. I have changed multiprocess_quota to 1, but I still get the same error. |
I'd recommend making a separate issue regarding this with more details, but I should stress this toolbox was never designed nor tested to run out-of-the-box on Google Colab, so you may have to figure this out on your own unless someone else has also done this successfully and can help. |
When I run iBVPnet on a Linux server, the following problems occur:
TypeError: Binding inputs to tf.function wrapped_fn failed due to Can not cast TensorSpec(shape=(1, 1024, 1365, 4), dtype=tf.float32, name=None) to TensorSpec(shape=(None, None, None, 3), dtype=tf.float32, name=None). Received args: (array([[[[149. , 155. , 163. , 0. ]
and
ValueError: ('train', 'No files in file list')
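One possible reading of the TypeError above is that a frame with 4 channels (shape ending in 4) is being passed to a function that expects 3 (shape ending in 3). If the extra channel is something like an alpha channel, which is an assumption and not something the error message confirms, dropping it before the frame reaches the detector would look roughly like this. `to_three_channels` is a hypothetical helper, and where it would need to be applied in the pipeline is not established here.

```python
import numpy as np


def to_three_channels(frame):
    """Return `frame` with exactly 3 channels on its last axis.

    Assumes a 4th channel, if present, is an extra channel (for example
    alpha) that can be discarded. Hypothetical helper for illustration.
    """
    frame = np.asarray(frame)
    channels = frame.shape[-1]
    if channels == 3:
        return frame
    if channels == 4:
        # Keep the first three channels, drop the fourth
        return frame[..., :3]
    raise ValueError(f"expected 3 or 4 channels, got {channels}")


if __name__ == "__main__":
    # Same layout as in the error message: (batch, height, width, channels)
    batch = np.zeros((1, 1024, 1365, 4), dtype=np.float32)
    print(to_three_channels(batch).shape)
```

This does not address the 'No files in file list' error, which may simply follow from preprocessing having failed on every frame.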