marius_preprocess triggers program aborted #81

Closed
VeritasYin opened this issue Dec 11, 2021 · 7 comments

@VeritasYin

Describe the bug
Running marius_preprocess or importing preprocess triggers the following error:

free(): invalid pointer
Aborted

To Reproduce
Steps to reproduce the behavior:

  1. Run the given example 'marius_preprocess output_dir/ --dataset fb15k'
    OR
  2. 'from marius.tools import preprocess' in Python

Environment
gcc version 9.3.0 (Ubuntu 9.3.0-17ubuntu1~20.04)
Python 3.8.5

@VeritasYin VeritasYin added the bug Something isn't working label Dec 11, 2021
@JasonMoho
Collaborator

Thanks for reporting, haven't seen this before. I'll try reproducing this tomorrow with the above environment.

Could you try working around this by using a different environment? The gcc version should be fine, but you can try Python 3.7 or 3.9 and see if that fixes it.

I also haven't deployed on Ubuntu 20 before and usually work off Ubuntu 18. So if changing the Python version doesn't work, you can try running with Docker using this dockerfile. This should ensure that there are no environment issues.

@JasonMoho JasonMoho self-assigned this Dec 12, 2021
@JasonMoho
Collaborator

@VeritasYin
Noticed in #82 that you got things running by downgrading to Python 3.7. Did you change anything else in your environment, or was that the only thing you needed to do?

@VeritasYin
Author

I first downgraded to Python 3.7 and then recompiled the code from scratch. This time it works, but it occasionally shows the following error when running under the CPU config: "libgomp: Thread creation failed: Resource temporarily unavailable".

@JasonMoho
Collaborator

Yeah, must be some strange issue with Python 3.8. I usually run on 3.6 or 3.7. We have a new version of the system releasing in February, and I'll make sure to address this issue for that release.

As for the libgomp issue, it's possible that too many threads are being created by OMP and PyTorch. You could try limiting the number of OMP threads by setting OMP_NUM_THREADS manually. I think by default this will be set to the number of cores on your machine.
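
A minimal sketch of that workaround, assuming the variable is set from Python before any OpenMP-backed library is imported. The helper name `cap_omp_threads` and the cap of 4 are illustrative assumptions, not part of Marius; setting the variable in the shell before launching works as well.

```python
import os


def cap_omp_threads(max_threads: int = 4) -> int:
    """Set OMP_NUM_THREADS to at most max_threads (hypothetical helper).

    OpenMP runtimes read this variable when they initialize, so call this
    before importing libraries that use OpenMP.
    """
    cores = os.cpu_count() or 1          # cpu_count() may return None
    n = max(1, min(max_threads, cores))  # never more threads than cores
    os.environ["OMP_NUM_THREADS"] = str(n)
    return n


n_threads = cap_omp_threads(4)  # 4 is an arbitrary example value
# Import OpenMP-backed libraries (e.g. torch, marius) only after this point.
```

If the error persists with a small thread count, the thread limit is probably not the only factor, and the per-user process limit on the machine may be worth checking.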

@VeritasYin
Author

@JasonMoho Thanks for your prompt response. Another issue with preprocessing is that for homophily graphs such as the ogbl datasets, it does not generate the file "rel_mapping.txt", which triggers errors during postprocessing when exporting node embeddings.

@JasonMoho
Collaborator

JasonMoho commented Dec 13, 2021

No problem! Appreciate that you are reporting these issues.

For our upcoming release we have completely revamped the preprocessing and postprocessing, and this mapping issue is fixed there. Sorry about the current alpha state that things are in. It will be quite a bit more polished soon :)

@JasonMoho
Collaborator

Closing this as version 0.0.2 has been merged and includes improved preprocessing.
