-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
OSError: [Errno 36] File name too long #29
Comments
Hello I got the same problem , File name too long
and also I find the log file shows
Should I clean similar sequence of rep.fa, or there is something should be changed in the script ? Thank you! |
Hi @MrBleem, based on the error message, it appears that you ran TELR with a TE library that includes multiple identical or highly similar sequences under the same TE family. The current TELR assumes a TE library that contains one TE sequence for each family. I would suggest you to clean up the library and remove redundant sequences before running TELR. For D. melanogaster genome sequencing data, we suggest using v10.2 of the D. melanogaster canonical TE sequence library available at https://github.com/bergmanlab/drosophila-transposons/blob/master/releases/D_mel_transposon_sequence_set_v10.2.fa. Hi @unavailable-2374, thanks for sharing the failed log. Yes, the issue appears to be caused by redundant TE sequences in the library described above. I agree with your intuition to clean redundant/similar sequences and try again. Sorry for not making that clear in the README. Let me know if the issue still exists after the library clean up. Best, |
Oh, thank you! I will try it right now. Best, |
Ok, I will change the TE reference file. Best |
Hello Very unfortunately, I tried to merge the sequences with similarity 85+ in TE_lib with cd-hit and run TELR with the result as a library, but I still get "File name too long" errors. Best, ShuoCao |
Thanks for reporting that the error persists @unavailable-2374. I will try to push a code update later this week to have this issue fixed. |
Thanks! |
Hi @unavailable-2374 and @MrBleem, I have merged a fix into the master branch on the "file name too long" error (see #30). Please pull the latest commit from the master branch, reinstall TELR locally, and see if this new version can resolve the issue. Thanks! cd TELR # this is the TELR git folder
git pull # necessary to use this patched update
conda activate TELR
pip install . # install this patched update
# run TELR
telr -h Best, |
Hi Shunhua Thank, I will try it immediately! Shuo |
Hi Shunhua Unfortunately, what if I said that the error still exists?
Also reported in the additional .err file:
|
The error message indicates that the new TELR version was not being used in the new run. Could you double check that you have locally pip installed the new version of TELR after
If you previously installed TELR through conda/mamba. You would notice that the telr version is the following from bioconda channel when listing out all the dependencies using
After you pip install TELR locally, you would see that the TELR version is now from pip when you check the environment dependencies.
Hopefully this helps confirm the new pip version of TELR has been installed successfully. Let me know if you still encounter issues. Sorry about the trouble. Best, |
Hi @shunhuahan
I have double checked this, sorry that the problem still here. Best, |
I realized the previous update only partially fixes the issue. The error you posted in #29 (comment) and #29 (comment) should be gone already with the previous update at #30. The most recent error message is caused by the intermediately JSON file containing too many TE family names. I have made another pull request to fix this issue (see #32). Please do Best, |
Hi @MrBleem and @unavailable-2374 We have updated the TELR recipe at bioconda with a new version (v1.1) that includes the fix to the filename problem you identified. You can install the latest version of TELR via bioconda as follows:
Please let us know if this resolves your issue. Best regards, |
Hello!
After get the expected result with the S2R data, I tried run TELR on other fly data.
Then I got the OSError:
OSError: [Errno 36] File name too long: '/home/boxu/temp/telr/result/intermediate_files/input_json/chr2L_1887214_1887214_21128_24591_ID=162;FBgn0000638_FB|ID=163;FBgn0000638_FB|ID=164;FBgn0000638_FB|ID=165;FBgn0000638_FB|ID=166;FBgn0000638_FB|ID=167;FBgn0000638_FB|ID=168;FBgn0000638_FB|ID=169;FBgn0000638_FB|ID=170;FBgn0000638_FB|ID=171;FBgn0000638_FB_input.json'
And I noticed that some insertion annotated with one type of TE but different ID in the result, just like:"ID=12;Tirant|ID=13;Tirant". Is it causing the problem?
Dose it means there is mulitple insertion seq from same TE? I'm confused for that.
Could you help me solve this?
Thank you very much!
The text was updated successfully, but these errors were encountered: