Error in SEVtras.sEV_recognizer #26

janicewu123 · 2024-08-06T16:24:34Z

Dear Author, I have met some troubles when I used SEVtras.sEV_recognize to analysis my own 10x-scRNAseq file of 21 samples. Firstly, my 10x-scRNAseq data was not in standard format:

Therefore, when I run the code (SEVtras.sEV_recognizer(input_path='./',sample_file='./sample2_file',out_path='./outputs',species='Homo',dir_origin=False,predefine_threads=20)), I got this error (KeyError: 2):

So I manually modified the original file of 10x single-cell data, as shown in below:

After completing the modification, I was able to run the program successfully, but soon encountered an error of “ValueError: max() arg is an empty sequence”.

As you suggested in issues #4 and #20, I first adjusted to lower the parameter 'alpha' in 0.09, 0.05, 0.01 and even 0.001, but the error still existed; Considering that the number of samples has reached 21, it should not be the reason for the small number of samples. Therefore, I suspect that the data might be snRNA-seq data, but the sequencing company reports that it is scRNA-seq data (authenticity is questionable), and it is impossible to judge based on the raw data at this time. Do you have any good method to make a preliminary judgment of the two based on 10x original data? In addition, as the sample quality is not high (mt content is too high), it is unknown whether it has an impact on SEVtras analysis? Finally, I am not sure if this problem stems from the way I modified the original 10x file or if there is some other root cause. I would appreciate any insight or suggestions you may have on resolving this error. Thank you very much for your time and help.
In addition, in order to confirm whether there is a problem in my parameter setting, I selected the GSE234527 data set sample for testing again, and the program ran normally and the results could be obtained.

RuiqiaoHe · 2024-08-07T09:17:20Z

Thanks for your kind testing. In my view, the modifications you made will not affect the identification of SEVtras.
Regarding scRNA-seq or snRNA-seq, you can test the expression of genes that should be expressed in the cytoplasm. As the high expression in MT genes, the dataset may come from scRNA-seq.
However, I am wondering if the raw data you input into SEVtras is the output file in the raw_feature_bc_matrix directory. If you are using the filtered ones, it will result in the error you encountered now.

janicewu123 · 2024-08-08T02:17:16Z

Thanks very much for your reply! I rechecked my 10x data and found that the data provided by the company came from the filtered feature bc matrix directory. Thanks again for pointing this out!

janicewu123 closed this as completed Aug 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Error in SEVtras.sEV_recognizer #26

Error in SEVtras.sEV_recognizer #26

janicewu123 commented Aug 6, 2024

RuiqiaoHe commented Aug 7, 2024

janicewu123 commented Aug 8, 2024

Error in SEVtras.sEV_recognizer #26

Error in SEVtras.sEV_recognizer #26

Comments

janicewu123 commented Aug 6, 2024

RuiqiaoHe commented Aug 7, 2024

janicewu123 commented Aug 8, 2024