You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello I am trying to process 1000s of CRAM files with EHDN (latest version installed via conda).
While it works fine for bam files the HTSlib seems to struggle to parse CRAM formatted data- which relates to the explanation at the bottom of this page (see "The REF_PATH and REF_CACHE" section): https://www.htslib.org/workflow/cram.html
Indeed the following command:
ExpansionHunterDenovo profile --reads in.cram --reference GRCh38.fa --output-prefix out
results in the error:
[W::find_file_url] Failed to open reference "http://www.ebi.ac.uk/ena/cram/md5/b0397179e5a92bb7a3300b68e45a9f72" Permission denied
....
[E::cram_next_slice] Failure to decode slice
The permission denied error is due to the fact that I am on a closed server and I can't download things onto it..
It has been suggested to me to convert the files to BAM format and then analyse- but this isn't really an option for me as the files that I have are large and there are 1000s of them.... I was hoping for a longer term solution?
./seq_cache_populate.pl -root /some_dir/cache GRCh38.fasta #works fine
export REF_PATH=/some_dir/cache/%2s/%2s/%s:http://www.ebi.ac.uk/ena/cram/md5/%s
export REF_CACHE=/some_dir/cache/%2s/%2s/%s
While the MD5 reference is made correctly, this still results in the same error as the conda installed binary executable seems to have its' own internal environment variables....
Hence I am wondering if I could I possibly edit the code before installing EHDN from source?
Is there some place in the script where I could set the environment variables ($REF_PATH) setting the path to the MD5 sums cache folder?
The text was updated successfully, but these errors were encountered:
Hello I am trying to process 1000s of CRAM files with EHDN (latest version installed via conda).
While it works fine for bam files the HTSlib seems to struggle to parse CRAM formatted data- which relates to the explanation at the bottom of this page (see "The REF_PATH and REF_CACHE" section): https://www.htslib.org/workflow/cram.html
Indeed the following command:
results in the error:
The permission denied error is due to the fact that I am on a closed server and I can't download things onto it..
It has been suggested to me to convert the files to BAM format and then analyse- but this isn't really an option for me as the files that I have are large and there are 1000s of them.... I was hoping for a longer term solution?
I have tried the instructions at https://www.htslib.org/workflow/cram.html
where I use the command
While the MD5 reference is made correctly, this still results in the same error as the conda installed binary executable seems to have its' own internal environment variables....
Hence I am wondering if I could I possibly edit the code before installing EHDN from source?
Is there some place in the script where I could set the environment variables ($REF_PATH) setting the path to the MD5 sums cache folder?
The text was updated successfully, but these errors were encountered: