Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Command download broken #550

Open
holtgrewe opened this issue Jun 28, 2022 · 1 comment
Open

Command download broken #550

holtgrewe opened this issue Jun 28, 2022 · 1 comment
Labels

Comments

@holtgrewe
Copy link
Member

holtgrewe commented Jun 28, 2022

Describe the bug
There is problematic data with chrM vs chrMT.

To Reproduce

+-> java -jar jannovar-cli/target/jannovar-cli-0.40-SNAPSHOT.jar download -d hg19/refseq
Options
JannovarDownloadOptions [downloadDir=data, getDataSourceFiles()=[bundle:///default_sources.ini], isReportProgress()=true, getHttpProxy()=null, getHttpsProxy()=null, getFtpProxy()=null, geneIdentifiers=[], outputFile=]
Downloading/parsing for data source "hg19/refseq"
INFO Downloading data...
INFO Downloading https://ftp.ncbi.nlm.nih.gov/genomes/archive/old_refseq/H_sapiens/ARCHIVE/ANNOTATION_RELEASE.105/GFF/ref_GRCh37.p13_top_level.gff3.gz
INFO Downloaded file ref_GRCh37.p13_top_level.gff3.gz looks like a valid gzip'ed file
INFO Downloading https://ftp.ncbi.nlm.nih.gov/genomes/archive/old_refseq/H_sapiens/ARCHIVE/ANNOTATION_RELEASE.105/RNA/rna.fa.gz
INFO Downloaded file rna.fa.gz looks like a valid gzip'ed file
INFO Downloading http://hgdownload.soe.ucsc.edu/goldenPath/hg19/database/chromInfo.txt.gz
INFO Downloaded file chromInfo.txt.gz looks like a valid gzip'ed file
INFO Downloading https://ftp.ncbi.nlm.nih.gov/genomes/archive/old_refseq/Homo_sapiens/ARCHIVE/ANNOTATION_RELEASE.105/Assembled_chromosomes/chr_accessions_GRCh37.p13
INFO Downloading https://www.ncbi.nlm.nih.gov/sviewer/viewer.cgi?save=file&db=nuccore&report=fasta&id=251831106
INFO Downloading http://ftp.ebi.ac.uk/pub/databases/genenames/new/tsv/hgnc_complete_set.txt
INFO Building ReferenceDictionary...
Exception in thread "main" java.lang.IllegalArgumentException: Multiple entries with same key: 25=16569 and 25=16571
        at com.google.common.collect.ImmutableMap.checkNoConflict(ImmutableMap.java:190)
        at com.google.common.collect.RegularImmutableMap.checkNoConflictInKeyBucket(RegularImmutableMap.java:109)
        at com.google.common.collect.RegularImmutableMap.fromEntryArray(RegularImmutableMap.java:95)
        at com.google.common.collect.ImmutableMap$Builder.build(ImmutableMap.java:357)
        at de.charite.compbio.jannovar.data.ReferenceDictionaryBuilder.build(ReferenceDictionaryBuilder.java:115)
        at de.charite.compbio.jannovar.impl.parse.ReferenceDictParser.parse(ReferenceDictParser.java:139)
        at de.charite.compbio.jannovar.datasource.JannovarDataFactory.build(JannovarDataFactory.java:101)
        at de.charite.compbio.jannovar.cmd.download.DownloadCommand.run(DownloadCommand.java:43)
        at de.charite.compbio.jannovar.Jannovar.main(Jannovar.java:74)

Expected behavior
The download and build should succeed.

Additional context
N/A

@holtgrewe holtgrewe added the bug label Jun 28, 2022
@brettva
Copy link

brettva commented Aug 2, 2023

@holtgrewe did you ever find a solution?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants