After my own manual tests, it seems that after removing the variants in the gnomad VCF that fall within
chr6:32393697-32787282 (the HLA region)
the DNAscope gnomad command finishes, whereas running with the original VCF fails. I believe this region is small enough not to matter in the GENS visualisation, so we could replace the current gnomad VCF with this HLA-filtered one in production, and hopefully that fixes the memory failures.
But does this require us to make a new version of balsamic?
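The filtering step described above can be sketched as follows. This is a minimal illustration, not the command actually used: the file names and the `keep_record`/`filter_vcf` helpers are hypothetical, and the coordinates are the HLA-region bounds quoted in the comment (in production a tool like bcftools would typically do this).

```python
# Hypothetical sketch: drop gnomad VCF records inside the HLA region
# chr6:32393697-32787282 (coordinates from the comment above).

HLA_CHROM = "6"          # contig may be named "chr6" depending on the reference
HLA_START = 32393697
HLA_END = 32787282

def keep_record(line: str) -> bool:
    """Return True if a VCF line should be kept; header lines are always kept."""
    if line.startswith("#"):
        return True
    chrom, pos = line.split("\t", 2)[:2]
    in_hla = (chrom.removeprefix("chr") == HLA_CHROM
              and HLA_START <= int(pos) <= HLA_END)
    return not in_hla

def filter_vcf(in_path: str, out_path: str) -> None:
    """Copy a plain-text VCF, skipping records that fall in the HLA region."""
    with open(in_path) as src, open(out_path, "w") as dst:
        for line in src:
            if keep_record(line):
                dst.write(line)
```

For a bgzipped VCF, an equivalent one-liner (assuming bcftools and the same coordinates) would be something like `bcftools view -t ^chr6:32393697-32787282 gnomad.vcf.gz -Oz -o gnomad.noHLA.vcf.gz`.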
Description
A couple of WGS cases have now been unable to complete the DNAscope gnomad rule, which creates a BAF file used as input to GENS. See deviation: https://github.com/Clinical-Genomics/Deviations/issues/631
The memory issues seem to appear after entering the HLA region on chromosome 6, with a warning like this:
2024-05-07/05:43:07.323620 HCAssembler WARN :skip selectHaplotype 6:32492321-32492798 due to big read number 44 and big hap number 34117160
But strangely, memory usage keeps increasing even after this region has finished processing, and the behaviour is highly sample dependent: the problematic samples cannot finish even with 400 G of memory, whereas other samples finish without ever using more than around 30 G.
How to reproduce
No response
Expected behaviour
No response
Anything else?
No response
Pipeline version
No response