Reconsider rgsm value in fastq list rows #606

alexiswl · 2023-07-13T11:12:22Z

As highlighted by Stephen before running the validation samples, there exists some cases where the tumor sample id and the normal sample id are the same value.

See Subject SBJ00480 as an example:

This is a bit of a problem.

Some proposed solutions (to be discussed on Tuesday)

Note change of rgsm values wont impact file names as these are determined by the output-prefix parameter for dragen runs.
However it will impact the values of sample names in vcfs and bams.

Replace rgsm and use the library id instead.

The tumor and normal names in vcfs and bams would instead be the library id.
Append _N or _T based on phenotype to the RGSM

This would mean that vcfs and bams would have similar names as existing
Use sample name + library id

Would help if users are relying on the sample name in vcfs and bams for tumor and normal samples.
Nothing at all and ensure that sample names differ between tumors and normals for a given subject.

ohofmann · 2023-07-13T11:21:26Z

Ouch. I can see cases where the same 'sample' was used to generate both tumor and normal DNA (in the sense of a tissue specimen) although in this particular case I would argue that the various mixes should have different sample identifiers - they are distinct mixes of two other samples. I.e., for this specific case I'd change names upstream in the tracking sheet, demultiplex again and process from there.

If we want to future-proof things option 3 feels best to me; the _N/_T suffix will run into issues at some point, but just the library ID could also work.

alexiswl · 2023-07-13T11:23:40Z

I.e., for this specific case I'd change names upstream in the tracking sheet, demultiplex again and process from there.

Given there were other samples on this run, we were considering just updating the labmetadata and the fastq list row rgsm values for these entries. Saves a demultiplex.

ohofmann · 2023-07-13T11:26:56Z

Demultiplexing is stable, I don't mind re-generating all FASTQs if it saves some extra work...

victorskl · 2023-07-13T23:15:07Z

for this specific case I'd change names upstream in the tracking sheet, demultiplex again and process from there.

I would also recommend this and re-convert affected samples. I can help with data cleanup and, re-run SOP at Portal side, if any.

As I am aware, these samples are for validation lot. So, it has future value on re-running on them again to validate analysis pipeline changes, etc.

So, it worth the effort to make it right at best as much, I reckon.

alexiswl · 2023-07-13T23:19:22Z

Also worth noting, as Florian pointed out yesterday, we may come across a situation where the tumor and normal having the same sample id is actually appropriate (same tube, and some magical process separates the too - I don't know what this process is called). So it is inevitable we will have to move away from the current set up at some point.

If you're happy to rerun the demux Victor and manage that then happy, thought it would be easier for you if we just updated the fastq-list-rows

victorskl · 2023-07-13T23:27:20Z

Let me access the situation and impact a bit further, pls...

reisingerf · 2023-07-14T05:04:31Z

If we want to future-proof things option 3 feels best to me

I (strongly) vote for number 1.

ohofmann · 2023-07-14T05:17:54Z

Note that we'd still be running samples in T/N mode with the same library ID at some point (maybe not automatically, but the overall workflow should not fail if we run the same library against its copy).

reisingerf · 2023-07-16T23:32:22Z

against its copy

Huh? A library against itself?

I can't see a reason why the orchestration should have an issue with that, but how would we distinguish between them in workflow results (not sure if all tools would support that)?

We could possibly work out a naming convention, e.g. a second (virtual) library with a different ID pointing to the same FASTQs that could be used as the copy?

ohofmann · 2023-07-16T23:41:01Z

Part of the accreditation process - sequence the same library twice at different coverage levels and analyse, anything called is a false positive. Might ignore that for the portal and co.

alexiswl · 2023-07-19T10:56:06Z

After discussing with Victor, I don't think demultiplexing is the right way forward, for this sample alone we have 204 'libraryrun' entries

SELECT sequencerun.instrument_run_id, limsrow_metadata.phenotype, fastqlistrow.*
FROM silver.limsrow_metadata
INNER JOIN portal.libraryrun
  ON limsrow_metadata.library_id = libraryrun.library_id
INNER JOIN portal.sequencerun
  ON sequencerun.instrument_run_id = libraryrun.instrument_run_id
INNER JOIN portal.fastqlistrow 
  ON libraryrun.library_id = fastqlistrow.rglb
  AND sequencerun.id = fastqlistrow.sequence_run_id
  AND libraryrun.lane = fastqlistrow.lane
WHERE 
  limsrow_metadata.subject_id = 'SBJ00480'
AND
  fastqlistrow.rgsm = 'PTC_TsqN200511'
ORDER BY sequencerun.instrument_run_id

Gives us the following:

Output

instrument_run_id	phenotype	id	rgid	rgsm	rglb	lane	read_1	read_2	sequence_run_id	partition_name
200605_A01052_0016_AH5LYNDSXY	tumor	6278	ACTAAGAT.AACCGCGG.4.200605_A01052_0016_AH5LYNDSXY.PTC_TsqN200511_L2000431	PTC_TsqN200511	L2000431	4	gds://production/primary_data/200605_A01052_0016_AH5LYNDSXY/2022031722413b7c/WGS_TsqNano/PTC_TsqN200511_L2000431_S8_L004_R1_001.fastq.gz	gds://production/primary_data/200605_A01052_0016_AH5LYNDSXY/2022031722413b7c/WGS_TsqNano/PTC_TsqN200511_L2000431_S8_L004_R2_001.fastq.gz	1025	*
200605_A01052_0016_AH5LYNDSXY	normal	6279	GGCATTCT.CTAGCTTG.4.200605_A01052_0016_AH5LYNDSXY.PTC_TsqN200511_L2000437	PTC_TsqN200511	L2000437	4	gds://production/primary_data/200605_A01052_0016_AH5LYNDSXY/2022031722413b7c/WGS_TsqNano/PTC_TsqN200511_L2000437_S9_L004_R1_001.fastq.gz	gds://production/primary_data/200605_A01052_0016_AH5LYNDSXY/2022031722413b7c/WGS_TsqNano/PTC_TsqN200511_L2000437_S9_L004_R2_001.fastq.gz	1025	*
200612_A01052_0017_BH5LYWDSXY	tumor	6056	TCCAACGC.TTGGACTT.4.200612_A01052_0017_BH5LYWDSXY.PTC_TsqN200511_L2000434	PTC_TsqN200511	L2000434	4	gds://production/primary_data/200612_A01052_0017_BH5LYWDSXY/20220317975ade80/WGS_TsqNano/PTC_TsqN200511_L2000434_S4_L004_R1_001.fastq.gz	gds://production/primary_data/200612_A01052_0017_BH5LYWDSXY/20220317975ade80/WGS_TsqNano/PTC_TsqN200511_L2000434_S4_L004_R2_001.fastq.gz	1023	*
200612_A01052_0017_BH5LYWDSXY	tumor	6052	CTTGGTAT.CCAAGTCC.3.200612_A01052_0017_BH5LYWDSXY.PTC_TsqN200511_L2000433	PTC_TsqN200511	L2000433	3	gds://production/primary_data/200612_A01052_0017_BH5LYWDSXY/20220317975ade80/WGS_TsqNano/PTC_TsqN200511_L2000433_S2_L003_R1_001.fastq.gz	gds://production/primary_data/200612_A01052_0017_BH5LYWDSXY/20220317975ade80/WGS_TsqNano/PTC_TsqN200511_L2000433_S2_L003_R2_001.fastq.gz	1023	*
200612_A01052_0017_BH5LYWDSXY	tumor	6058	TTACAGGA.TGACAAGC.4.200612_A01052_0017_BH5LYWDSXY.PTC_TsqN200511_L2000436	PTC_TsqN200511	L2000436	4	gds://production/primary_data/200612_A01052_0017_BH5LYWDSXY/20220317975ade80/WGS_TsqNano/PTC_TsqN200511_L2000436_S6_L004_R1_001.fastq.gz	gds://production/primary_data/200612_A01052_0017_BH5LYWDSXY/20220317975ade80/WGS_TsqNano/PTC_TsqN200511_L2000436_S6_L004_R2_001.fastq.gz	1023	*
200612_A01052_0017_BH5LYWDSXY	tumor	6051	GTCGGAGC.GGTTATAA.3.200612_A01052_0017_BH5LYWDSXY.PTC_TsqN200511_L2000432	PTC_TsqN200511	L2000432	3	gds://production/primary_data/200612_A01052_0017_BH5LYWDSXY/20220317975ade80/WGS_TsqNano/PTC_TsqN200511_L2000432_S1_L003_R1_001.fastq.gz	gds://production/primary_data/200612_A01052_0017_BH5LYWDSXY/20220317975ade80/WGS_TsqNano/PTC_TsqN200511_L2000432_S1_L003_R2_001.fastq.gz	1023	*
200612_A01052_0017_BH5LYWDSXY	tumor	6057	CCGTGAAG.CAGTGGAT.4.200612_A01052_0017_BH5LYWDSXY.PTC_TsqN200511_L2000435	PTC_TsqN200511	L2000435	4	gds://production/primary_data/200612_A01052_0017_BH5LYWDSXY/20220317975ade80/WGS_TsqNano/PTC_TsqN200511_L2000435_S5_L004_R1_001.fastq.gz	gds://production/primary_data/200612_A01052_0017_BH5LYWDSXY/20220317975ade80/WGS_TsqNano/PTC_TsqN200511_L2000435_S5_L004_R2_001.fastq.gz	1023	*
200612_A01052_0017_BH5LYWDSXY	tumor	6053	ACTAAGAT.AACCGCGG.3.200612_A01052_0017_BH5LYWDSXY.PTC_TsqN200511_L2000431_topup	PTC_TsqN200511	L2000431	3	gds://production/primary_data/200612_A01052_0017_BH5LYWDSXY/20220317975ade80/WGS_TsqNano/PTC_TsqN200511_L2000431_topup_S3_L003_R1_001.fastq.gz	gds://production/primary_data/200612_A01052_0017_BH5LYWDSXY/20220317975ade80/WGS_TsqNano/PTC_TsqN200511_L2000431_topup_S3_L003_R2_001.fastq.gz	1023	*
200619_A01052_0018_AH5LYGDSXY	tumor	6004	GTCGGAGC.GGTTATAA.2.200619_A01052_0018_AH5LYGDSXY.PTC_TsqN200511_L2000432_topup	PTC_TsqN200511	L2000432	2	gds://production/primary_data/200619_A01052_0018_AH5LYGDSXY/2022031729d7fed3/WGS_TsqNano/PTC_TsqN200511_L2000432_topup_S13_L002_R1_001.fastq.gz	gds://production/primary_data/200619_A01052_0018_AH5LYGDSXY/2022031729d7fed3/WGS_TsqNano/PTC_TsqN200511_L2000432_topup_S13_L002_R2_001.fastq.gz	1024	*
200619_A01052_0018_AH5LYGDSXY	tumor	6017	GTCGGAGC.GGTTATAA.3.200619_A01052_0018_AH5LYGDSXY.PTC_TsqN200511_L2000432_topup	PTC_TsqN200511	L2000432	3	gds://production/primary_data/200619_A01052_0018_AH5LYGDSXY/2022031729d7fed3/WGS_TsqNano/PTC_TsqN200511_L2000432_topup_S13_L003_R1_001.fastq.gz	gds://production/primary_data/200619_A01052_0018_AH5LYGDSXY/2022031729d7fed3/WGS_TsqNano/PTC_TsqN200511_L2000432_topup_S13_L003_R2_001.fastq.gz	1024	*
200619_A01052_0018_AH5LYGDSXY	tumor	6030	GTCGGAGC.GGTTATAA.4.200619_A01052_0018_AH5LYGDSXY.PTC_TsqN200511_L2000432_topup	PTC_TsqN200511	L2000432	4	gds://production/primary_data/200619_A01052_0018_AH5LYGDSXY/2022031729d7fed3/WGS_TsqNano/PTC_TsqN200511_L2000432_topup_S13_L004_R1_001.fastq.gz	gds://production/primary_data/200619_A01052_0018_AH5LYGDSXY/2022031729d7fed3/WGS_TsqNano/PTC_TsqN200511_L2000432_topup_S13_L004_R2_001.fastq.gz	1024	*
200619_A01052_0018_AH5LYGDSXY	tumor	5991	GTCGGAGC.GGTTATAA.1.200619_A01052_0018_AH5LYGDSXY.PTC_TsqN200511_L2000432_topup	PTC_TsqN200511	L2000432	1	gds://production/primary_data/200619_A01052_0018_AH5LYGDSXY/2022031729d7fed3/WGS_TsqNano/PTC_TsqN200511_L2000432_topup_S13_L001_R1_001.fastq.gz	gds://production/primary_data/200619_A01052_0018_AH5LYGDSXY/2022031729d7fed3/WGS_TsqNano/PTC_TsqN200511_L2000432_topup_S13_L001_R2_001.fastq.gz	1024	*
200624_A01052_0019_BH5L7WDSXY	tumor	6137	GTCGGAGC.GGTTATAA.2.200624_A01052_0019_BH5L7WDSXY.PTC_TsqN200511_L2000432_topup2	PTC_TsqN200511	L2000432	2	gds://production/primary_data/200624_A01052_0019_BH5L7WDSXY/20220317692fb729/WGS_TsqNano/PTC_TsqN200511_L2000432_topup2_S13_L002_R1_001.fastq.gz	gds://production/primary_data/200624_A01052_0019_BH5L7WDSXY/20220317692fb729/WGS_TsqNano/PTC_TsqN200511_L2000432_topup2_S13_L002_R2_001.fastq.gz	1026	*
200624_A01052_0019_BH5L7WDSXY	tumor	6163	GTCGGAGC.GGTTATAA.4.200624_A01052_0019_BH5L7WDSXY.PTC_TsqN200511_L2000432_topup2	PTC_TsqN200511	L2000432	4	gds://production/primary_data/200624_A01052_0019_BH5L7WDSXY/20220317692fb729/WGS_TsqNano/PTC_TsqN200511_L2000432_topup2_S13_L004_R1_001.fastq.gz	gds://production/primary_data/200624_A01052_0019_BH5L7WDSXY/20220317692fb729/WGS_TsqNano/PTC_TsqN200511_L2000432_topup2_S13_L004_R2_001.fastq.gz	1026	*
200624_A01052_0019_BH5L7WDSXY	tumor	6124	GTCGGAGC.GGTTATAA.1.200624_A01052_0019_BH5L7WDSXY.PTC_TsqN200511_L2000432_topup2	PTC_TsqN200511	L2000432	1	gds://production/primary_data/200624_A01052_0019_BH5L7WDSXY/20220317692fb729/WGS_TsqNano/PTC_TsqN200511_L2000432_topup2_S13_L001_R1_001.fastq.gz	gds://production/primary_data/200624_A01052_0019_BH5L7WDSXY/20220317692fb729/WGS_TsqNano/PTC_TsqN200511_L2000432_topup2_S13_L001_R2_001.fastq.gz	1026	*
200624_A01052_0019_BH5L7WDSXY	tumor	6150	GTCGGAGC.GGTTATAA.3.200624_A01052_0019_BH5L7WDSXY.PTC_TsqN200511_L2000432_topup2	PTC_TsqN200511	L2000432	3	gds://production/primary_data/200624_A01052_0019_BH5L7WDSXY/20220317692fb729/WGS_TsqNano/PTC_TsqN200511_L2000432_topup2_S13_L003_R1_001.fastq.gz	gds://production/primary_data/200624_A01052_0019_BH5L7WDSXY/20220317692fb729/WGS_TsqNano/PTC_TsqN200511_L2000432_topup2_S13_L003_R2_001.fastq.gz	1026	*
210331_A01052_0041_BHYMHFDSXY	normal	2444	GGCATTCT.CAAGCTAG.1.210331_A01052_0041_BHYMHFDSXY.PTC_TsqN200511_L2000437_rerun	PTC_TsqN200511	L2000437	1	gds://production/primary_data/210331_A01052_0041_BHYMHFDSXY/2022010540ea2224/WGS_TsqNano/PTC_TsqN200511_L2000437_rerun_S1_L001_R1_001.fastq.gz	gds://production/primary_data/210331_A01052_0041_BHYMHFDSXY/2022010540ea2224/WGS_TsqNano/PTC_TsqN200511_L2000437_rerun_S1_L001_R2_001.fastq.gz	777	*
230602_A00130_0258_BH55TMDSX7	tumor	13801	GTCGGAGC.TTATAACC.4.230602_A00130_0258_BH55TMDSX7.PTC_TsqN200511_L2000432_rerun2	PTC_TsqN200511	L2000432	4	gds://production/primary_data/230602_A00130_0258_BH55TMDSX7/202306044d9dcbd1/WGS_TsqNano/PTC_TsqN200511_L2000432_rerun2_S14_L004_R1_001.fastq.gz	gds://production/primary_data/230602_A00130_0258_BH55TMDSX7/202306044d9dcbd1/WGS_TsqNano/PTC_TsqN200511_L2000432_rerun2_S14_L004_R2_001.fastq.gz	1981	*
230602_A00130_0258_BH55TMDSX7	tumor	13800	ACTAAGAT.CCGCGGTT.4.230602_A00130_0258_BH55TMDSX7.PTC_TsqN200511_L2000431_rerun2	PTC_TsqN200511	L2000431	4	gds://production/primary_data/230602_A00130_0258_BH55TMDSX7/202306044d9dcbd1/WGS_TsqNano/PTC_TsqN200511_L2000431_rerun2_S13_L004_R1_001.fastq.gz	gds://production/primary_data/230602_A00130_0258_BH55TMDSX7/202306044d9dcbd1/WGS_TsqNano/PTC_TsqN200511_L2000431_rerun2_S13_L004_R2_001.fastq.gz	1981	*
230602_A00130_0258_BH55TMDSX7	tumor	13797	ACTAAGAT.CCGCGGTT.3.230602_A00130_0258_BH55TMDSX7.PTC_TsqN200511_L2000431_rerun2	PTC_TsqN200511	L2000431	3	gds://production/primary_data/230602_A00130_0258_BH55TMDSX7/202306044d9dcbd1/WGS_TsqNano/PTC_TsqN200511_L2000431_rerun2_S13_L003_R1_001.fastq.gz	gds://production/primary_data/230602_A00130_0258_BH55TMDSX7/202306044d9dcbd1/WGS_TsqNano/PTC_TsqN200511_L2000431_rerun2_S13_L003_R2_001.fastq.gz	1981	*
230608_A01052_0153_AH53JWDSX7	tumor	13822	CTTGGTAT.GGACTTGG.1.230608_A01052_0153_AH53JWDSX7.PTC_TsqN200511_L2000433_rerun2	PTC_TsqN200511	L2000433	1	gds://production/primary_data/230608_A01052_0153_AH53JWDSX7/2023060984ce8fd3/WGS_TsqNano/PTC_TsqN200511_L2000433_rerun2_S5_L001_R1_001.fastq.gz	gds://production/primary_data/230608_A01052_0153_AH53JWDSX7/2023060984ce8fd3/WGS_TsqNano/PTC_TsqN200511_L2000433_rerun2_S5_L001_R2_001.fastq.gz	1991	*
230608_A01052_0153_AH53JWDSX7	tumor	13827	CTTGGTAT.GGACTTGG.2.230608_A01052_0153_AH53JWDSX7.PTC_TsqN200511_L2000433_rerun2	PTC_TsqN200511	L2000433	2	gds://production/primary_data/230608_A01052_0153_AH53JWDSX7/2023060984ce8fd3/WGS_TsqNano/PTC_TsqN200511_L2000433_rerun2_S5_L002_R1_001.fastq.gz	gds://production/primary_data/230608_A01052_0153_AH53JWDSX7/2023060984ce8fd3/WGS_TsqNano/PTC_TsqN200511_L2000433_rerun2_S5_L002_R2_001.fastq.gz	1991	*
230629_A01052_0154_BH7WF5DSX7	tumor	13948	TCCAACGC.AAGTCCAA.2.230629_A01052_0154_BH7WF5DSX7.PTC_TsqN200511_L2000434_rerun2	PTC_TsqN200511	L2000434	2	gds://production/primary_data/230629_A01052_0154_BH7WF5DSX7/2023063018bf4ea3/WGS_TsqNano/PTC_TsqN200511_L2000434_rerun2_S1_L002_R1_001.fastq.gz	gds://production/primary_data/230629_A01052_0154_BH7WF5DSX7/2023063018bf4ea3/WGS_TsqNano/PTC_TsqN200511_L2000434_rerun2_S1_L002_R2_001.fastq.gz	2034	*
230629_A01052_0154_BH7WF5DSX7	tumor	13949	CCGTGAAG.ATCCACTG.2.230629_A01052_0154_BH7WF5DSX7.PTC_TsqN200511_L2000435_rerun2	PTC_TsqN200511	L2000435	2	gds://production/primary_data/230629_A01052_0154_BH7WF5DSX7/2023063018bf4ea3/WGS_TsqNano/PTC_TsqN200511_L2000435_rerun2_S2_L002_R1_001.fastq.gz	gds://production/primary_data/230629_A01052_0154_BH7WF5DSX7/2023063018bf4ea3/WGS_TsqNano/PTC_TsqN200511_L2000435_rerun2_S2_L002_R2_001.fastq.gz	2034	*
230629_A01052_0154_BH7WF5DSX7	normal	13950	GGCATTCT.CAAGCTAG.2.230629_A01052_0154_BH7WF5DSX7.PTC_TsqN200511_L2000437_rerun3	PTC_TsqN200511	L2000437	2	gds://production/primary_data/230629_A01052_0154_BH7WF5DSX7/2023063018bf4ea3/WGS_TsqNano/PTC_TsqN200511_L2000437_rerun3_S5_L002_R1_001.fastq.gz	gds://production/primary_data/230629_A01052_0154_BH7WF5DSX7/2023063018bf4ea3/WGS_TsqNano/PTC_TsqN200511_L2000437_rerun3_S5_L002_R2_001.fastq.gz	2034	*
230629_A01052_0154_BH7WF5DSX7	normal	13952	GGCATTCT.CAAGCTAG.3.230629_A01052_0154_BH7WF5DSX7.PTC_TsqN200511_L2000437_rerun2	PTC_TsqN200511	L2000437	3	gds://production/primary_data/230629_A01052_0154_BH7WF5DSX7/2023063018bf4ea3/WGS_TsqNano/PTC_TsqN200511_L2000437_rerun2_S4_L003_R1_001.fastq.gz	gds://production/primary_data/230629_A01052_0154_BH7WF5DSX7/2023063018bf4ea3/WGS_TsqNano/PTC_TsqN200511_L2000437_rerun2_S4_L003_R2_001.fastq.gz	2034	*
230629_A01052_0154_BH7WF5DSX7	tumor	13951	TTACAGGA.GCTTGTCA.3.230629_A01052_0154_BH7WF5DSX7.PTC_TsqN200511_L2000436_rerun2	PTC_TsqN200511	L2000436	3	gds://production/primary_data/230629_A01052_0154_BH7WF5DSX7/2023063018bf4ea3/WGS_TsqNano/PTC_TsqN200511_L2000436_rerun2_S3_L003_R1_001.fastq.gz	gds://production/primary_data/230629_A01052_0154_BH7WF5DSX7/2023063018bf4ea3/WGS_TsqNano/PTC_TsqN200511_L2000436_rerun2_S3_L003_R2_001.fastq.gz	2034	*
230707_A00130_0264_BH7WJTDSX7	tumor	14027	CCGTGAAG.ATCCACTG.2.230707_A00130_0264_BH7WJTDSX7.PTC_TsqN200511_L2000435_rerun3	PTC_TsqN200511	L2000435	2	gds://production/primary_data/230707_A00130_0264_BH7WJTDSX7/2023070859f58140/WGS_TsqNano/PTC_TsqN200511_L2000435_rerun3_S3_L002_R1_001.fastq.gz	gds://production/primary_data/230707_A00130_0264_BH7WJTDSX7/2023070859f58140/WGS_TsqNano/PTC_TsqN200511_L2000435_rerun3_S3_L002_R2_001.fastq.gz	2066	*

Summarised as

SELECT sequencerun.instrument_run_id, limsrow_metadata.phenotype, COUNT(DISTINCT fastqlistrow.rglb) as libraries_in_run
FROM silver.limsrow_metadata
INNER JOIN portal.libraryrun
  ON limsrow_metadata.library_id = libraryrun.library_id
INNER JOIN portal.sequencerun
  ON sequencerun.instrument_run_id = libraryrun.instrument_run_id
INNER JOIN portal.fastqlistrow 
  ON libraryrun.library_id = fastqlistrow.rglb
  AND sequencerun.id = fastqlistrow.sequence_run_id
  AND libraryrun.lane = fastqlistrow.lane
WHERE 
  limsrow_metadata.subject_id = 'SBJ00480'
AND
  fastqlistrow.rgsm = 'PTC_TsqN200511'
GROUP BY limsrow_metadata.phenotype, sequencerun.instrument_run_id
ORDER BY sequencerun.instrument_run_id

instrument_run_id	phenotype	libraries_in_run
200605_A01052_0016_AH5LYNDSXY	normal	1
200605_A01052_0016_AH5LYNDSXY	tumor	1
200612_A01052_0017_BH5LYWDSXY	tumor	6
200619_A01052_0018_AH5LYGDSXY	tumor	1
200624_A01052_0019_BH5L7WDSXY	tumor	1
210331_A01052_0041_BHYMHFDSXY	normal	1
230602_A00130_0258_BH55TMDSX7	tumor	2
230608_A01052_0153_AH53JWDSX7	tumor	1
230629_A01052_0154_BH7WF5DSX7	tumor	3
230629_A01052_0154_BH7WF5DSX7	normal	1
230707_A00130_0264_BH7WJTDSX7	tumor	1

alexiswl · 2023-07-19T11:19:18Z

Instead the short-term approach is to update these fastq list rows' rgsm values to contain the sample name and the library id

victorskl · 2023-07-21T09:53:49Z

WGS Accreditation: Assess SEQC dilution series

alexiswl self-assigned this Jul 13, 2023

victorskl added the on hold too fast, temporary on hold, to coordinate changes label Jul 21, 2023

victorskl added triage & trello Triage and migrate into respective Trello project change request and removed on hold too fast, temporary on hold, to coordinate changes labels Aug 1, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reconsider rgsm value in fastq list rows #606

Reconsider rgsm value in fastq list rows #606

alexiswl commented Jul 13, 2023

ohofmann commented Jul 13, 2023

alexiswl commented Jul 13, 2023

ohofmann commented Jul 13, 2023

victorskl commented Jul 13, 2023

alexiswl commented Jul 13, 2023

victorskl commented Jul 13, 2023

reisingerf commented Jul 14, 2023

ohofmann commented Jul 14, 2023

reisingerf commented Jul 16, 2023

ohofmann commented Jul 16, 2023

alexiswl commented Jul 19, 2023 •

edited

Loading

alexiswl commented Jul 19, 2023

victorskl commented Jul 21, 2023

Reconsider rgsm value in fastq list rows #606

Reconsider rgsm value in fastq list rows #606

Comments

alexiswl commented Jul 13, 2023

ohofmann commented Jul 13, 2023

alexiswl commented Jul 13, 2023

ohofmann commented Jul 13, 2023

victorskl commented Jul 13, 2023

alexiswl commented Jul 13, 2023

victorskl commented Jul 13, 2023

reisingerf commented Jul 14, 2023

ohofmann commented Jul 14, 2023

reisingerf commented Jul 16, 2023

ohofmann commented Jul 16, 2023

alexiswl commented Jul 19, 2023 • edited Loading

Summarised as

alexiswl commented Jul 19, 2023

victorskl commented Jul 21, 2023

alexiswl commented Jul 19, 2023 •

edited

Loading