Np add single end r1 r2 mapping dna mode and merge sort split reads by name #1092

nikellepetrillo · 2023-10-10T13:29:46Z

Description

Give your PR a concise yet descriptive title.
Please explain the changes you made here.
Explain the motivation for making this change. What existing problem does the pull request solve?
Mention any issues fixed, addressed, or otherwise related to this pull request, including issue numbers or hard links for issues in other repos.
You can delete these instructions once you have written your PR description.

Checklist

If you can answer "yes" to the following items, please add a checkmark next to the appropriate checklist item(s) and notify our WARP documentation team by tagging either @ekiernan or @kayleemathews in a comment on this PR.

Did you add inputs, outputs, or tasks to a workflow?
Did you modify, delete or move: file paths, file names, input names, output names, or task names?
If you made a changelog update, did you update the pipeline version number?

…and_merge_sort_split_reads_by_name

github-actions · 2023-10-10T13:29:59Z

Remember to squash merge!

github-actions · 2023-10-10T13:30:12Z

Remember to squash merge!

…dna_mode_and_merge_sort_split_reads_by_name' into np_add_single_end_r1_r2_mapping_dna_mode_and_merge_sort_split_reads_by_name

ekiernan

I had a formatting question, but it LGTM!

ekiernan · 2023-10-11T13:22:38Z

beta-pipelines/skylab/m3c/CondensedSnm3C.wdl

+
+        for file in "${R1_files[@]}"; do
+          sample_id=$(basename "$file" ".hisat3n_dna.split_reads.R1.fastq")
+          hisat-3n /cromwell_root/reference/hg38 -q -U ${sample_id}.hisat3n_dna.split_reads.R1.fastq --directional-mapping-reverse --base-change C,T --no-repeat-index --no-spliced-alignment --no-temp-splicesite -t --new-summary --summary-file ${sample_id}.hisat3n_dna_split_reads_summary.R1.txt --threads 11 | samtools view -b -q 10 -o "${sample_id}.hisat3n_dna.split_reads.R1.bam"


Should we split this line up like we do for the parameters for other tools, so that each parameter is on it's own line?

that would probably make it more readable (and consistent with the rest of warp)

yup totally missed this. will reformat!

…and_merge_sort_split_reads_by_name

github-actions · 2023-10-11T14:03:36Z

Remember to squash merge!

jessicaway · 2023-10-11T14:46:06Z

beta-pipelines/skylab/m3c/CondensedSnm3C.wdl

@@ -452,8 +454,11 @@ task Split_unmapped_reads {

        CODE

+        # wait 15 seconds for the files to be written


was this causing issues? weird

yes, the tar command on line 461 kept complaining about the files being changed as it was tarring things up... Im not sure why the tar command would even start up if the previous python command block wasn't finished writing out the files? but the sleep 15 seemed to help

jessicaway

Looks good! Thanks @nikellepetrillo!

jessicaway · 2023-10-11T14:48:16Z

beta-pipelines/skylab/m3c/CondensedSnm3C.wdl

+
+        for file in "${R1_files[@]}"; do
+          sample_id=$(basename "$file" ".hisat3n_dna.split_reads.R1.fastq")
+          hisat-3n /cromwell_root/reference/hg38 -q -U ${sample_id}.hisat3n_dna.split_reads.R1.fastq --directional-mapping-reverse --base-change C,T --no-repeat-index --no-spliced-alignment --no-temp-splicesite -t --new-summary --summary-file ${sample_id}.hisat3n_dna_split_reads_summary.R1.txt --threads 11 | samtools view -b -q 10 -o "${sample_id}.hisat3n_dna.split_reads.R1.bam"


that would probably make it more readable (and consistent with the rest of warp)

jessicaway · 2023-10-11T14:48:56Z

beta-pipelines/skylab/m3c/CondensedSnm3C.wdl

+
+       for file in "${R2_files[@]}"; do
+         sample_id=$(basename "$file" ".hisat3n_dna.split_reads.R2.fastq")
+         hisat-3n /cromwell_root/reference/hg38 -q -U ${sample_id}.hisat3n_dna.split_reads.R2.fastq --directional-mapping --base-change C,T --no-repeat-index --no-spliced-alignment --no-temp-splicesite -t --new-summary --summary-file ${sample_id}.hisat3n_dna_split_reads_summary.R2.txt --threads 11 | samtools view -b -q 10 -o "${sample_id}.hisat3n_dna.split_reads.R2.bam"


Same here (split the command over multiple lines for readability and consistency)

…dna_mode_and_merge_sort_split_reads_by_name' into np_add_single_end_r1_r2_mapping_dna_mode_and_merge_sort_split_reads_by_name

kevinpalis

Looks great!

github-actions · 2023-10-11T17:42:03Z

Remember to squash merge!

…and_merge_sort_split_reads_by_name

github-actions · 2023-10-11T17:54:11Z

Remember to squash merge!

* Update Loom_schema.md (#1080) * Lk dynamically select barcode orientation (#1086) * Added dynamic barcode selection to the ATAC FastqProcess task * Np pd 2327 separate unmapped reads (#1084) * add separate unmapped reads task * clean up * add splut unmapped reads (#1087) * KM Update metrics descriptions (#1088) * Update metrics descriptions * Update Loom_schema.md * Update Loom_schema.md * Add TagSort and definitions * Km fix broken links in docs (#1090) * Update Terra link address * Fix links to snaptools Docker * Fixed link to SS2 overview * Update link to snM3C test_inputs directory * Fixed link to Multiome test data * Fix link to loom_to_h5ad.py script * Fix link to loom creation script * Fixed links * Lk pd 2363 join barcodes (#1089) Added GEX and ATAC whitelist barcodes to both ATAC and GEX h5ad files so that barcodes that can be joined. * Update UnmappedBamToAlignedBam.wdl (#1070) Changed MarkDuplicates task to use Float size(Array[File], [String]) * Np add single end r1 r2 mapping dna mode and merge sort split reads by name (#1092) * echo whitelistnew task * rearranging * cleaning up * cleaning up * cleaning up * formatting --------- Co-authored-by: aawdeh <[email protected]> * Lk pd 2374 update whitelist (#1093) Updated path of multiome whitelists * Km fix broken links in WARP docs (#1097) * Update Multiome README * Update broken links in docs * Update get-started.md * Update broken bookmarks in docs * Remove the dropna and MATCH columns from JoinBarcodes (#1098) * Remove the dropna and MATCH columns from JoinBarcodes * PD-2379-add_remove_overlap_read_parts_task (#1094) * Added merge_original_and_split_bam_sort_all_reads_name_and_position * Cleaning up a few ls and pwds, adding echos * add task dedup_unique_bam_and_index_unique_bam (#1103) * Km atac h5ad overview (#1102) * Create count-matrix-overview.md * Update count-matrix-overview.md * Add barcode column names Add the column names containing the GEX and ATAC barcodes in the output h5ad files to the Multiome README * Update count-matrix-overview.md * Update README.md * Km doc updates (#1100) * Update links in Optimus Overview doc * Update version numbers in READMEs * Update count-matrix-overview.md * Update count-matrix-overview.md * Add link to SnapATAC2 import_data function * Update count-matrix-overview.md * Update website/docs/Pipelines/ATAC/count-matrix-overview.md Co-authored-by: ekiernan <[email protected]> * Update website/docs/Pipelines/ATAC/count-matrix-overview.md Co-authored-by: ekiernan <[email protected]> * Update website/docs/Pipelines/ATAC/count-matrix-overview.md Co-authored-by: ekiernan <[email protected]> --------- Co-authored-by: ekiernan <[email protected]> * Add unique_reads_allc * Update CondensedSnm3C.wdl * Km add atac rrid and update multiome readme (#1109) * Update Multiome README * Update pipeline version output * Update README.md * Update inputs to CramToUnmappedBam test (#1104) * update paths to input files * Add unique_reads_cgn_extraction task (#1107) --------- Co-authored-by: ekiernan <[email protected]> Co-authored-by: Nikelle Petrillo <[email protected]> Co-authored-by: Kaylee Mathews <[email protected]> Co-authored-by: aawdeh <[email protected]> Co-authored-by: ekiernan <[email protected]> Co-authored-by: Jessica Way <[email protected]>

nikellepetrillo and others added 5 commits October 5, 2023 13:05

echo whitelistnew task

439997c

rearranging

d3401d0

cleaning up

7b75763

cleaning up

3f0ea52

Merge branch 'develop' into np_add_single_end_r1_r2_mapping_dna_mode_…

a0f881e

…and_merge_sort_split_reads_by_name

nikellepetrillo added 2 commits October 10, 2023 11:48

cleaning up

03d750c

Merge remote-tracking branch 'origin/np_add_single_end_r1_r2_mapping_…

c050328

…dna_mode_and_merge_sort_split_reads_by_name' into np_add_single_end_r1_r2_mapping_dna_mode_and_merge_sort_split_reads_by_name

ekiernan approved these changes Oct 11, 2023

View reviewed changes

Merge branch 'develop' into np_add_single_end_r1_r2_mapping_dna_mode_…

d63e3a4

…and_merge_sort_split_reads_by_name

jessicaway reviewed Oct 11, 2023

View reviewed changes

jessicaway approved these changes Oct 11, 2023

View reviewed changes

Merge remote-tracking branch 'origin/np_add_single_end_r1_r2_mapping_…

a31e8be

…dna_mode_and_merge_sort_split_reads_by_name' into np_add_single_end_r1_r2_mapping_dna_mode_and_merge_sort_split_reads_by_name

kevinpalis approved these changes Oct 11, 2023

View reviewed changes

formatting

0dc3862

Merge branch 'develop' into np_add_single_end_r1_r2_mapping_dna_mode_…

09b696c

…and_merge_sort_split_reads_by_name

nikellepetrillo merged commit 53ba51f into develop Oct 11, 2023

nikellepetrillo deleted the np_add_single_end_r1_r2_mapping_dna_mode_and_merge_sort_split_reads_by_name branch October 11, 2023 17:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Np add single end r1 r2 mapping dna mode and merge sort split reads by name #1092

Np add single end r1 r2 mapping dna mode and merge sort split reads by name #1092

nikellepetrillo commented Oct 10, 2023

github-actions bot commented Oct 10, 2023

github-actions bot commented Oct 10, 2023

ekiernan left a comment

ekiernan Oct 11, 2023

jessicaway Oct 11, 2023

nikellepetrillo Oct 11, 2023

github-actions bot commented Oct 11, 2023

jessicaway Oct 11, 2023

nikellepetrillo Oct 11, 2023

jessicaway left a comment

jessicaway Oct 11, 2023

jessicaway Oct 11, 2023

kevinpalis left a comment

github-actions bot commented Oct 11, 2023

github-actions bot commented Oct 11, 2023

		@@ -452,8 +454,11 @@ task Split_unmapped_reads {

		CODE

		# wait 15 seconds for the files to be written

Np add single end r1 r2 mapping dna mode and merge sort split reads by name #1092

Np add single end r1 r2 mapping dna mode and merge sort split reads by name #1092

Conversation

nikellepetrillo commented Oct 10, 2023

Description

Checklist

github-actions bot commented Oct 10, 2023

github-actions bot commented Oct 10, 2023

ekiernan left a comment

Choose a reason for hiding this comment

ekiernan Oct 11, 2023

Choose a reason for hiding this comment

jessicaway Oct 11, 2023

Choose a reason for hiding this comment

nikellepetrillo Oct 11, 2023

Choose a reason for hiding this comment

github-actions bot commented Oct 11, 2023

jessicaway Oct 11, 2023

Choose a reason for hiding this comment

nikellepetrillo Oct 11, 2023

Choose a reason for hiding this comment

jessicaway left a comment

Choose a reason for hiding this comment

jessicaway Oct 11, 2023

Choose a reason for hiding this comment

jessicaway Oct 11, 2023

Choose a reason for hiding this comment

kevinpalis left a comment

Choose a reason for hiding this comment

github-actions bot commented Oct 11, 2023

github-actions bot commented Oct 11, 2023