-
Notifications
You must be signed in to change notification settings - Fork 1
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #42 from ctg-lund/add-docs
Add docs
- Loading branch information
Showing
20 changed files
with
269 additions
and
148 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,91 @@ | ||
# Introduction | ||
This guide is to describe how to process data from the CiteSeq protocol. Using antibodies to tag your cells, which can help the researcher to detect things such as cell surface proteins, or pool multiple cell types together (Antibodybody Derived Tags (ADTs) or HashTag Oligonucleotides (HTOs) respectively). | ||
|
||
At it's complicated it consists of 3 modalities(Gene Expression + ADTs + HTOs). This page aim to describe how we process it here at CTG. | ||
|
||
## Feature reference (ADT) | ||
|
||
The feature reference should be provided by the lab which did the experiments. How a feature reference is constructed is [described here.](https://support.10xgenomics.com/single-cell-gene-expression/software/pipelines/latest/using/feature-bc-analysis) | ||
|
||
The feature reference is the file which describes these following things fo each ADT: | ||
* id : Unique ID used to track feature counts. May only include ASCII characters and must not use whitespace, slash, quote, or comma characters. Each ID must be unique and must not collide with a gene identifier from the transcriptome | ||
* name : Human-readable name for this feature. May only include ASCII characters and must not use whitespace, slash, quote, or comma characters. This name will be displayed in the Loupe Browser Active Feature list | ||
* read : Specifies which RNA sequencing read contains the Feature Barcode sequence. Must be R1 or R2. Note: in most cases R2 is the correct read | ||
* pattern : Specifies how to extract the Feature Barcode sequence from the read | ||
* sequence : Nucleotide barcode sequence associated with this feature. E.g., antibody barcode or sgRNA protospacer sequence. | ||
* feature_type : Type of the feature. See the Library/Feature Types section for details on the allowed values for this field. FASTQ data specified in the Library CSV file with a library_type that matches the feature_type will be scanned for occurrences of this feature. Each feature type in the feature reference must match a library_type entry in the Libraries CSV file. This field is case-sensitive. | ||
|
||
Example: | ||
|
||
|id |name |read|pattern|sequence |feature_type | | ||
|-----|-----|----|-------|---------------|--------------------| | ||
|A0001|Ms.CD4|R2 |5P(BC) |AACAAGACCCTTGAG|Antibody Capture | | ||
|A0002|Ms.CD8a|R2 |5P(BC) |TACCCGTAATAGCGT|Antibody Capture | | ||
|A0003|Ms.CD366|R2 |5P(BC) |ATTGGCACTCAGATG|Antibody Capture | | ||
|A0004|Ms.CD279|R2 |5P(BC) |GAAAGTCAAAGCACT|Antibody Capture | | ||
|A0013|Ms.Ly.6C|R2 |5P(BC) |AAGTCGTGAGGCATG|Antibody Capture | | ||
|A0014|HuMs.CD11b|R2 |5P(BC) |TGAAGGCTCATTTGT|Antibody Capture | | ||
|A0015|Ms.Ly.6G|R2 |5P(BC) |ACATTGACGCAACTA|Antibody Capture | | ||
|A0070|HuMs.CD49f|R2 |5P(BC) |TTCCGAGGATGATCT|Antibody Capture | | ||
|A0073|HuMs.CD44|R2 |5P(BC) |TGGCTTCAGGTCCTA|Antibody Capture | | ||
|
||
## CMO-Set (HTOs) | ||
If you want to use cellranger multi to demultiplex the HTOs, write a cmo file like this. Otherwise they can be specified as ADT and demultiplexed with the Seurats HTO demux. | ||
CMO_reference file: | ||
|
||
|id |name |read|pattern|sequence |feature_type | | ||
|-----|-----|----|-------|---------------|--------------------| | ||
|HTO1 |HTO1 |R2 |5P(BC) |ACCCACCAGTAAGAC|Multiplexing Capture| | ||
|HTO2 |HTO2 |R2 |5P(BC) |GGTCGAGAGCATTCA|Multiplexing Capture| | ||
|HTO3 |HTO3 |R2 |5P(BC) |CTTGCCGCATGTCAT|Multiplexing Capture| | ||
|HTO6 |HTO6 |R2 |5P(BC) |TATGCTGCCACGGTA|Multiplexing Capture| | ||
|HTO11|HTO11|R2 |5P(BC) |GCTTACCGAATTAAC|Multiplexing Capture| | ||
|HTO12|HTO12|R2 |5P(BC) |CTGCAAATATAACGG|Multiplexing Capture| | ||
|
||
## Config | ||
The csv file which cellranger multi takes as input | ||
```yaml | ||
[gene-expression] | ||
reference,/path/to/refdata-gex-mm10-2020-A | ||
cmo-set,/path/to/CMO_reference.csv | ||
|
||
[feature] | ||
reference,/path/to/HB_feature_ref.csv | ||
|
||
[libraries] | ||
fastq_id,fastqs,feature_types | ||
230316_HB_1,/path/to/0_fastq,Gene Expression | ||
230316_HB_1_ADT,/path/to/0_fastq,Antibody Capture | ||
230316_HB_1_HTO,/path/to/0_fastq,Multiplexing Capture | ||
|
||
[samples] | ||
sample_id,cmo_ids | ||
sample1,HTO1 | ||
sample2,HTO2 | ||
sample3,HTO3 | ||
sample4,HTO6 | ||
sample5,HTO11 | ||
sample6,HTO12 | ||
``` | ||
How to create a CMO-set is described more in [detail here.](https://support.10xgenomics.com/single-cell-gene-expression/software/pipelines/latest/using/multi) | ||
## Cellranger multi for HTO + anything else | ||
`singularity run --bind /root/ /path/to/cellranger.simg cellranger multi --id=230316_BM --csv=/path/to/BM_config.csv` | ||
## ADT + GEX only | ||
libraries.csv: | ||
``` | ||
fastqs,sample,library_type | ||
/path/to/fastq/GEX_Sample,GEX_Sample,Gene Expression | ||
/path/to/fastq/ADT_Sample,GEX_Sample,Antibody Capture | ||
``` | ||
command for running: | ||
``` | ||
singularity run --bind /projects/ /path/to/cellranger.simg cellranger count \ | ||
--id=Ram_014 \ | ||
--libraries=/path/to/libraries.csv \ | ||
--transcriptome=/path/to/refdata-gex-GRCh38-2020-A \ | ||
--feature-ref=/path/to/TotalSeq_A_Human_Universal_Cocktail_V1_399907_Antibody_reference_UMI_counting_CellRanger.csv \ | ||
--localmem=140 \ | ||
--jobmode=local \ | ||
--localcores=24 | ||
``` | ||
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,50 @@ | ||
# Basic run instructions: | ||
The fixation uses cellranger multi. Example: | ||
``` | ||
singularity run --bind /projects/ /path/to/cellranger.simg cellranger multi \ | ||
--id=sample_id \ | ||
--csv=/path/to/config.csv | ||
``` | ||
|
||
# Config files: | ||
For single plexed fixation samples: | ||
config.csv | ||
``` | ||
[gene-expression] | ||
reference,/path/to/refdata-gex-GRCh38-2020-A | ||
probe-set,/path/to/Chromium_Human_Transcriptome_Probe_Set_v1.0_GRCh38-2020-A.csv | ||
[libraries] | ||
fastq_id,fastqs,feature_types | ||
sample_01_fix,/path/to/fastq,Gene Expression | ||
``` | ||
For multiplexed samples: | ||
config.csv | ||
``` | ||
[gene-expression] | ||
reference,/path/to/refdata-gex-GRCh38-2020-A | ||
probe-set,/path/to/Chromium_Human_Transcriptome_Probe_Set_v1.0_GRCh38-2020-A.csv | ||
[libraries] | ||
fastq_id,fastqs,feature_types | ||
221117_Milladur,/path/to/fastq/sample_id/,Gene Expression | ||
[samples] | ||
sample_id,probe_barcode_ids | ||
sample1,BC001 | ||
sample2,BC002 | ||
sample3,BC003 | ||
sample4,BC004 | ||
sample5,BC005 | ||
sample6,BC006 | ||
sample7,BC007 | ||
sample8,BC008 | ||
sample9,BC009 | ||
sample10,BC010 | ||
sample11,BC011 | ||
sample12,BC012 | ||
sample13,BC013 | ||
sample14,BC014 | ||
sample15,BC015 | ||
``` |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,32 @@ | ||
# VDJ only | ||
``` | ||
cellranger vdj --id=sample345 \ | ||
--reference=/opt/refdata-cellranger-vdj-GRCh38-alts-ensembl-7.1.0 \ | ||
--fastqs=/home/jdoe/runs/HAWT7ADXX/outs/fastq_path \ | ||
``` | ||
|
||
|
||
# VDJ + GEX | ||
You will need to construct a config.csv file like | ||
``` | ||
[gene-expression] | ||
reference,/path/to/transcriptome | ||
[vdj] | ||
reference,/path/to/vdj_reference | ||
[libraries] | ||
fastq_id,fastqs,feature_types | ||
GEX_fastqs_id,/path/to/GEX_fastqs,Gene Expression | ||
VDJ_B_fastqs_id,/path/to/vdj_B_fastqs,VDJ-B | ||
VDJ_T_fastqs_id,/path/to/vdj_T_fastqs,VDJ-T | ||
``` | ||
Where VDJ-T/B is set manually as opposed to cellranger vdj where it autodetects Antigens. | ||
Then run with: | ||
``` | ||
cellranger multi --id=<sample-id> --csv=/path/to/config.csv | ||
``` | ||
# Reference names | ||
I haven't changed the names provided by 10X standard refernces. These are the names. | ||
* Human: refdata-cellranger-vdj-GRCh38-alts-ensembl-7.1.0 | ||
* Mouse: refdata-cellranger-vdj-GRCm38-alts-ensembl-7.0.0 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,48 @@ | ||
# Introduction | ||
This is a document describing how to run spaceranger. The document will be continuously updated while the pipeline matures. | ||
|
||
# Spaceranger Count | ||
**Required input files** | ||
* Fastq-files | ||
* Cytassist image | ||
* Microscope image | ||
* Slide area | ||
* Slide .gpr file (This will need to be downloaded) | ||
|
||
|
||
**Count sbatch script:** | ||
```bash | ||
#!/bin/bash -ue | ||
#SBATCH -c 24 | ||
#SBATCH -t 12:00:00 | ||
#SBATCH --mem 124G | ||
#SBATCH -J sample | ||
#SBATCH -o /path/to/out/sample.out | ||
#SBATCH -e /path/to/err/sample.err | ||
|
||
singularity run --bind /projects/ \ | ||
/path/to/spaceranger.simg \ | ||
spaceranger count \ | ||
--id="Visium_FFPE_Human" \ | ||
--transcriptome=/path/to/refdata-gex-GRCh38-2020-A \ | ||
--probe-set=/path/to/Visium_Human_Transcriptome_Probe_Set_v2.0_GRCh38-2020-A.csv \ | ||
--fastqs=/path/to/sample/fastq \ | ||
--sample=sample \ | ||
--image=/path/to/CytAssist_HighRes_Sample-A-index-D1.tif \ | ||
--slide=V42L13-392 \ | ||
--slidefile=/path/to/V42L13-392.gpr \ | ||
--area=A1 \ | ||
--cytaimage=/path/to/CytAssist_LowRes_Sample-A-Index-D1.tif \ | ||
``` | ||
|
||
# Glossary | ||
**Visium CytAssist Instrument**: An instrument that mediates the tissue permeabilization to release the ligation products from tissues on standard glass slide for capture by spatially barcoded oligonucleotides within each Capture Area on the Visium slide. It also captures the image of the tissue section on the Visium slide. | ||
|
||
**CytAssist Captured Image (or CytAssist Image)**: A low resolution brightfield image in TIFF format that is captured by the CytAssist of the eosin stained tissue section on the CytAssist slide inside the instrument. This image contains the fiducial frame. | ||
|
||
**Microscope Image**: A high resolution brightfield or fluorescence image of the tissue section on the standard glass slide captured by a microscope. This image does not contain the fiducial frame. | ||
|
||
**CytAssist Spatial Gene Expression Slide, 6.5 mm**: Visium Spatial Gene Expression slide for use with CytAssist instrument with two capture areas each with dimensions of 6.5 mm by 6.5 mm. The spots within the capture area on these slides contain specialized oligos for capturing poly-adenylated mRNA tags. These slides have serial numbers starting with "V4". | ||
|
||
**CytAssist Spatial Gene Expression Slide, 11 mm**: Visium Spatial Gene Expression slide for use with CytAssist instrument with two capture areas each with dimensions 11 mm by 11 mm. The number of spots within the capture area on these slides are ~3x higher compared to the 6.5 mm capture areas and contain specialized oligos for capturing poly-adenylated mRNA tags. These slides have serial numbers starting with "V5". | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.