Name		Name	Last commit message	Last commit date
parent directory ..
a_Transcriptome_assembly		a_Transcriptome_assembly
b_Microexon_annotation		b_Microexon_annotation
c_AS_quantification		c_AS_quantification
d_Analysis		d_Analysis
.DS_Store		.DS_Store
README.md		README.md
StringTie2_MicroExonator_Whippet.png		StringTie2_MicroExonator_Whippet.png

README.md

II. Alternative splicing analyses

To perform a deep characterisation of known and novel alternative splicing events we performed a reference guided transcriptome assemby and splicing quantifiaction inegrating three main computational worflows (StringTie2, MicroExonator and Whippet) and several in-house scripts. Overall desing the data analyses we conducted are represented in the following workflow chart:

Computational Workflow for alternative splicing analyses. The flow diagram shows the steps to conduct alternative splicing analyses using VASA-seq data. To expand the transcriptome annotation, we implemented a Hisat2/StringTie2 based pipeline using Snakemake. This pipeline starts with the deduplication of raw reads using Unique Molecular Identifier (UMI) information. FASTQ files are merged by cell-type and mapped mouse reference genome (mm10) using HISAT2. Resultant alignments for each cell-type are assembled using StringTie2 and subsequently merged into a single GTF file. Assembled transcripts are annotated using gtfcompare and filtered customised parameters designed to avoid false-positive transcripts. The extended transcriptome is stored as a GTF file, which is used to expand further the annotation of splicing events using MicroExonator, which is a separate snakemake pipeline designed to discover novel microexons. The final transcriptome annotation is used to quantify alternative splicing events with Whippetthrough a dedicated MicroExonator's downstream module. To this end, the final extended transcriptome GTF is processed to generate a Contiguous Splice Graph (CSG) index, which enable all downstream splicing profiling steps. Whippet quantifies alternative splicing events by measuring the inclusion of splicing nodes. We quantified the splice node inclusion and isoform abundance across single-cells and pseudo-bulks. Finally, we used cell-type annotations to perform pairwise comparisons of splicing profiles, leading to the detection of differentially included splicing nodes. To this end, we ran whippet-quant across pseudo-bulks randomly sampled from cell types and whippet-delta to assess differential inclusion of splicing nodes. This MicroExonator's module runs Whippet over multiple groups of randomly generated cell-type pseudo-bulks to avoid spurious results due to random arrangements of pseudo-bulks. The results from these computational replicates are post-processed to provide a list of splicing nodes that were robustly detected as differentially included across pairwise comparisons.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

II_Alternative_splicing

II_Alternative_splicing

README.md

II. Alternative splicing analyses

Files

II_Alternative_splicing

Directory actions

More options

Directory actions

More options

Latest commit

History

II_Alternative_splicing

Folders and files

parent directory

README.md

II. Alternative splicing analyses