Name		Name	Last commit message	Last commit date
parent directory ..
data		data
envs		envs
out		out
scripts		scripts
LICENSE.md		LICENSE.md
README.md		README.md
Snakefile		Snakefile
cluster.json		cluster.json
config.yaml		config.yaml
submit_snakemake.sh		submit_snakemake.sh

README.md

Configuration

The file config.yaml contains the configuration:

gaps:  "data/hg38.gaps.bed"
genome: "/data/genomes/hg38/hg38.fa"
maxpeaks: 100
peak_dir: "data/remap.test"
pfm_dir: "data/pfm.test"
reference: "gimme.vertebrate.v5.0"

You will need to have the hg38 genome FASTA file available (shameless plug: use genomepy). The peak_dir is set to data/remap.test and the pfm_dir to data/pfm.test. These are small data sets to check this to see if the workflow works. The variable maxpeaks selects the number of peaks to use (we used 5000 in the manuscript). Use the script scripts/download_remap_peaks.sh to download all the remap peaks to the directory data/remap. By setting pfm_dir to data/pfm all motif files named *.pfm in that directory will be included for comparison. The reference determines which motif database will be used as a reference for figure1a.png.

When the workflow is finished, the directory out/ will contain several final.*.txt files that contain all the metrics.

Run

Install snakemake and gimmemotifs using conda:

conda create -n db_comparison python=3 snakemake gimmemotifs

Activate the environment:

conda activate db_comparison

Dry run:

snakemake -n

Full run:

snakemake --use-conda -j 12 --resources mem_mb=12000

Change the -j 12 to your preferred number of cores and --resources mem_mb=12000 to change the available memory (in MBs). See cluster.json and the script submit_snakemake.sh for an example on how to run the workflow on a cluster (SLURM in this case).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

motif_database_comparison

motif_database_comparison

README.md

Configuration

Run

Files

motif_database_comparison

Directory actions

More options

Directory actions

More options

Latest commit

History

motif_database_comparison

Folders and files

parent directory

README.md

Configuration

Run