Skip to content

Latest commit

 

History

History
24 lines (15 loc) · 1.66 KB

README.md

File metadata and controls

24 lines (15 loc) · 1.66 KB

Overview

This script inputs a fasta file and a text file containing motifs and generates a .png figure that showing where each motif prefers to bind on every gene in the file.

The motif file should contain one motif per line. Motifs can be ambiguous if they follow standard IUPAC rules. One exception to this is that Uracil and Thymine are treated interchangeably here.

Details

Using PyCairo to draw, the script uses object oriented programming to draw motif objects onto gene objects.

Argparse is required here to designate your fasta file and motif file, and the output .png graph will inherit the name of your fasta file. For example, Fig_1.fasta will create Fig_1.png.

There is a small element of randomness involved with graphing. Each motif is colored using a series of three random numbers ranging from (0-1). If the user is unhappy with the way the graph looked, simply rerun the scrip. If the user wants to keep the current color scheme, they can set a seed following these instructions. The y values for each motif binding site are also influenced by a random number to allow viewing of overlapping motifs.

This script can process any number of genes of any length, as well as any number of motifs, though after about 10 different motifs things get messy due to excessive sequence overlap and motif color similarity.

Usage

The user does not need to change any code in the file, simply add the argparse options for the fasta and motif file then run the script. The output figure will be added to the folder containing the script.