HMM CpG island finder lab

Task 1

Your first task is to hand-tune a basic CpG island finder. Our state space will have two states: CpG island and background. Our emission model will be very simple (simpler than other HMMs that model each nucleotide: instead of modeling individual nucletides, we'll model dinucleotides. So our model considers a sequence of dinucleotides, which have either two possibilities, either it is a CpG, or it is not. Thus, the input sequence is a string of 0 and 1, where 1 represents a CpG and 0 represents anything else.

In the file cgi_hmm.py you'll find:

a simple HMM model for a CpG island finder.
a function to encode a regular DNA sequence into the 0/1 dinucleotide representation
a function to visualize a dinucleotide sequence together with a state sequence output from the HMM.

If you run the code, you can see the output of the viterbi parse, showing where the predicted islands are. Unfortunately, this model is not working because the parameters have been initialized randomly. Your task is to think about initiation, emission, and transition probabilities, and through trial-and-error, adjust the parameters to allow the model to make a reasonable segmentation of the chunk.fa sequence.

To complete the assignment, provide 2 things (7 points):

Your final code where you've parameterized the model by hand.
The output plot (produced by the cgi_plot function) showing your viterbi parse on the sequence in chunk.fa.

Task 2

Answer the following question (3 points):

Describe the 3 different broad applications/problems an HMM can be applied to, and name an algorithm that is used to solve each one.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
README.md		README.md
cgi_hmm.py		cgi_hmm.py
chunk.fa		chunk.fa

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

HMM CpG island finder lab

Task 1

Task 2

About

Releases

Packages

Languages

uvacobi/hmm_lab

Folders and files

Latest commit

History

Repository files navigation

HMM CpG island finder lab

Task 1

Task 2

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages