Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Short sequences #16

Open
denklewer opened this issue Dec 5, 2022 · 1 comment
Open

Short sequences #16

denklewer opened this issue Dec 5, 2022 · 1 comment

Comments

@denklewer
Copy link

denklewer commented Dec 5, 2022

Hello,
Is it possible to use your library to train HMM for short sequences?
I have some short sequeces (vectors of length from 8 to 25) And want to build the model which will be able to return probability score for each sequence.
All your examples contain 1 big sequence. So I'm wondering if it is possible to use multiple for training?

@lorek
Copy link
Collaborator

lorek commented Dec 12, 2022

We have not implemented training on multiple sequences. Note that we have two different training methods:
a) F^Q -- which relies on co-occurrence matrix
b) F^ML -- which optimises the log-likelihood of observations.
Implementing training on multiple sequences in a) is quite straightforward (it is no effort to compute the co-occurrence matrix then). However, implementing it in b) would require more work (but is doable). Mainly because, by default we sample shorter subseqences.

Of course you can always concatenate your sequences, be since there are short and you have plenty of them -- however a distortion on "connections" could influence everything significantly.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants