-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Assessing best avenues for improved date estimation. #166
Comments
This is a great framework, thanks @hyanwong. Here's the status on each:
|
Ah - I forgot we has actually done the perfectly ordered case. Great. We should have somewhere we collect these metrics.
Do we know how much better we do when we don't have ancestral state errors in the simulation? I should have checked this but I'm not sure I did.
Yes, but how much better do we do when we have no errors injected?
Ah. I had read it, but didn't think of it as immediately relevant to using the recombination clock. I should look again. |
I'm nearly complete with an implementation of #7. Turns out there is a straightforward way to do this with the existing API. Would that implementation be of any interest? |
@swamidass, definitely! That sounds fantastic. Feel free to make a pull request with your changes |
Alright. I'll see what I can do. At least consider giving me a middle authorship if it turns out being helpful :) . |
Sounds good. And of course, credit will most certainly go where it's due! |
Let me see about getting a clean implantation with test code to you in the next week or so. If it makes sense to chat then, we can. |
It struck me that we have several different routes for improving dating: some are to do with simply improving the topologies, some to do with the tsdate algorithm, etc. It would be very useful to know where best to focus future efforts. For most potential improvements, we can assess (via simulation) how much of a difference it would make if we completely solved the problem. We should try working out the impact of each of these
n
possible improvements, so we know where to focus our efforts. It may also be that a combination of 2 or more works better than just one. If we have a decent metric for times (e.g. RSMLE or spearman's rho) then we can carry out a number of simulations using different demographic models and random seeds, and inspect ann
xn
matrix of (say) the RSMLE improvement for various combinations.Here's a list of the n possible ways we can think to improve the process (I may have missed some)
The text was updated successfully, but these errors were encountered: