-
Notifications
You must be signed in to change notification settings - Fork 180
Questions about edm-multi #21
Comments
Thanks for your interest in our work. Below are the answers to your
Sorry for the delayed response, but hopefully I was able to answer your On Wed, Feb 17, 2016 at 1:31 PM, Eric Bunch [email protected]
|
Thanks for the response. With regards to #1, that makes more sense with how I was understanding the algorithm. So there is no way to make use of the significance measure described in the paper for multiple breakouts? Would it be possible to determine the bestStat value for each point and each segmentation, and then do some sort of filtering process based on the min_size and some sort of tolerance parameter? Or is that approach too naive? I think the answer to 2 makes sense. I think maybe I'm overthinking it, but I'll give it some time to digest. So the penalization term Another question I'm having is will the left segment ever be the union of two disjoint intervals? The reason I'm asking that is on lines 47 and 48 the values By the way, I really enjoyed reading through the paper (I'm assuming you are the author Nicholas on that), and think the method is really cool. And the package you've written is great. I don't know how the multi version works quite yet, but it works really well! |
I'm so sorry about the delay on getting back to your latest questions. For your questions about beta*G(number[t]) it makes sense that the function Actually, Z[0], Z[1], ... , Z[prev[t]] - 1 are removed from the left tree. Hope that helps. Once again sorry for the delay. On Fri, Feb 19, 2016 at 2:55 PM, Eric Bunch [email protected]
|
I've been going through the code for edm-multi.cpp, and I some general confusion about the algorithm being carried out. I'll keep my questions limited at first so that perhaps the answers to them can alleviate my confusion on the rest.
F[t]
term in the definition of tmp. It seems like this would have an unwanted accumulative effect on the statistic.*G = Quadratic
, breakouts that occur earlier in the time series are favored more than those that occur later? It seems like the more breakouts that have been observed, the more this term will penalize the tmp statistic.The text was updated successfully, but these errors were encountered: