-
Notifications
You must be signed in to change notification settings - Fork 13
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
unify bootstrap procedures #7
Comments
You certainly won't hear a complaint from me regarding this idea! In the same spirit I guess one would need to bring in a little bit more nesting, going from Another thing that might be useful (although it must be optional because it uses tons of memory) would be to have access to the actual bootstrap samples of the correlation function rather than just the bootstrap of the mean (hence my rename above). |
I think we can close this, it has been (mostly) done in urbach/hadron |
Another issue that I have noticed are the bootstrap parameters in functions. A lot of functions take Perhaps it would be better to use a global option for this instead? Alternatively one could remove the default values everywhere, but that would force users to specify them even when already have bootstrapped the values. |
Though in R there is no problem with missing parameters until one actually uses them. So I think it would be doable to remove the default values and then if a code path with bootstrap is chosen, R would complain about the missing values. |
I also think that in many cases the defaults hurt more than they help, as the code tries to be needlessly helpful. As for "already bootstrapped data", all the objects can tell you if they are bootstrapped or not. As a result, the logic chain should be:
The generalisation to multiple data sets which are individually bootstrapped is relatively straightforward by comparisons of boot.R and boot.l between all data sets. Unfortunately, they way it is set up now, this does not prevent the situation where the samples are actually different even though the parameters are the same. For this one would need to be able to relate all bootstrap samples back to the raw data that they were sampled from. If the raw data has unique indexing for each measurement, this unique indexing will carry through to the bootstrap samples, thus making it impossible that incompatible bootstrap samples are ever combined. Unfortunately, all raw data is presently not endowed with unique IDs... I do believe that this should be done rather soon. It will break many existing analysis codes and data file formats which don't specify a measurement index (configuration number), but the gains in safety warrant breaking backwards-compatibility... |
I agree, we should change this. We should not forget to keep track of
the seed as well...
|
Actually I think that there should only be |
I agree, it's certainly a cleaner solution. However, there is no "if not, not" since most functionality will not work without bootstrap samples. |
I think we should unify all bootstrap methods. Currently its a bit of a mess, and its never clear how the bootstrap data type is called. Maybe we should have
$t
and$t0
everywhere like in theboot
package?The text was updated successfully, but these errors were encountered: