[Feature Request] Beesd to run a full dedup cycle and then end #279
Comments
Just let it run for 1-2 hours. bees is best effort and has no concept of a full cycle: while it modifies the filesystem, it adds new transactions, which in turn generate new cycles. Afterwards, inspect your journal to check whether 1-2 hours is enough. If bees falls behind, you'd need to increase the time. See
In this use case, we are talking about a filesystem that only receives the backup and has no other activity. So bees ends its cycle and all the crawlers finish. No more transactions, no more activity; I can see that in the logs. But I want to automate the backup script so that it unmounts the filesystem. I know it's a niche use case, but I think it's legitimate :)
In this case, it may actually work. How do you watch the logs? Maybe we could implement something like this in the beesd wrapper?
I just look at journalctl until I get something similar to this.
After that, there's no more activity, which is why I think this must be possible. I think adding a flag to the bees binary that makes it quit when it runs out of data would do the trick. I can help with writing a new launcher script or updating the existing one, if needed.
The condition is something like hitting the "ran out of data" condition twice in a row on every crawler, without encountering any new extents in between in any crawler. The trick is that a new extent almost inevitably appears as a result of bees's own activity, so the condition is never met. Maybe something like "fewer than N extents", where N is a command-line option, would work. Somewhere in the issues here, there's a clever script that measures the amount of IO that bees does, and if that drops to zero reads for a few seconds, it terminates the bees process.
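For illustration, the IO-watching idea could be sketched roughly as follows. This is not the script from the other issue, only a minimal sketch under some assumptions: it polls the `read_bytes` counter in `/proc/<pid>/io` (a standard Linux interface, readable by root or the process owner), treats an unchanged counter over several consecutive samples as "idle", and then sends SIGTERM. Whether `read_bytes` captures all of bees's activity, and whether SIGTERM is the right way to stop your bees instance, are assumptions to verify. The function names and default values are hypothetical.

```python
import os
import signal
import time
from typing import Callable, Optional


def parse_read_bytes(proc_io_text: str) -> int:
    """Extract the read_bytes counter from the contents of /proc/<pid>/io."""
    for line in proc_io_text.splitlines():
        key, _, value = line.partition(":")
        if key.strip() == "read_bytes":
            return int(value)
    raise ValueError("read_bytes field not found")


def read_bytes_of(pid: int) -> int:
    """Read the cumulative storage reads of a process (Linux only)."""
    with open(f"/proc/{pid}/io") as f:
        return parse_read_bytes(f.read())


def wait_until_idle(
    sample: Callable[[], int],
    idle_samples: int = 30,
    interval: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
    max_samples: Optional[int] = None,
) -> bool:
    """Return True once the counter is unchanged for idle_samples
    consecutive samples; return False if max_samples is reached first."""
    previous = sample()
    quiet = 0
    taken = 0
    while quiet < idle_samples:
        if max_samples is not None and taken >= max_samples:
            return False
        sleep(interval)
        current = sample()
        taken += 1
        # Any change in the counter resets the quiet streak.
        quiet = quiet + 1 if current == previous else 0
        previous = current
    return True


def stop_when_idle(pid: int, idle_samples: int = 30, interval: float = 1.0) -> None:
    """Send SIGTERM to the process once it has stopped reading.
    Assumption: SIGTERM lets the bees process shut down cleanly."""
    if wait_until_idle(lambda: read_bytes_of(pid), idle_samples, interval):
        os.kill(pid, signal.SIGTERM)
```

A backup script could call `stop_when_idle(pid)` with the PID of the running bees process, wait for it to exit, and then run `umount`. The idle window (`idle_samples` times `interval`) should be generous, since bees may pause briefly between scans before picking up new extents created by its own deduplication.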
Hmm. I can try the script that measures IO activity. Thanks for the idea.
I have a script that creates a backup on an external btrfs hard drive.
Once the backup is done, I would like to have the option to run a dedup cycle to remove data redundancy and then umount the volume.
Currently I must manually monitor the journal and wait for the dedup run to end before unmounting the volume.
Thanks!