Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Migrate from xz to zstd for dumps compression #3215

Merged
merged 1 commit into from
Mar 7, 2025
Merged

Conversation

amCap1712
Copy link
Member

@amCap1712 amCap1712 commented Mar 6, 2025

Tested on jsonl statistics dumps which are 43G uncompressed, the results are as follows:

xz zstd (-10)
Compression 8.3x 6.7x
Time Taken 87m 3s 33m 12s

xz compresses a little better but zstd is much faster, and I feel the overall tradeoffs are in favour of zstd.

Tested on jsonl statistics dumps which are 43G uncompressed, the results are
as follows:

|             | xz     | zstd (-10) |
|-------------|--------|------------|
| Compression | 8.3x   | 6.7x       |
| Time Taken  | 87m 3s | 33m 12s    |

xz compresses a little better but zstd is much better, and I feel the overall
tradeoffs are in favour of zstd.
@amCap1712 amCap1712 requested a review from mayhem March 6, 2025 13:56
@mayhem
Copy link
Member

mayhem commented Mar 7, 2025

way in favor of zstd!

@amCap1712 amCap1712 merged commit 96cdd01 into stats-dump Mar 7, 2025
2 checks passed
@amCap1712 amCap1712 deleted the zstd-dumps branch March 7, 2025 14:52
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants