Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The high cost-effectiveness of columnar storage. #715

Merged
merged 21 commits into from
Aug 30, 2023
Merged
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion docs/MatrixOne/Overview/feature/cost-effective.md
Original file line number Diff line number Diff line change
Expand Up @@ -48,7 +48,7 @@ MatrixOne 是专门设计用来解决混合负载问题的数据库。MatrixOne
1. 自由压缩算法:每一列单独存放,不同列自由选择其数据结构最适用的压缩算法。这些算法可以利用列数据的特点,例如重复值、有序性和数据类型特定的压缩技术,以获得更好的压缩效果。相比之下,传统的行存储数据库使用的压缩算法通常更加通用,无法充分利用列数据的特点。
2. 列内数据高冗余:列存数据库将相同列中的数据存储在一起,这样相似的值就会聚集在一起,冗余度更高,使得压缩算法能够更有效地识别和压缩相同或相似的数据项,从而提高压缩比。

MatrixOne 总体的数据压缩率最低可达 10%,具体压缩率受实际数据的数据结构、数据冗余程度等多因素影响。
MatrixOne 总体的数据压缩率最低可达 1%,具体压缩率受实际数据的数据结构、数据冗余程度等多因素影响。

总的来说,MatrixOne 的列式存储方式将极大的压缩您的实际存储空间,极大的降低您的存储成本。

Expand Down