[CH] Adaptive sort memory controll and support memory sort shuffle #5893

liuneng1994 · 2024-05-28T08:12:52Z

What changes were proposed in this pull request?

优化了排序阶段的spill机制，并且根据spark task的offheap内存配置自动调整相关参数，保证在高列数（>300）的情况下稳定运行。目前能够稳定运行 300列，300+GB高压缩比 parquet分区表导入任务,不需要额外配置参数。
实现了新的排序shuffle机制，降低了排序过程中的合并开销，同时在celeborn下不再需要数据落盘。

新增加shuffle算法的切换逻辑，当列数超过一定数量，或者分区数超过300时，切换为MemorySortShuffle

How was this patch tested?

unit tests

(If this patch involves UI changes, please attach a screenshot; otherwise, remove this)

github-actions · 2024-05-28T08:13:20Z

Thanks for opening a pull request!

Could you open an issue for this pull request on Github Issues?

https://github.com/apache/incubator-gluten/issues

Then could you also rename commit message and pull request title in the following format?

[GLUTEN-${ISSUES_ID}][COMPONENT]feat/fix: ${detailed message}

See also:

Other pull requests

github-actions · 2024-05-28T08:13:25Z

Run Gluten Clickhouse CI

github-actions · 2024-05-28T09:19:02Z

Run Gluten Clickhouse CI

github-actions · 2024-05-29T02:47:20Z

Run Gluten Clickhouse CI

zhanglistar · 2024-05-29T04:12:58Z

cpp-ch/local-engine/Common/CHUtil.cpp

+        if (!backend_conf_map.contains(CH_RUNTIME_SETTINGS_PREFIX + "prefer_external_sort_block_bytes"))
+        {
+            auto mem_gb = task_memory / static_cast<double>(1_GiB);
+            // 2.8x+5, Heuristics calculate the block size of external sort, [8,16]


Just curious, how to get this formula?

大概定了一下 1G 8M 2G 10M 3G 14M 4G 16M四个数据点然后做了一个线性回归，数据点是跟据测试效果大致选择的

github-actions · 2024-05-29T07:00:58Z

Run Gluten Clickhouse CI

github-actions · 2024-05-29T10:32:54Z

Run Gluten Clickhouse CI

baibaichen

LGTM

github-actions · 2024-05-30T02:16:58Z

Run Gluten Clickhouse CI

github-actions · 2024-05-30T02:18:06Z

Run Gluten Clickhouse CI

GlutenPerfBot · 2024-05-30T06:16:08Z

===== Performance report for TPCH SF2000 with Velox backend, for reference only ====

query	log/native_5893_time.csv	log/native_master_05_29_2024_588faae35_time.csv	difference	percentage
q1	35.86	33.56	-2.303	93.58%
q2	24.15	23.85	-0.301	98.75%
q3	38.94	37.37	-1.565	95.98%
q4	31.11	32.42	1.311	104.21%
q5	69.37	69.83	0.463	100.67%
q6	7.55	7.54	-0.008	99.90%
q7	85.36	81.75	-3.603	95.78%
q8	85.51	86.57	1.052	101.23%
q9	119.92	118.35	-1.574	98.69%
q10	45.49	44.16	-1.330	97.08%
q11	20.66	22.23	1.567	107.59%
q12	27.24	26.72	-0.517	98.10%
q13	54.96	53.60	-1.356	97.53%
q14	21.72	17.59	-4.128	80.99%
q15	29.32	32.89	3.573	112.19%
q16	14.20	13.41	-0.797	94.39%
q17	102.99	103.62	0.633	100.61%
q18	147.18	144.53	-2.646	98.20%
q19	13.61	13.59	-0.023	99.83%
q20	30.06	29.66	-0.408	98.64%
q21	265.91	260.43	-5.486	97.94%
q22	12.05	13.84	1.788	114.84%
total	1283.16	1267.50	-15.658	98.78%

liuneng1994 force-pushed the adaptive-memory-controll branch from c32ce51 to cf82dbf Compare May 28, 2024 09:18

zhanglistar reviewed May 29, 2024

View reviewed changes

baibaichen force-pushed the adaptive-memory-controll branch from a546bbe to ca29456 Compare May 29, 2024 10:32

baibaichen approved these changes May 30, 2024

View reviewed changes

liuneng1994 closed this May 30, 2024

liuneng1994 reopened this May 30, 2024

liuneng1994 added 9 commits May 30, 2024 10:17

optimize sort and shuffle

5611115

change block size config

e901991

support memory sort local shuffle

6a2b471

fix bug

d82ada5

support memory sort shuffle

008326d

update ch version

e170798

fix check style

dc9a17d

fix bug

7bb907b

fix bug

ec11bdc

liuneng1994 force-pushed the adaptive-memory-controll branch from ca29456 to ec11bdc Compare May 30, 2024 02:17

baibaichen merged commit d35d1dc into apache:main May 30, 2024
40 checks passed

baibaichen mentioned this pull request Nov 16, 2024

Optimize sort spill Kyligence/ClickHouse#490

Merged

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CH] Adaptive sort memory controll and support memory sort shuffle #5893

[CH] Adaptive sort memory controll and support memory sort shuffle #5893

liuneng1994 commented May 28, 2024 •

edited

Loading

github-actions bot commented May 28, 2024

github-actions bot commented May 28, 2024

github-actions bot commented May 28, 2024

github-actions bot commented May 29, 2024

zhanglistar May 29, 2024 •

edited

Loading

liuneng1994 May 29, 2024 •

edited

Loading

github-actions bot commented May 29, 2024

github-actions bot commented May 29, 2024

baibaichen left a comment

github-actions bot commented May 30, 2024

github-actions bot commented May 30, 2024

GlutenPerfBot commented May 30, 2024

[CH] Adaptive sort memory controll and support memory sort shuffle #5893

[CH] Adaptive sort memory controll and support memory sort shuffle #5893

Conversation

liuneng1994 commented May 28, 2024 • edited Loading

What changes were proposed in this pull request?

How was this patch tested?

github-actions bot commented May 28, 2024

github-actions bot commented May 28, 2024

github-actions bot commented May 28, 2024

github-actions bot commented May 29, 2024

zhanglistar May 29, 2024 • edited Loading

Choose a reason for hiding this comment

liuneng1994 May 29, 2024 • edited Loading

Choose a reason for hiding this comment

github-actions bot commented May 29, 2024

github-actions bot commented May 29, 2024

baibaichen left a comment

Choose a reason for hiding this comment

github-actions bot commented May 30, 2024

github-actions bot commented May 30, 2024

GlutenPerfBot commented May 30, 2024

liuneng1994 commented May 28, 2024 •

edited

Loading

zhanglistar May 29, 2024 •

edited

Loading

liuneng1994 May 29, 2024 •

edited

Loading