[CH] Performance regression when reading files from Hive(HDFS) #7177

zhanglistar · 2024-09-10T03:20:08Z

CH (ClickHouse)

SQL: select count(distinct country) from ttt where day = '2024-08-26' and hour = '00'

Related PR: #6841

vanilla:

About 4.6x slower than vanilla 332, and 18x slower than the last commit.

Spark-3.3.x



loneylee · 2024-09-10T03:49:41Z

Thanks for finding the problem, I will fix it as soon as possible.

zhanglistar added the bug and triage labels Sep 10, 2024

loneylee mentioned this issue Sep 10, 2024

[GLUTEN-7177][CH] Fix read hdfs performance issue #7187

Merged

baibaichen closed this as completed in #7187 Sep 11, 2024
