Configurable number of steps in agg/join hash table probe #6124

zhouyuan · 2023-08-16T06:37:51Z

zhouyuan
Aug 16, 2023

Hi,
There are always "4" steps when doing hashagg/hashjoin probe:
https://github.com/facebookincubator/velox/blob/main/velox/exec/HashTable.cpp#L440-L466
https://github.com/facebookincubator/velox/blob/main/velox/exec/HashTable.cpp#L578-L598

I think this trick is to enable cache line prefetch and auto vectorization to improve performance. Is the steps "4" picked by some benchmark results on a specific hardware? do you think it make sense to make this configurable? e.g., on some small instance, 1 step maybe better.

Thanks, -yuan

bgeng777 · 2023-10-16T12:45:54Z

bgeng777
Oct 16, 2023

I have the exact same question as Yuan's. It looks like "4" is a magic number which is decided in the init commit of the repo. It would be really helpful if any one can share the thoughts after the decision.

0 replies

mbasmanova · 2023-10-27T20:14:10Z

mbasmanova
Oct 27, 2023
Collaborator

CC: @oerling @Yuhta

Some context: #5637

0 replies

Yuhta · 2023-10-27T20:29:24Z

Yuhta
Oct 27, 2023
Collaborator

It's for prefetch and instruction level parallelism. In general it won't hurt if the pipeline or memory bandwidth is already full, so there is not much need to decrease it. If the pipeline or memory bandwidth is not full, there is some chance to further increase it, but we need more data to see. 4 should be enough for most platforms though. One thing makes it harder to be configurable is that this must be decided at compile time, so any configuration here need to be in the form of a macro or template parameter.

0 replies

zhouyuan · 2023-10-30T00:00:43Z

zhouyuan
Oct 30, 2023
Author

Thank you!

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Configurable number of steps in agg/join hash table probe #6124

{{title}}

Replies: 4 comments

{{title}}

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

Select a reply

Configurable number of steps in agg/join hash table probe #6124

zhouyuan Aug 16, 2023

Replies: 4 comments

bgeng777 Oct 16, 2023

mbasmanova Oct 27, 2023 Collaborator

Yuhta Oct 27, 2023 Collaborator

zhouyuan Oct 30, 2023 Author

zhouyuan
Aug 16, 2023

bgeng777
Oct 16, 2023

mbasmanova
Oct 27, 2023
Collaborator

Yuhta
Oct 27, 2023
Collaborator

zhouyuan
Oct 30, 2023
Author