fix: Reuse vector in LocalPartition #12002

Yuhta · 2025-01-02T15:04:11Z

Summary:
More than 10% of CPU time is spent on the destruction of local partition output when the load is high.

Also add some optimizations for serialization.

Differential Revision: D67742489
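For readers unfamiliar with the technique, the general idea of reusing buffers instead of destroying them can be sketched as below. This is a minimal illustration, not the code in this PR: `Buffer`, `BufferPool`, `acquire`, and `release` are hypothetical names, and the capacity bound is an assumption.

```cpp
#include <cassert>
#include <cstddef>
#include <memory>
#include <utility>
#include <vector>

// Hypothetical stand-in for an output buffer that is costly to destroy.
struct Buffer {
  std::vector<int> values;
};

// Hypothetical bounded pool that keeps released buffers for reuse.
class BufferPool {
 public:
  explicit BufferPool(std::size_t capacity) : capacity_(capacity) {}

  // Returns a pooled buffer if one is available, otherwise allocates one.
  std::unique_ptr<Buffer> acquire() {
    if (free_.empty()) {
      return std::make_unique<Buffer>();
    }
    auto buffer = std::move(free_.back());
    free_.pop_back();
    return buffer;
  }

  // Takes a buffer back. clear() keeps the allocated capacity, so the next
  // user can refill it without reallocating. Buffers beyond 'capacity_' are
  // destroyed as usual when 'buffer' goes out of scope.
  void release(std::unique_ptr<Buffer> buffer) {
    assert(buffer != nullptr);
    if (free_.size() < capacity_) {
      buffer->values.clear();
      free_.push_back(std::move(buffer));
    }
  }

  std::size_t size() const { return free_.size(); }

 private:
  const std::size_t capacity_;
  std::vector<std::unique_ptr<Buffer>> free_;
};
```

A producer would call `acquire()` before filling an output and the consumer would call `release()` once it is done, so the destructor only runs for buffers that do not fit in the pool.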

netlify · 2025-01-02T15:04:32Z

✅ Deploy Preview for meta-velox canceled.

Name	Link
🔨 Latest commit	`0891cb0`
🔍 Latest deploy log	https://app.netlify.com/sites/meta-velox/deploys/6781817ed0a26e00089f2c3b

facebook-github-bot · 2025-01-02T15:04:35Z

This pull request was exported from Phabricator. Differential Revision: D67742489

facebook-github-bot · 2025-01-02T23:27:13Z

This pull request was exported from Phabricator. Differential Revision: D67742489

xiaoxmeng

@Yuhta LGTM modulo nits, and thanks for the optimization. It might be better to remove the `current_` handling in ByteStream as discussed offline, since it seems likely to cause tricky bugs in the future.

xiaoxmeng · 2025-01-09T23:27:08Z

velox/common/memory/ByteStream.h

  }

  void appendBool(bool value, int32_t count);

+  /// Fast path used by appending one null in vector serialization.
+  template <bool kValue>
+  void appendOneBool() {


Can we have a unit test for this? Thanks!

I will remove the specialization in the new version; just making sure it's inlined should be enough.
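As a rough illustration of what a non-templated, inlined single-bit append could look like, here is a self-contained sketch. It is not the `ByteOutputStream` implementation: `BitWriter` and its members are hypothetical, and the least-significant-bit-first layout is an assumption.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical bit writer; bits are packed least significant bit first.
class BitWriter {
 public:
  // General path: appends 'count' copies of 'value'.
  void appendBool(bool value, std::int32_t count) {
    assert(count >= 0);
    for (std::int32_t i = 0; i < count; ++i) {
      appendOneBool(value);
    }
  }

  // Single-bit path. Defined in the class body, so it is implicitly inline
  // and takes 'value' as a plain argument rather than a template parameter.
  void appendOneBool(bool value) {
    const std::size_t bit = numBits_ & 7;
    if (bit == 0) {
      bytes_.push_back(0);  // Start a new byte every 8 bits.
    }
    if (value) {
      bytes_.back() |= static_cast<std::uint8_t>(1u << bit);
    }
    ++numBits_;
  }

  std::size_t numBits() const { return numBits_; }
  const std::vector<std::uint8_t>& bytes() const { return bytes_; }

 private:
  std::vector<std::uint8_t> bytes_;
  std::size_t numBits_{0};
};
```

When the caller passes a constant and the call is inlined, the compiler can usually fold the `if (value)` branch, which is the effect the template parameter would otherwise provide.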

xiaoxmeng · 2025-01-09T23:29:46Z

velox/common/memory/ByteStream.h

@@ -411,8 +430,10 @@ class ByteOutputStream {
  // The total number of bytes allocated from 'arena_' in 'ranges_'.
  int64_t allocatedBytes_{0};

-  // Pointer to the current element of 'ranges_'.
-  ByteRange* current_{nullptr};
+  // Copy of the current element in 'ranges_'.  This is copied to avoid memory


I am not very sure we want to do this optimization until we see a noticeable regression on an actual query. Thanks!

We do see an improvement of a few percent (~3%) in the E2E query from removing the 2 extra hops.
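To make the trade-off in this thread concrete, here is a minimal sketch of a writer that keeps a copy of the active range instead of a pointer into the range list. It is an illustration under assumptions, not the `ByteOutputStream` code: `Range`, `CopyingWriter`, `flush`, and `advance` are hypothetical names, and which indirections count as the "extra hops" is one possible reading.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <utility>
#include <vector>

// Hypothetical simplified range: a fixed buffer plus a write position.
struct Range {
  std::uint8_t* buffer;
  std::int32_t size;
  std::int32_t position;
};

// Hypothetical writer that holds a copy of the active range.
class CopyingWriter {
 public:
  explicit CopyingWriter(std::vector<Range> ranges)
      : ranges_(std::move(ranges)) {
    assert(!ranges_.empty());
    current_ = ranges_[0];
  }

  // Hot path: touches only the inline copy, not the heap-allocated list.
  bool appendByte(std::uint8_t value) {
    while (current_.position == current_.size) {
      if (!advance()) {
        return false;  // All ranges are full.
      }
    }
    current_.buffer[current_.position++] = value;
    return true;
  }

  // Writes the copy back so 'ranges_' reflects the latest position.
  void flush() { ranges_[index_] = current_; }

  const std::vector<Range>& ranges() const { return ranges_; }

 private:
  // Saves the active copy and loads the next range, if any.
  bool advance() {
    flush();
    if (index_ + 1 == ranges_.size()) {
      return false;
    }
    current_ = ranges_[++index_];
    return true;
  }

  std::vector<Range> ranges_;
  std::size_t index_{0};
  Range current_{};
};
```

The cost of the copy is that `ranges()` is stale until `flush()` runs, which is the kind of subtle bug the review comment above warns about.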

velox/common/memory/ByteStream.h

velox/exec/LocalPartition.cpp

Summary: X-link: facebookincubator/nimble#122 More than 10% of CPU time is spent on the destruction of local partition output when the load is high. Also add some optimizations for serialization. The optimization on `ByteOutputStream::appendBool` does not show a significant gain in the example query (because there are a lot of small batches), but it is a net gain and would be significant in large batches, so I leave it in the code. Differential Revision: D67742489

facebook-github-bot · 2025-01-10T19:16:23Z

This pull request was exported from Phabricator. Differential Revision: D67742489

xiaoxmeng

@Yuhta LGTM. thanks!

xiaoxmeng · 2025-01-10T19:54:56Z

velox/exec/tests/LocalPartitionTest.cpp

+  vectorPool.push(makeVector(), 3);
+  vectorPool.push(makeVector(), 1);
+  auto vector = vectorPool.pop();
+  ASSERT_TRUE(vector);


nit: ASSERT_TRUE(vector != nullptr);

xiaoxmeng · 2025-01-10T19:55:00Z

velox/exec/tests/LocalPartitionTest.cpp

+  ASSERT_TRUE(vector);
+  ASSERT_EQ(vector.get(), vectors[0]);
+  vector = vectorPool.pop();
+  ASSERT_TRUE(vector);


facebook-github-bot · 2025-01-10T20:22:59Z

This pull request was exported from Phabricator. Differential Revision: D67742489

Summary: X-link: facebookincubator/velox#12002 Pull Request resolved: #122 More than 10% of CPU time is spent on the destruction of local partition output when the load is high. Also add some optimizations for serialization. The optimization on `ByteOutputStream::appendBool` does not show a significant gain in the example query (because there are a lot of small batches), but it is a net gain and would be significant in large batches, so I leave it in the code. Reviewed By: xiaoxmeng Differential Revision: D67742489 fbshipit-source-id: 8e70dd128f31caa7909ed7c1e2b4ac1e59d7c87d

facebook-github-bot · 2025-01-10T23:28:33Z

This pull request has been merged in 9dcfd39.

facebook-github-bot added the CLA Signed label Jan 2, 2025

facebook-github-bot added the fb-exported label Jan 2, 2025

Yuhta force-pushed the export-D67742489 branch from 232444d to d4fbc7e on January 2, 2025 23:27

xiaoxmeng reviewed Jan 10, 2025

Yuhta force-pushed the export-D67742489 branch from d4fbc7e to d377ecb on January 10, 2025 19:15

xiaoxmeng approved these changes Jan 10, 2025

Yuhta force-pushed the export-D67742489 branch from d377ecb to 0891cb0 on January 10, 2025 20:22

facebook-github-bot closed this in 9dcfd39 Jan 10, 2025

facebook-github-bot added the Merged label Jan 10, 2025
