Improve unified scheduler pipelining by chunking #2882
Conversation
Force-pushed from d4f3246 to 3e5555e.
@@ -3501,6 +3501,66 @@ impl Blockstore {
        Ok((entries, num_shreds, slot_meta.is_full()))
    }

    pub fn get_chunked_slot_entries_in_block(
write tests?
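To make the chunking idea concrete, here is a minimal, hypothetical sketch of how an inclusive shred-index range might be split into fixed-size chunks inside a chunked entry-load API. The function name `chunk_ranges` and the chunk size are illustrative assumptions, not the PR's actual code.

```rust
// Hypothetical helper: split an inclusive index range into fixed-size chunks,
// as a chunked entry-load API might do before fetching shreds per chunk.
fn chunk_ranges(start: u32, end: u32, chunk_size: u32) -> Vec<(u32, u32)> {
    assert!(chunk_size > 0);
    let mut chunks = Vec::new();
    let mut lo = start;
    while lo <= end {
        // Saturate so the last chunk stops exactly at `end`.
        let hi = lo.saturating_add(chunk_size - 1).min(end);
        chunks.push((lo, hi));
        if hi == end {
            break;
        }
        lo = hi + 1;
    }
    chunks
}

fn main() {
    // Ten shred indices (0..=9) split into chunks of 4.
    let chunks = chunk_ranges(0, 9, 4);
    assert_eq!(chunks, vec![(0, 3), (4, 7), (8, 9)]);
    println!("{chunks:?}");
}
```

Each `(lo, hi)` pair could then be loaded and deshredded independently, letting downstream work start before the whole slot is read.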
ledger/src/blockstore_processor.rs
best viewed in hide-whitespace mode diff. :)
ledger/src/blockstore.rs
    let keys = (start..=end).map(|index| (slot, u64::from(index)));
    let range_shreds = self
        .data_shred_cf
        .multi_get_bytes(keys)
I wonder how much overhead this incurs compared to one giant invocation of .multi_get_bytes().
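To illustrate the trade-off raised above, here is a small sketch comparing one giant batched lookup against several per-chunk lookups. The `multi_get` function and the `HashMap` store are stand-ins I invented to mimic the shape of the column family's `.multi_get_bytes(keys)` call; they are not the Blockstore API.

```rust
use std::collections::HashMap;

// Hypothetical stand-in for a column family's batched read: fetch many keys
// in one call. Mimics the shape of `.multi_get_bytes(keys)` without RocksDB.
fn multi_get(
    store: &HashMap<u64, Vec<u8>>,
    keys: impl Iterator<Item = u64>,
) -> Vec<Option<Vec<u8>>> {
    keys.map(|k| store.get(&k).cloned()).collect()
}

fn main() {
    let store: HashMap<u64, Vec<u8>> = (0u64..10).map(|i| (i, vec![i as u8])).collect();

    // One giant invocation over the whole range...
    let all = multi_get(&store, 0..10);

    // ...versus several per-chunk invocations, concatenated in order.
    let chunked: Vec<Option<Vec<u8>>> = (0..10)
        .collect::<Vec<u64>>()
        .chunks(4)
        .flat_map(|chunk| multi_get(&store, chunk.iter().copied()))
        .collect();

    // Both strategies yield identical results; the difference is purely
    // per-call overhead versus the ability to pipeline between chunks.
    assert_eq!(all, chunked);
}
```

The results are identical either way; the question is whether the per-call overhead of repeated batched reads outweighs the pipelining gained between chunks.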
Force-pushed from c631208 to 6469bc9.
After #2881, the bench crossed the 13-second wall. For reference, here are recent blockstore-processor numbers:
Force-pushed from 6469bc9 to 36dce60.
Note to self: this PR is somewhat stale. It's currently known that this chunked behavior adversely affects the entry-verification Rayon thread group.
Problem
Unlike blockstore-processor, the unified scheduler can start processing transactions asynchronously as soon as it is fed transactions. However, entries are currently fed in one large sweep, both when catching up after a full repair and in the normal replay stage (the notorious 100 ms):
agave/core/src/replay_stage.rs
Line 1149 in 34e9932
Summary of Changes
Introduce a chunked entry-load API in Blockstore and use it only for the unified scheduler. Subsequent deshredding can then overlap in time with already-submitted unified-scheduler processing, improving pipeline efficiency.
Lastly, note that this optimization applies only to block verification; block production by the unified scheduler won't benefit at all. (That said, there is a measurable gain for block verification, hence this PR.)
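The overlap described above can be sketched with a loader thread feeding entry chunks through a channel while a consumer starts processing immediately, instead of waiting for one large sweep. This is a minimal illustration of the pipelining idea only; `pipelined_load`, the channel, and the chunk size are all my invented stand-ins, not the scheduler's actual machinery.

```rust
use std::sync::mpsc;
use std::thread;

// Hypothetical sketch: the "deshredding" side sends entry chunks as soon as
// each one is ready, so the "scheduler" side can begin processing before
// loading finishes, rather than after one large sweep.
fn pipelined_load(total: u64, chunk_size: u64) -> Vec<u64> {
    let (tx, rx) = mpsc::channel::<Vec<u64>>();

    // Loader thread: produce entry chunks one at a time.
    let loader = thread::spawn(move || {
        let mut start = 0;
        while start < total {
            let end = (start + chunk_size).min(total);
            // Downstream consumption can begin before loading completes.
            tx.send((start..end).collect()).unwrap();
            start = end;
        }
        // Dropping `tx` here closes the channel and ends the consumer loop.
    });

    // Consumer: process each chunk as soon as it arrives.
    let mut processed = Vec::new();
    for chunk in rx {
        processed.extend(chunk);
    }
    loader.join().unwrap();
    processed
}

fn main() {
    assert_eq!(pipelined_load(12, 4), (0u64..12).collect::<Vec<_>>());
}
```

The end result is identical to a single sweep; the gain is that consumption overlaps with loading in wall-clock time.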
before
after
maybe 4-5% gain
Extracted from: #2325