Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

reworks max number of outgoing push messages #3016

Merged
merged 1 commit into from
Sep 30, 2024

Conversation

behzadnouri
Copy link

Problem

max_bytes for outgoing push messages is pretty outdated and does not allow gossip to function properly with current testnet cluster size.
In particular it does not allow to clear out queue of pending push messages unless the new_push_messages function is called very frequently which involves repeatedly locking/unlocking CRDS table.
Additionally leaving gossip entries in the queue for the next round will add delay to propagating push messages which can compound as messages go through several hops.

Summary of Changes

The commit reworks outbound limit to allow more messages to be pushed out each time new_push_messages is invoked.

@behzadnouri behzadnouri added the v2.0 Backport to v2.0 branch label Sep 27, 2024
Copy link

mergify bot commented Sep 27, 2024

Backports to the beta branch are to be avoided unless absolutely necessary for fixing bugs, security issues, and perf regressions. Changes intended for backport should be structured such that a minimum effective diff can be committed separately from any refactoring, plumbing, cleanup, etc that are not strictly necessary to achieve the goal. Any of the latter should go only into master and ride the normal stabilization schedule. Exceptions include CI/metrics changes, CLI improvements and documentation updates on a case by case basis.

max_bytes for outgoing push messages is pretty outdated and does not
allow gossip to function properly with current testnet cluster size.

In particular it does not allow to clear out queue of pending push
messages unless the new_push_messages function is called very frequently
which involves repeatedly locking/unlocking CRDS table.

Additionally leaving gossip entries in the queue for the next round will
add delay to propagating push messages which can compound as messages go
through several hops.
Copy link

@gregcusack gregcusack left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this does increase the number of CrdsValues sent per second (which is kind of the point). So, lets keep an eye on how push and prune metrics change as a result of this PR. with that said, lgtm!

this PR also enables unstaked nodes to join and stay in gossip.

@behzadnouri behzadnouri added the automerge automerge Merge this Pull Request automatically once CI passes label Sep 30, 2024
Copy link

mergify bot commented Sep 30, 2024

automerge label removed due to a CI failure

@mergify mergify bot removed the automerge automerge Merge this Pull Request automatically once CI passes label Sep 30, 2024
@behzadnouri behzadnouri merged commit 489f483 into anza-xyz:master Sep 30, 2024
41 checks passed
@behzadnouri behzadnouri deleted the max-num-pushes branch September 30, 2024 17:37
@behzadnouri behzadnouri restored the max-num-pushes branch September 30, 2024 17:37
@behzadnouri behzadnouri deleted the max-num-pushes branch September 30, 2024 17:37
mergify bot pushed a commit that referenced this pull request Sep 30, 2024
max_bytes for outgoing push messages is pretty outdated and does not
allow gossip to function properly with current testnet cluster size.

In particular it does not allow to clear out queue of pending push
messages unless the new_push_messages function is called very frequently
which involves repeatedly locking/unlocking CRDS table.

Additionally leaving gossip entries in the queue for the next round will
add delay to propagating push messages which can compound as messages go
through several hops.

(cherry picked from commit 489f483)
mergify bot added a commit that referenced this pull request Oct 1, 2024
#3038)

reworks max number of outgoing push messages (#3016)

max_bytes for outgoing push messages is pretty outdated and does not
allow gossip to function properly with current testnet cluster size.

In particular it does not allow to clear out queue of pending push
messages unless the new_push_messages function is called very frequently
which involves repeatedly locking/unlocking CRDS table.

Additionally leaving gossip entries in the queue for the next round will
add delay to propagating push messages which can compound as messages go
through several hops.

(cherry picked from commit 489f483)

Co-authored-by: behzad nouri <[email protected]>
ray-kast pushed a commit to abklabs/agave that referenced this pull request Nov 27, 2024
max_bytes for outgoing push messages is pretty outdated and does not
allow gossip to function properly with current testnet cluster size.

In particular it does not allow to clear out queue of pending push
messages unless the new_push_messages function is called very frequently
which involves repeatedly locking/unlocking CRDS table.

Additionally leaving gossip entries in the queue for the next round will
add delay to propagating push messages which can compound as messages go
through several hops.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
v2.0 Backport to v2.0 branch
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants