Add 'mnesia_table_loading_retry_timeout' and 'mnesia_table_loading_retry_limit' parameter(s) support #1038

alphamonkey79 · 2025-01-22T21:35:01Z

A restarted node will sync the schema and other information from its peers on boot. Before this process completes, the node won't be fully started and functional.
A stopping node picks an online cluster member (only disc nodes will be considered) to sync with after restart. Upon restart the node will try to contact that peer 10 times by default, with 30 second response timeouts.

In case the peer becomes available in that time interval, the node successfully starts, syncs what it needs from the peer and keeps going.

If the peer does not become available, the restarted node will give up and voluntarily stop. Such condition can be identified by the timeout (timeout_waiting_for_tables) warning messages in the logs that eventually lead to node startup failure:

This window of time can be adjusted using two configuration settings:

# wait for 60 seconds instead of 30
mnesia_table_loading_retry_timeout = 60000

# retry 15 times instead of 10
mnesia_table_loading_retry_limit = 15

By adjusting these settings and tweaking the time window in which known peer has to come back it is possible to account for cluster-wide redeployment scenarios that can be longer than 5 minutes to complete.

The text was updated successfully, but these errors were encountered:

wyardley · 2025-01-22T23:24:37Z

Are these settings that can be set via config_variables?

alphamonkey79 · 2025-01-23T13:11:57Z

@wyardley
I believe so.
Found these references in source:
https://github.com/rabbitmq/rabbitmq-server/blob/2f89bd91227cfbdf4a2024a4414cb3b93607bf69/deps/rabbit/priv/schema/rabbit.schema#L1552-L1563

I am working on a PR for adding and testing that functionality.

wyardley · 2025-01-23T16:23:18Z

Sounds great! You answered my next question 😉

alphamonkey79 changed the title ~~Add 'mnesia_table_loading' parameter(s) support~~ Add 'mnesia_table_loading_retry_timeout' and 'mnesia_table_loading_retry_limit' parameter(s) support Jan 22, 2025

alphamonkey79 added a commit to alphamonkey79/puppet-rabbitmq that referenced this issue Jan 23, 2025

(voxpupuli#1038) Add mnesia table loading variable support.

424cd68

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add 'mnesia_table_loading_retry_timeout' and 'mnesia_table_loading_retry_limit' parameter(s) support #1038

Add 'mnesia_table_loading_retry_timeout' and 'mnesia_table_loading_retry_limit' parameter(s) support #1038

alphamonkey79 commented Jan 22, 2025 •

edited

Loading

wyardley commented Jan 22, 2025

alphamonkey79 commented Jan 23, 2025

wyardley commented Jan 23, 2025

Add 'mnesia_table_loading_retry_timeout' and 'mnesia_table_loading_retry_limit' parameter(s) support #1038

Add 'mnesia_table_loading_retry_timeout' and 'mnesia_table_loading_retry_limit' parameter(s) support #1038

Comments

alphamonkey79 commented Jan 22, 2025 • edited Loading

wyardley commented Jan 22, 2025

alphamonkey79 commented Jan 23, 2025

wyardley commented Jan 23, 2025

alphamonkey79 commented Jan 22, 2025 •

edited

Loading