lightning-liquidity: Introduce EventQueue notifier and wake BP for message processing #3509

Open · wants to merge 13 commits into main from 2025-01-liquidity-cleanup

Conversation

@tnull (Contributor) commented Jan 8, 2025

As discussed/mentioned in #3436, this is a preparatory refactor for the upcoming persistence, LSPS1 service, and LSPS5 work.

Here, we introduce an `EventQueueNotifierGuard` type that reduces the potential for lock contention arising from calling `EventQueue::enqueue` while holding peer-state locks.

We furthermore wake the background processor to trigger message processing after we enqueue new messages.
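
To make the notifier-guard idea concrete, here is a minimal sketch of the pattern. This is not the actual lightning-liquidity implementation (the struct layout, the `Condvar`-based notification, and the `handle_peer_message` example are assumptions), but it shows the key property: the guard is created before any peer-state lock is taken, and waiters are only notified when the guard drops, i.e. after those locks have been released.

```rust
use std::collections::VecDeque;
use std::sync::{Arc, Condvar, Mutex};

struct EventQueue<E> {
	queue: Mutex<VecDeque<E>>,
	condvar: Condvar,
}

impl<E> EventQueue<E> {
	fn new() -> Arc<Self> {
		Arc::new(Self { queue: Mutex::new(VecDeque::new()), condvar: Condvar::new() })
	}

	fn enqueue(&self, event: E) {
		// Only touches the queue lock; no notification happens here.
		self.queue.lock().unwrap().push_back(event);
	}
}

// Notifies waiters about pending events when dropped, i.e., only after the
// caller has released whatever locks it held while enqueueing.
struct EventQueueNotifierGuard<E>(Arc<EventQueue<E>>);

impl<E> EventQueueNotifierGuard<E> {
	fn new(queue: Arc<EventQueue<E>>) -> Self {
		Self(queue)
	}
}

impl<E> Drop for EventQueueNotifierGuard<E> {
	fn drop(&mut self) {
		if !self.0.queue.lock().unwrap().is_empty() {
			self.0.condvar.notify_all();
		}
	}
}

fn handle_peer_message(events: &Arc<EventQueue<&'static str>>, peer_state: &Mutex<u32>) {
	// Guard first, then locks.
	let _notifier = EventQueueNotifierGuard::new(Arc::clone(events));
	let mut state = peer_state.lock().unwrap();
	*state += 1;
	events.enqueue("SomeEvent");
	// `state` (the peer-state lock) drops before `_notifier`, so waiters are
	// only woken once the lock has been released.
}
```

Per the PR description, the handlers then only need to ensure they obtain the notifier before taking their peer-state locks.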

@tnull requested a review from TheBlueMatt on January 8, 2025 11:19
@tnull force-pushed the 2025-01-liquidity-cleanup branch 3 times, most recently from 90256c6 to f24e843 on January 14, 2025 12:43
@tnull force-pushed the 2025-01-liquidity-cleanup branch 2 times, most recently from ec27a7a to d24eca4 on January 16, 2025 08:57
@tnull (Contributor, Author) commented Jan 16, 2025

Rebased after #3533 landed, should be ready for review (cc @TheBlueMatt)

@TheBlueMatt added the weekly goal label (Someone wants to land this this week) on Jan 18, 2025
@tnull force-pushed the 2025-01-liquidity-cleanup branch 2 times, most recently from 90a1b34 to cfce460 on January 20, 2025 10:14
@TheBlueMatt (Collaborator) commented:
Why end up going with a direct function callback instead of the lightning Future logic (and integrating into the BP to drive the calls for LDK users and letting other users drive via an async task)?

@tnull (Contributor, Author) commented Jan 22, 2025

> Why end up going with a direct function callback instead of the lightning Future logic (and integrating into the BP to drive the calls for LDK users and letting other users drive via an async task)?

Hmm, that might be an alternative approach, yes. I wonder if it would make sense to add this more generally to the `CustomMessageHandler` interface, as every CMH will likely need to call back into the peer handler to trigger message processing?

@tnull force-pushed the 2025-01-liquidity-cleanup branch from cfce460 to cbd4754 on January 28, 2025 15:43
@tnull force-pushed the 2025-01-liquidity-cleanup branch from cbd4754 to 03eb649 on January 28, 2025 15:49
@tnull (Contributor, Author) commented Jan 28, 2025

Now switched to having the BP woken to trigger message processing. Had to rebase on top of #3546 to avoid silent rebase conflicts while adding lightning-liquidity as a dependency of lightning-background-processor.

Should be generally good for review, but I'm currently still fighting some issues arising from `RefCell` not being `Sync + Send` with `--no-default-features --features futures`.

@tnull changed the title from "lightning-liquidity: Introduce MessageQueue and EventQueue notifier types" to "lightning-liquidity: Introduce EventQueue notifier and wake BP for message processing" on Jan 28, 2025
@tnull force-pushed the 2025-01-liquidity-cleanup branch 2 times, most recently from ea35484 to 0845ea1 on January 29, 2025 13:19

codecov bot commented Jan 29, 2025

Codecov Report

Attention: Patch coverage is 70.39474% with 90 lines in your changes missing coverage. Please review.

Project coverage is 88.48%. Comparing base (2c3f11d) to head (eb3ea83).

Files with missing lines                      Patch %   Missing / Partials
lightning-liquidity/src/lsps2/service.rs      59.15%    44 missing, 14 partials
lightning-background-processor/src/lib.rs     84.09%    14 missing
lightning-liquidity/src/lsps1/client.rs        0.00%    12 missing
lightning-liquidity/src/events.rs             85.71%    2 missing, 2 partials
lightning-liquidity/src/lsps0/client.rs        0.00%    2 missing
Additional details and impacted files
@@            Coverage Diff             @@
##             main    #3509      +/-   ##
==========================================
- Coverage   88.53%   88.48%   -0.05%     
==========================================
  Files         149      149              
  Lines      114985   115039      +54     
  Branches   114985   115039      +54     
==========================================
- Hits       101803   101795       -8     
- Misses      10693    10750      +57     
- Partials     2489     2494       +5     


@tnull force-pushed the 2025-01-liquidity-cleanup branch 5 times, most recently from ca96d1f to 4af5fc3 on January 29, 2025 16:28
@TheBlueMatt (Collaborator) left a comment:


Would be nice to get some additional context in some of the commit messages, especially "Relax Sync + Send bounds on BackgroundProcessor where possible", which really needs an explainer of why we think it's a good idea to do so.


let (expected_payment_size_msat, mpp_mode) =
	if let Some(payment_size_msat) = payment_size_msat {
		(*payment_size_msat, true)
	} else {
		debug_assert_eq!(num_htlcs, 1);
		if num_htlcs != 1 {
			// Revert the queue before error'ing
@TheBlueMatt (Collaborator):

Not sure I understand the point of this logic here - we've pushed a new htlc onto payment_queue, but we're erroring here which causes the HTLC to be failed-back. The comment seems to imply we want to revert to the old queue state (which I think would be right?), but that isn't what we're doing.

@tnull (Contributor, Author):

Hmm, maybe the comment could be more specific, but the idea here is to preserve the previous behavior, which keeps all HTLCs in the queue until we get `HTLCHandlingFailed`, at which point we need to clean up the queue, no?

@TheBlueMatt (Collaborator):

Right, so two things:
(a) I think there's an existing bug here - LSPS2ServiceHandler::htlc_intercepted calls through to here, and when it gets an Err it calls channelmanager....fail_intercepted_htlc, but the code here keeps the HTLC listed as pending.
(b) doing the take at the top looks pretty brittle, and isn't required. eg the following patch compiles:

$ git diff -U2
diff --git a/lightning-liquidity/src/lsps2/service.rs b/lightning-liquidity/src/lsps2/service.rs
index 69acc5ba6..f6ba6aec7 100644
--- a/lightning-liquidity/src/lsps2/service.rs
+++ b/lightning-liquidity/src/lsps2/service.rs
@@ -142,6 +142,5 @@ impl OutboundJITChannelState {
                let new_state;
                let res = match self {
-                       OutboundJITChannelState::PendingInitialPayment { payment_queue: old_payment_queue } => {
-                               let mut payment_queue = core::mem::take(old_payment_queue);
+                       OutboundJITChannelState::PendingInitialPayment { payment_queue } => {
                                let (total_expected_outbound_amount_msat, num_htlcs) = payment_queue.add_htlc(htlc);

@@ -152,6 +151,4 @@ impl OutboundJITChannelState {
                                                debug_assert_eq!(num_htlcs, 1);
                                                if num_htlcs != 1 {
-                                                       // Revert the queue before error'ing
-                                                       core::mem::swap(old_payment_queue, &mut payment_queue);
                                                        return Err(ChannelStateError(
                                                                "Paying via multiple HTLCs is disallowed in \"no-MPP+var-invoice\" mode.".to_string()
@@ -164,6 +161,4 @@ impl OutboundJITChannelState {
                                        || expected_payment_size_msat > opening_fee_params.max_payment_size_msat
                                {
-                                       // Revert the queue before error'ing
-                                       core::mem::swap(old_payment_queue, &mut payment_queue);
                                        return Err(ChannelStateError(
                                                        format!("Payment size violates our limits: expected_payment_size_msat = {}, min_payment_size_msat = {}, max_payment_size_msat = {}",
@@ -181,6 +176,4 @@ impl OutboundJITChannelState {
                                        opening_fee
                                } else {
-                                       // Revert the queue before error'ing
-                                       core::mem::swap(old_payment_queue, &mut payment_queue);
                                        return Err(ChannelStateError(
                                                        format!("Could not compute valid opening fee with min_fee_msat = {}, proportional = {}, and expected_payment_size_msat = {}",
@@ -199,5 +192,5 @@ impl OutboundJITChannelState {
                                {
                                        new_state = OutboundJITChannelState::PendingChannelOpen {
-                                               payment_queue,
+                                               payment_queue: core::mem::take(payment_queue),
                                                opening_fee_msat,
                                        };
@@ -210,9 +203,7 @@ impl OutboundJITChannelState {
                                        if mpp_mode {
                                                new_state =
-                                                       OutboundJITChannelState::PendingInitialPayment { payment_queue };
+                                                       OutboundJITChannelState::PendingInitialPayment { payment_queue: core::mem::take(payment_queue) };
                                                Ok(None)
                                        } else {
-                                               // Revert the queue before error'ing
-                                               core::mem::swap(old_payment_queue, &mut payment_queue);
                                                return Err(ChannelStateError(
                                                        "Intercepted HTLC is too small to pay opening fee".to_string(),

@tnull (Contributor, Author) commented Feb 12, 2025:

> (a) I think there's an existing bug here - LSPS2ServiceHandler::htlc_intercepted calls through to here, and when it gets an Err it calls channelmanager....fail_intercepted_htlc, but the code here keeps the HTLC listed as pending.

Well, we def. need to keep the payment as pending, question is when we'd remove the HTLC. My thinking was that it would always happen as part of handling HTLCHandlingFailed, but IIUC, we need #3551 for this to work reliably in all cases.

> (b) doing the take at the top looks pretty brittle, and isn't required. eg the following patch compiles:

I think I considered this but somehow arrived at the opinion it wouldn't work (maybe something to do with persistence, as this commit came from that draft branch). However, I can't recall why I thought that, and indeed the above approach seems to work just as well while being cleaner. Now added in a fixup.

@TheBlueMatt (Collaborator):

> Well, we def. need to keep the payment as pending, question is when we'd remove the HTLC. My thinking was that it would always happen as part of handling HTLCHandlingFailed, but IIUC, we need #3551 for this to work reliably in all cases.

Ah, okay, I wasn't considering that fail_intercepted_htlc generates an HTLCHandlingFailed and we handle it that way... That said, I'm not really convinced that's the right approach - it leaves a race where our state doesn't match the HTLC state, and we might think we've received enough HTLCs if another one comes in before we process the event.

@@ -166,6 +163,8 @@ impl OutboundJITChannelState {
	if expected_payment_size_msat < opening_fee_params.min_payment_size_msat
		|| expected_payment_size_msat > opening_fee_params.max_payment_size_msat
	{
		// Revert the queue before error'ing
		core::mem::swap(old_payment_queue, &mut payment_queue);
@TheBlueMatt (Collaborator):

Might be easier to just push the HTLC right before we move to the success block, instead of pushing it and then trying to revert before returning an Err... as it is, we'll end up adding a new error return in the future and forgetting to revert.

@tnull (Contributor, Author):

Yeah, although note that all of the following calculations are made based on the state with the HTLC already added. Pushing only in the success case would essentially mean refactoring the logic to run the same calculations twice.
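
For illustration, a rough sketch of the push-last alternative under discussion, with made-up names rather than the actual lsps2 code: compute the totals as they would look with the HTLC included, run the checks against those prospective totals, and only push the HTLC once every check has passed, so no error path needs to revert the queue.

```rust
struct InterceptedHtlc {
	amount_msat: u64,
}

#[derive(Default)]
struct PaymentQueue {
	htlcs: Vec<InterceptedHtlc>,
}

impl PaymentQueue {
	// Totals as they would look *after* adding `htlc`, without mutating the queue.
	fn prospective_totals(&self, htlc: &InterceptedHtlc) -> (u64, usize) {
		let total: u64 =
			self.htlcs.iter().map(|h| h.amount_msat).sum::<u64>() + htlc.amount_msat;
		(total, self.htlcs.len() + 1)
	}
}

fn htlc_intercepted(
	queue: &mut PaymentQueue, htlc: InterceptedHtlc, min_payment_size_msat: u64,
	max_payment_size_msat: u64,
) -> Result<(), String> {
	let (total_msat, _num_htlcs) = queue.prospective_totals(&htlc);
	if total_msat < min_payment_size_msat || total_msat > max_payment_size_msat {
		// Early return without having touched the queue, so nothing to revert.
		return Err(format!("Payment size violates our limits: {} msat", total_msat));
	}
	// All checks passed: commit the HTLC to the queue.
	queue.htlcs.push(htlc);
	Ok(())
}
```

The trade-off tnull points out is visible here: every check has to be phrased in terms of prospective totals rather than the queue's own state, which duplicates the bookkeeping if several values are derived from the queue after the push.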

@tnull force-pushed the 2025-01-liquidity-cleanup branch from 4af5fc3 to 30e5fab on February 5, 2025 08:42
@tnull (Contributor, Author) commented Feb 5, 2025

> Would be nice to get some additional context in some of the commit messages, especially "Relax Sync + Send bounds on BackgroundProcessor where possible", which really needs an explainer of why we think it's a good idea to do so.

Alright, now added some rationale there.

@tnull force-pushed the 2025-01-liquidity-cleanup branch from 30e5fab to eb3ea83 on February 5, 2025 11:18
The previous transition pattern of `OutboundJITChannelState` was never
great: we'd take `&mut self`, only to also return `Self`, requiring the
caller to update the state externally to the state transition methods. In
addition, we previously wrapped `PaymentQueue` in an `Arc<Mutex<..>>` to
avoid cloning it during state transitions.

Here, we clean up all of this, having the state transition methods
update the state in-place and merely return an `action` in the
methods' `Result`s. We also use `core::mem::take` to move the
`payment_queue` to the new states without reallocation.
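
A minimal sketch of that pattern, with made-up state and action names rather than the real LSPS2 types: the transition method takes `&mut self`, moves the owned queue into the new state via `core::mem::take`, and only hands an action back to the caller.

```rust
use core::mem;

#[derive(Default)]
struct PaymentQueue(Vec<u64>);

enum State {
	PendingInitialPayment { payment_queue: PaymentQueue },
	PendingChannelOpen { payment_queue: PaymentQueue, opening_fee_msat: u64 },
}

enum Action {
	OpenChannel { opening_fee_msat: u64 },
}

impl State {
	// Updates the state in place and merely returns an action in the `Result`.
	fn htlc_intercepted(&mut self, amount_msat: u64) -> Result<Option<Action>, String> {
		match self {
			State::PendingInitialPayment { payment_queue } => {
				payment_queue.0.push(amount_msat);
				let opening_fee_msat = 1_000; // placeholder for the real fee calculation
				// Move the queue into the new state without cloning or reallocating.
				let payment_queue = mem::take(payment_queue);
				*self = State::PendingChannelOpen { payment_queue, opening_fee_msat };
				Ok(Some(Action::OpenChannel { opening_fee_msat }))
			},
			State::PendingChannelOpen { .. } => {
				Err("Unexpected HTLC in this state".to_string())
			},
		}
	}
}
```

The caller then matches on the returned `Action` (e.g. to trigger a channel open) instead of replacing the state itself.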
@tnull force-pushed the 2025-01-liquidity-cleanup branch 4 times, most recently from 45c9dc5 to cb2e114 on February 13, 2025 13:26
tnull added 10 commits on February 13, 2025 14:56
Previously, when enqueuing new events to the `EventQueue`, we'd
directly attempt to wake any notifiers/notify any threads waiting on
the `Condvar` about the newly available events. This could of course
mean we'd notify them while we were still holding some locks, e.g., on
the peer state.

Here, we instead introduce an `EventQueueNotifierGuard` type that will
notify about pending events if necessary, which mitigates any potential
lock contention: we now simply have to ensure that any method calling
`enqueue` holds the notifier before acquiring any locks.
.. in order to make handling generics easier, just as we do with
`AChannelManager`, `AOnionMessenger`, etc.
Instead of doing the callback dance, we have
`lightning-background-processor` take `lightning-liquidity` as a
dependency and wake the BP whenever we enqueue new messages to the
`MessageQueue`.
It seems that a lot of the generics on `BackgroundProcessor` don't
actually require the `Sync + Send` bounds. Here, we therefore
drop them where possible, as the unnecessary bounds could result in
the compiler disallowing the use of certain types that aren't `Sync +
Send`, even when run in threadless environments (i.e., some `no_std`
environments).
.. we were already doing the right thing, but our docs were stale.
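
As a rough sketch of the wake-up path described in the "Instead of doing the callback dance" commit above (the names and the `Condvar`-based waker are illustrative assumptions, not the actual lightning-background-processor API): enqueueing a message pokes a waker shared with the background processor's loop, so message processing happens promptly rather than on the next timer tick.

```rust
use std::sync::{Arc, Condvar, Mutex};

// A tiny waker the background processor can block on between rounds of work.
#[derive(Default)]
struct Waker {
	woken: Mutex<bool>,
	condvar: Condvar,
}

impl Waker {
	fn wake(&self) {
		*self.woken.lock().unwrap() = true;
		self.condvar.notify_one();
	}

	fn wait(&self) {
		let mut woken = self.woken.lock().unwrap();
		while !*woken {
			woken = self.condvar.wait(woken).unwrap();
		}
		*woken = false;
	}
}

struct MessageQueue<M> {
	msgs: Mutex<Vec<M>>,
	// Shared with the background processor's loop.
	waker: Arc<Waker>,
}

impl<M> MessageQueue<M> {
	fn enqueue(&self, msg: M) {
		self.msgs.lock().unwrap().push(msg);
		// Wake the BP so it can trigger `PeerManager::process_events` now
		// instead of waiting for the next timer tick.
		self.waker.wake();
	}
}
```

In the PR this is the motivation for lightning-background-processor taking lightning-liquidity as a dependency: the BP loop is the natural place to both wait on such a notifier and drive message processing.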
@tnull force-pushed the 2025-01-liquidity-cleanup branch from cb2e114 to e134368 on February 13, 2025 13:56
Labels
lightning-liquidity, weekly goal (Someone wants to land this this week)