multi: integrate new RBF co-op flow into the server+peer #9575
Roasbeef merged 24 commits into lightningnetwork:rbf-staging
Force-pushed c6cc135 to 8d4a3d7, then 8d4a3d7 to 84969ec.
In the next commit, we'll start checking feature bits to decide how to init the chan closer. In the future, we can move the current chan closer handling logic into an `MsgEndpoint`, which'll allow us to get rid of the explicit chan closer map and direct handling.
In this commit, we use the interfaces we created in the prior commit to make a new method capable of spinning up the new rbf coop closer.
In this commit, we add a new composite chanCloserFsm type. This'll allow us to store a single value that might be a negotiator or an RBF closer. In a follow-up commit, we'll use this to conditionally create the new RBF closer.
Force-pushed 84969ec to 22cad99.
Updated the base branch.
yyforyongyu left a comment
I think it's close. We need to fix the flakes in CI, which show up in both the unit tests and the itests. In addition, the msgmux logger is not registered:

```diff
diff --git a/log.go b/log.go
index e4c1d2ebe..5d1f2567f 100644
--- a/log.go
+++ b/log.go
@@ -44,6 +44,7 @@ import (
 	"github.com/lightningnetwork/lnd/lnwallet/chanfunding"
 	"github.com/lightningnetwork/lnd/lnwallet/rpcwallet"
 	"github.com/lightningnetwork/lnd/monitoring"
+	"github.com/lightningnetwork/lnd/msgmux"
 	"github.com/lightningnetwork/lnd/netann"
 	"github.com/lightningnetwork/lnd/peer"
 	"github.com/lightningnetwork/lnd/peernotifier"
@@ -202,6 +203,7 @@ func SetupLoggers(root *build.SubLoggerManager, interceptor signal.Interceptor)
 	)
 	AddV1SubLogger(root, graphdb.Subsystem, interceptor, graphdb.UseLogger)
 	AddSubLogger(root, chainio.Subsystem, interceptor, chainio.UseLogger)
+	AddSubLogger(root, msgmux.Subsystem, interceptor, msgmux.UseLogger)
 }

 // AddSubLogger is a helper method to conveniently create and register the
```

I ran the test rbf_coop_close locally. The normal flow works as expected, but the restart behavior seems weird; maybe it's related to the flake.
It's weird to see this error log once the connection is reestablished:

```
2025-03-07 17:18:23.706 [DBG] DISC syncer.go:450: Starting GossipSyncer([Bob])
...
2025-03-07 17:18:23.706 [TRC] MSGX msg_router.go:246: MsgRouter: unable to route msg msgmux.PeerMsg
2025-03-07 17:18:23.706 [ERR] PEER brontide.go:2136: Peer([Bob]): resend failed: unable to fetch channel sync messages for peer [Bob]@127.0.0.1:49436: unable to find closed channel summary
```
Then Alice sent a new shutdown request using 1 sat/vb, although it used 5 sat/vb before the reconnection:

```
2025-03-07 17:18:23.765 [INF] CHCL rbf_coop_transitions.go:488: ChannelPoint([ChanPoint: Alice=>Bob]): channel flushed! proceeding with co-op close
2025-03-07 17:18:23.765 [INF] CHCL rbf_coop_transitions.go:516: ChannelPoint([ChanPoint: Alice=>Bob]): using ideal_fee=1 sat/vb, absolute_fee=0.00000193 BTC
2025-03-07 17:18:23.765 [DBG] PFSM state_machine.go:591: FSM(rbf_chan_closer([ChanPoint: Alice=>Bob])): Adding new internal event to queue event=(*chancloser.SendOfferEvent)(0x1400286d010)({
 TargetFeeRate: (chainfee.SatPerVByte) 1 sat/vb
})
```
```go
// party (for w/e reason), but crashed before the close was complete.
//
//nolint:ll
type shutdownInit = fn.Option[fn.Either[*htlcswitch.ChanClose, channeldb.ShutdownInfo]]
```
I think it'd be better if we could avoid using this kind of data structure in the future. fn.Either is typically used to convey a result or an error. Though we can use it to carry either value, there's no guarantee that it won't be in an invalid state where both Right and Left are set. In contrast, I think this is guaranteed in Haskell via compile-time checks.
Aside from that, from a robustness perspective, every state should be handled. This construct alone creates four states: 1) it's None; 2) it's Some and Either.Right; 3) it's Some and Either.Left; 4) it's Some and both Either.Left and Either.Right are set (or the Either is empty?). Across the codebase we just use WhenSome, MapOption, or WhenLeft, avoiding the questions: what if it should be None but it's Some? What if the Option is None and the mapping does nothing? Etc.
Moving forward I think it's best to come back to the zen of Go: stick to interfaces to decompose complexity. I will create a proposal for that.
> fn.Either is typically used to convey a result or an error, though we can use it to carry either, but there's no guarantee that it won't be in an invalid state where both Right and Left are set
I think you're thinking of a result type? Such a type can be made using an either. The alternative to the usage here is creating a slim interface that the relevant structs implement, then using that as the map value. The advantage of this approach vs. that one is that we retain the core types at the definition site (you can see exactly which two values are used here).
> though we can use it to carry either, but there's no guarantee that it won't be in an invalid state where both Right and Left are set
So they can't actually both be set; it's either-or. The left vs. right values are private variables, and the exposed constructors only let you set one or the other. You don't always need to handle both instances, just like you sometimes do a type assertion and do nothing if it isn't the type you need to act on.
> Aside from that, from robustness's perspective, every state should be handled. This construct alone creates four states - 1) it's none; 2) it's some and Either.Right 3) it's some and Either.Left 4) it's some and Either.Left and Either.Right (or Either is empty?)
So those states also exist if this were an interface in a map (other than that 4th state, which isn't possible). It can be empty, or one of the many structs that implement the interface (which can't be known by looking at the map definition). There's no exhaustive checking by the compiler; instead you need to find all the structs that implicitly implement the type. With the either, we can be explicit about what we're actually storing here vs. the normal implicit interface implementations.
> Across the codebase we just use WhenSome, MapOption or WhenLeft to avoid answering the question what if it should be none but it's some, what is option is none and the mapping does nothing, etc.
We're not avoiding answering the question with these functions. At times you want to do something if it's set, and nothing otherwise. Those functions let you express those patterns.
I think the advantage of using this type where applicable is that you can be explicit about what is being stored/operated on. The alternative is to store an interface whose sole purpose is bundling several types, but which types are stored there is implicit (once you arrive at the definition, you need to search further to see which structs can be stored there, so you can make sure that all the relevant cases are handled).
```go
alicePendingUpdate = aliceCloseUpdate.GetClosePending()
require.NotNil(ht, aliceCloseUpdate)
require.Equal(
	ht, alicePendingUpdate.FeePerVbyte,
```
This got me thinking: maybe there should be a field to tell the user that the RBF failed? Something like Succeeded?
The issue with that is that we can't always detect if it failed or not. For Neutrino, we have no idea. One option we have is to always try to do a fee bump if the user wants one. The side effect of that is that we may end up paying more than the user expects, e.g.: they try to do 5 sat/vb, but 8 sat/vb is what'll actually trigger a replacement.
I'm open to modifying the broadcast flow in another PR. It'll need some changes in the protofsm executor, and result in some additional states in the state machine (BroadcastPending -> BroadcastFailed -> BroadcastSuccess, etc.). I want to restrict this PR to just fundamental blockers to merge, as we still need to do further interop, and get the rc into users' hands so we can get feedback on the RPC/CLI changes.
Force-pushed 22cad99 to fefd583.
So Alice is the one that initiated the initial shutdown via RPC request. We only store the shutdown message on disk, not the fee rate we used before the restart (see `channeldb.ShutdownInfo`). We have behavior to restart the shutdown flow anew for the RBF update; at the same time, we'll then go do a new manual update. The "unable to route message" log is normal; I'll update the log there to log the type of the inner msg (the wire msg). This'll happen for any message that we don't have a registered endpoint for. The close summary log is there because the channel hasn't fully closed yet.
Here's the current test flake. It actually doesn't have anything to do with the restart logic. This is the location that's failing: lnd/itest/lnd_coop_close_rbf_test.go, lines 95 to 103 at fefd583. We're doing a fee update that we know we can't pay for. We don't even send any p2p messages here, but an error is sent back to the user over the RPC stream. AFAICT, I see the logs that show a failure, but it doesn't actually reach the RPC client, resulting in a timeout 🤔... looking into it now.
In this commit, we fully integrate the new RBF close state machine into the peer. For the restart case after shutdown, we can short-circuit the existing logic, as the new FSM will handle retransmitting the shutdown message itself and doesn't need to delegate that duty to the link. Unlike the existing state machine, we're able to restart the flow to sign a coop close with a new, higher fee rate. In this case, we can now send multiple updates to the RPC caller, one for each newly signed coop close transaction. To implement the async flush case, we'll launch a new goroutine to wait until the state machine reaches the `ChannelFlushing` state, then we'll register the hook. We don't do this at startup, as otherwise the channel may _already_ be flushed, triggering an invalid state transition.
For now, we disallow the option to be used with the taproot chans option, as the new flow hasn't yet been updated for nonce usage.
This fixes some existing race conditions, as the `finalizeChanClosure` function was being called from outside the main event loop.
If we hit an error, we want to wipe the state machine state, which also includes removing the old endpoint.
This'll allow us to notify the caller each time a new coop close transaction with a higher fee rate is signed.
Resp is always nil, so we actually need to log event.Update here.
In this commit, we extend `CloseChannelAssertPending` with new args that returns the raw close status update (as we have more things we'd like to assert), and also allows us to pass in a custom fee rate.
Force-pushed b97d255 to 4f24870.
We'll properly handle a protocol error due to user input by halting, and sending the error back to the user. When a user goes to issue a new update, based on which state we're in, we'll either kick off the shutdown, or attempt a new offer. This matches the new spec update where we'll only send `Shutdown` once per connection.
In this commit, we alter the existing co-op close flow to enable RBF bumps after reconnection. With the new RBF close flow, it's possible that after a successful round _and_ a reconnection, either side wants to do another fee bump. Typically we route these requests through the switch, but in this case the link no longer exists in the switch, so any request to fee bump again would find that the link doesn't exist. In this commit, we implement a workaround wherein if we have an RBF chan closer active and the link isn't in the switch, then we just route the request directly to the chan closer via the peer. Once we have the chan closer, we can use the exact same flow as before.
Force-pushed 4f24870 to 4b9294e.
The itest has both sides try to close multiple times, each time with increasing fee rates. We also test the reconnection case, bad RBF updates, and instances where the local party can't actually pay for fees.
With this commit, we make sure we set the right height hint, even if the channel is a zero conf channel.
Force-pushed 2ed12b1 to 3801f3a.
I analyzed the logs from this failed build, and I think the initial flow doesn't seem right. The following observations are taken from after Alice started the first close channel request; for some unknown reason, Bob will always attempt a coop close even though he didn't request it:

- Bob received Alice's shutdown msg and replied with a shutdown msg.
- Bob then started his own shutdown process once the channel was flushed, although the RPC was not called.
- Bob then sent a closing complete with his own fee rate.
- He also received Alice's closing complete.
- The rest of the msgs also show Bob broadcasting two closing txns: one of his own, and the other from Alice.
- Bob finally started the coop process here, which means he shouldn't have attempted one before.

Alice's log is attached for reference; we can determine the msg order from the timestamps.
```go
_, aliceCloseUpdate = ht.CloseChannelAssertPending(
	alice, chanPoint, false,
	lntest.WithCoopCloseFeeRate(aliceRejectedFeeRate),
	lntest.WithLocalTxNotify(), lntest.WithSkipMempoolCheck(),
```
Since the coop close would fail, there's no need to skip the mempool check because the old txid is still in the mempool?
We still need this here, just to skip the final assertion in CloseChannelAssertPending. With the API, a close only fails (gRPC error) when we can't pay for the specified fee. Spurious RBF updates are suppressed in the form of a response that returns the target fee rate.
Crypt-iQ left a comment
LGTM pending linter, nice work 🦖
This fixes an issue in the itests in the restart case. We'd see an error like:

```
2025-03-12 23:41:10.754 [ERR] PFSM state_machine.go:661: FSM(rbf_chan_closer(2f20725d9004f7fda7ef280f77dd8d419fd6669bda1a5231dd58d6f6597066e0:0)): Unable to apply event err="invalid state transition: received *chancloser.SendOfferEvent while in ClosingNegotiation(local=LocalOfferSent(proposed_fee=0.00000193 BTC), remote=ClosePending(txid=07229915459cb439bdb8ad4f5bf112dc6f42fca0192ea16a7d6dd05e607b92ae, party=Remote, fee_rate=1 sat/vb))"
```

We resolve this by waiting to send in the new request until the old one has been completed.
Force-pushed 82cb727 to 7060765.
Final PR split off from #8453. This contains commits that pertain primarily to exposing the new RBF coop close to the end user via updates to the rpcserver, server, and peer.