txmgr: Restructure internals and add TxNotInMempoolTimeout by trianglesphere · Pull Request #5286 · ethereum-optimism/optimism

trianglesphere · 2023-03-29T00:17:15Z

Description

This restructures a lot of the internal of transaction sending. It adds a new flag and features where it aborts sending the transaction if the transaction is never seen on the network after a default of 2 minutes (configurable via txmgr flag).

The transaction manager's sendState has been modified to track successful publishes and the time of its creation. If it goes for TxNotInMempoolTimeout time without seeing a successful transaction publish (i.e. it believes the tx is not in the mempool) then it will abort.

The goal of this change is to ensure that the transaction manager does not get stuck in state where the transaction will not get confirmed while not accidentally recording transactions as failed when they in fact succeeded.

TODOs

Author or reviewer has added an entry to the current release notes draft, if appropriate.

changeset-bot · 2023-03-29T00:17:20Z

⚠️ No Changeset found

Latest commit: 9afb543

Merging this PR will not cause a version bump for any packages. If these changes should not result in a new version, you're good to go. If these changes should result in a version bump, you need to add a changeset.

This PR includes no changesets

When changesets are added to this PR, you'll see the packages that this PR includes changesets for and the associated semver types

Click here to learn what changesets are, and how to add one.

Click here if you're a maintainer who wants to add a changeset to this PR

netlify · 2023-03-29T00:17:23Z

✅ Deploy Preview for opstack-docs canceled.

Name	Link
🔨 Latest commit	`9afb543`
🔍 Latest deploy log	https://app.netlify.com/sites/opstack-docs/deploys/6427258303251700085e46ed

semgrep-app · 2023-03-29T16:50:00Z

Semgrep found 1 unchecked-type-assertion finding:

op-bindings/bindings/systemconfig.go: L380

Unchecked type assertion.

_{Created by unchecked-type-assertion.}

op-e2e/setup.go

codecov · 2023-03-29T19:12:16Z

Codecov Report

Merging #5286 (9afb543) into develop (7354398) will decrease coverage by 3.79%.
The diff coverage is 68.18%.

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #5286      +/-   ##
===========================================
- Coverage    39.93%   36.15%   -3.79%     
===========================================
  Files          382      227     -155     
  Lines        24376    19883    -4493     
  Branches       838        0     -838     
===========================================
- Hits          9734     7188    -2546     
+ Misses       13911    12000    -1911     
+ Partials       731      695      -36

Flag	Coverage Δ
bedrock-go-tests	`36.15% <68.18%> (-0.06%)`	⬇️
common-ts-tests	`?`
contracts-bedrock-tests	`?`
contracts-tests	`?`
core-utils-tests	`?`
dtl-tests	`?`
fault-detector-tests	`?`
sdk-tests	`?`

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
op-service/txmgr/cli.go	`43.05% <33.33%> (-0.03%)`	⬇️
op-service/txmgr/txmgr.go	`78.94% <68.90%> (-3.13%)`	⬇️
op-service/txmgr/send_state.go	`96.66% <100.00%> (+0.30%)`	⬆️

... and 156 files with indirect coverage changes

ajsutton

Generally looks good - left a few clarifying comments. Main concern for me is having to sleep in tests - that's a very common source of intermittency and leads to slower tests.

op-e2e/migration_test.go

op-service/txmgr/send_state.go

op-service/txmgr/send_state_test.go

op-service/txmgr/send_state.go

op-service/txmgr/txmgr.go

sebastianst

I'm wondering, do we even need the async tx sending in send with sendTxAsync via a goroutine and then waiting for the receipt on a channel?

It looks to me like the whole transaction sending loop could also just be a normal synchronous call to publishAndWaitForTx, which would simply return *types.Receipt, error. But maybe I'm just oversimplifying the algorithm here.

op-service/txmgr/send_state.go

op-service/txmgr/txmgr.go

trianglesphere · 2023-03-30T20:39:37Z

I'm wondering, do we even need the async tx sending in send with sendTxAsync via a goroutine and then waiting for the receipt on a channel?

It looks to me like the whole transaction sending loop could also just be a normal synchronous call to publishAndWaitForTx, which would simply return *types.Receipt, error. But maybe I'm just oversimplifying the algorithm here.

It's because when we update the gas price, we are waiting for multiple different transaction hashes. Say a tx gets included in L1, but we then published another transaction with a higher gas price before we realize. If we only waited for the second TX we would miss that the first TX we sent got included on L1.

We could make it synchronous by polling every transaction that has been sent

ajsutton · 2023-03-30T22:11:47Z

Actually I'm going to dismiss my review and just leave approval to Seb since he has better context. I really just meant to remove my request changes.

I'm happy with it, but leaving final approval to Seb.

sebastianst · 2023-03-31T14:01:08Z

I'm wondering, do we even need the async tx sending in send with sendTxAsync via a goroutine and then waiting for the receipt on a channel?
It looks to me like the whole transaction sending loop could also just be a normal synchronous call to publishAndWaitForTx, which would simply return *types.Receipt, error. But maybe I'm just oversimplifying the algorithm here.

It's because when we update the gas price, we are waiting for multiple different transaction hashes. Say a tx gets included in L1, but we then published another transaction with a higher gas price before we realize. If we only waited for the second TX we would miss that the first TX we sent got included on L1.

We could make it synchronous by polling every transaction that has been sent

Ah thanks, that makes sense. If we're running into problems with the current async architecture, I'd vote for this refactor. But fine for now since it works.

sebastianst

Looks great, just two open questions regarding error strings checking

op-service/txmgr/txmgr.go

trianglesphere · 2023-03-31T15:10:26Z

#5286 (comment)
Can't comment on that for some reason. @sebastianst

ethereum.NotFound is correctly returned by the client.

mergify · 2023-03-31T17:32:23Z

This PR has been added to the merge queue, and will be merged soon.

mergify · 2023-03-31T17:32:24Z

This PR is next in line to be merged, and will be merged as soon as checks pass.

mergify · 2023-03-31T17:33:22Z

This PR is next in line to be merged, and will be merged as soon as checks pass.

trianglesphere · 2023-03-31T18:20:33Z

@Mergifyio refresh

mergify · 2023-03-31T18:20:36Z

refresh

✅ Pull request refreshed

mergify · 2023-03-31T18:46:28Z

This PR has been added to the merge queue, and will be merged soon.

trianglesphere mentioned this pull request Mar 29, 2023

op-proposer: add L1 fee metrics #5268

Closed

1 task

trianglesphere force-pushed the jg/tx_lifecycle_mgmt branch 2 times, most recently from cf37742 to da9fe76 Compare March 29, 2023 16:49

trianglesphere force-pushed the jg/tx_lifecycle_mgmt branch 2 times, most recently from cf59022 to 5e7a716 Compare March 29, 2023 18:22

semgrep-app bot reviewed Mar 29, 2023

View reviewed changes

op-e2e/setup.go Show resolved Hide resolved

semgrep-app bot reviewed Mar 29, 2023

View reviewed changes

op-e2e/setup.go Show resolved Hide resolved

trianglesphere changed the title ~~txmgr: Restructure internals~~ txmgr: Restructure internals and add TxNotInMempoolTimeout Mar 29, 2023

trianglesphere force-pushed the jg/tx_lifecycle_mgmt branch 2 times, most recently from 5e2d5d0 to 2c546e3 Compare March 29, 2023 18:51

trianglesphere marked this pull request as ready for review March 29, 2023 18:51

trianglesphere requested review from a team as code owners March 29, 2023 18:51

trianglesphere requested review from ajsutton, protolambda, sebastianst and zhwrd and removed request for protolambda March 29, 2023 18:51

ajsutton requested changes Mar 30, 2023

View reviewed changes

sebastianst requested changes Mar 30, 2023

View reviewed changes

trianglesphere force-pushed the jg/tx_lifecycle_mgmt branch from 2c546e3 to 1d50cc9 Compare March 30, 2023 20:34

trianglesphere requested review from ajsutton and sebastianst March 30, 2023 20:36

ajsutton previously approved these changes Mar 30, 2023

View reviewed changes

trianglesphere mentioned this pull request Mar 30, 2023

txmgr: Simplify API #5306

Merged

1 task

sebastianst reviewed Mar 31, 2023

View reviewed changes

op-service/txmgr/txmgr.go Show resolved Hide resolved

op-service/txmgr/txmgr.go Show resolved Hide resolved

trianglesphere added 2 commits March 31, 2023 08:26

txmgr: Restructure internals

cd2975c

txmgr: Fake sendState clock in tests

18524db

trianglesphere force-pushed the jg/tx_lifecycle_mgmt branch from 90c5955 to 18524db Compare March 31, 2023 15:32

sebastianst approved these changes Mar 31, 2023

View reviewed changes

trianglesphere requested a review from mslipper March 31, 2023 17:25

mslipper approved these changes Mar 31, 2023

View reviewed changes

Merge branch 'develop' into jg/tx_lifecycle_mgmt

5c62840

mergify bot added the on-merge-train label Mar 31, 2023

mergify bot removed the on-merge-train label Mar 31, 2023

Merge branch 'develop' into jg/tx_lifecycle_mgmt

9afb543

trianglesphere merged commit f0676c1 into develop Mar 31, 2023

trianglesphere deleted the jg/tx_lifecycle_mgmt branch March 31, 2023 18:46

mergify bot added on-merge-train and removed on-merge-train labels Mar 31, 2023

seolaoh mentioned this pull request Apr 24, 2023

feat(batcher): apply op-batcher v1.0.3 to Kroma kroma-network/kroma#39

Merged

Conversation

trianglesphere commented Mar 29, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

changeset-bot bot commented Mar 29, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

⚠️ No Changeset found

Uh oh!

netlify bot commented Mar 29, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Deploy Preview for opstack-docs canceled.

Uh oh!

semgrep-app bot commented Mar 29, 2023

Uh oh!

Uh oh!

Uh oh!

codecov bot commented Mar 29, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

ajsutton left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sebastianst left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

trianglesphere commented Mar 30, 2023

Uh oh!

ajsutton commented Mar 30, 2023

Uh oh!

sebastianst commented Mar 31, 2023

Uh oh!

sebastianst left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

trianglesphere commented Mar 31, 2023

Uh oh!

mergify bot commented Mar 31, 2023

Uh oh!

mergify bot commented Mar 31, 2023

Uh oh!

mergify bot commented Mar 31, 2023

Uh oh!

trianglesphere commented Mar 31, 2023

Uh oh!

mergify bot commented Mar 31, 2023

✅ Pull request refreshed

Uh oh!

mergify bot commented Mar 31, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

trianglesphere commented Mar 29, 2023 •

edited

Loading

changeset-bot bot commented Mar 29, 2023 •

edited

Loading

netlify bot commented Mar 29, 2023 •

edited

Loading

codecov bot commented Mar 29, 2023 •

edited

Loading