Reuse tunnel resolvers instead of creating one per connection attempt by rosstimothy · Pull Request #37566 · gravitational/teleport

rosstimothy · 2024-01-30T21:58:51Z

The `CachingResolver` is backed by a `FnCache` but does not expose a way to close the underlying cache, which leads to the memory leak captured in #37025. Instead of modifying the resolver to allow explicit cleanup, the resolvers are now created once per process rather than once per connection attempt to the cluster. Because the cluster address is read from the config file, it does not change for the lifetime of the process, so a single resolver can be reused safely. The one potential downside of this approach is that, during an outage, the cache may keep returning a stale error until the entry's TTL expires.

Fixes #37025

changelog: Fix memory leak in tbot caused by never closing reverse tunnel address resolvers

public-teleport-github-review-bot · 2024-02-02T17:30:54Z

@rosstimothy See the table below for backport results.

| Branch | Result |
| --- | --- |
| branch/v12 | Failed |
| branch/v13 | Failed |
| branch/v14 | Failed |
| branch/v15 | Create PR |

rosstimothy force-pushed the tross/process_resolver branch from 7089543 to 8749ef3 Compare January 31, 2024 15:28

rosstimothy changed the title ~~Refactor TeleportProcess to reuse a single tunnel resolver~~ Reuse tunnel resolvers instead of creating one per connection attempt Jan 31, 2024

rosstimothy force-pushed the tross/process_resolver branch 2 times, most recently from 46d6684 to b121c99 Compare January 31, 2024 15:55

rosstimothy force-pushed the tross/process_resolver branch from b121c99 to eed9af4 Compare January 31, 2024 16:03

rosstimothy requested a review from strideynet January 31, 2024 16:16

rosstimothy marked this pull request as ready for review January 31, 2024 16:16

github-actions Bot requested a review from rudream January 31, 2024 16:17

github-actions Bot added the machine-id, size/sm, and tctl (Teleport admin tool) labels Jan 31, 2024

strideynet approved these changes Jan 31, 2024

gravitational deleted a comment from github-actions Bot Jan 31, 2024

rosstimothy added backport/branch/v13 labels Jan 31, 2024

rosstimothy requested a review from zmb3 February 1, 2024 14:35

zmb3 approved these changes Feb 2, 2024

public-teleport-github-review-bot Bot removed the request for review from rudream February 2, 2024 16:59

rosstimothy added this pull request to the merge queue Feb 2, 2024

Merged via the queue into master with commit 327c877 Feb 2, 2024

rosstimothy deleted the tross/process_resolver branch February 2, 2024 17:28

rosstimothy mentioned this pull request Feb 2, 2024

[v15] Reuse tunnel resolvers instead of creating one per connection attempt #37718

Merged

rosstimothy mentioned this pull request Feb 2, 2024

[v14] Reuse tunnel resolvers instead of creating one per connection attempt #37719

Merged

rosstimothy mentioned this pull request Feb 2, 2024

[v13] Reuse tunnel resolvers instead of creating one per connection attempt #37723

Merged

rosstimothy merged 1 commit into master from tross/process_resolver
