
Slowness due to client-side throttling in v0.32.0 #2582

Closed
Ubiquitine opened this issue Mar 4, 2024 · 8 comments
Labels
bug Something isn't working question Further information is requested

Comments

@Ubiquitine

Ubiquitine commented Mar 4, 2024




Describe the bug
After upgrading to v0.32.0, k9s starts responding very slowly when switching resources after a few minutes of running.

To Reproduce
Steps to reproduce the behavior:

  1. Launch k9s in terminal
  2. Switch between resources/namespaces
  3. After several minutes, switching between resources becomes slow, taking up to 10-20 seconds, during which k9s is unresponsive

Historical Documents
The INFO log shows lines like this during the issue:
I0304 18:39:43.076819 114467 request.go:697] Waited for 18.03660912s due to client-side throttling, not priority and fairness, request: GET:https://API_URL/apis/RESOURCES_PATHS

Expected behavior
Switching namespaces and resources should not take more than a couple of seconds (at least it didn't in previous versions).

Versions (please complete the following information):

  • OS: Linux
  • K9s: 0.32.0
  • K8s: v1.24.17-eks-5e0fdde

Additional context
I also tried cleaning the ~/.kube/cache directory, but the issue keeps coming back.
To be fair, I have 120+ namespaces with deployments and jobs in each, but I suspect something changed in the client behavior in 0.32.0 that introduced this throttling and causes the huge delays.
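
For context on where these waits come from: the "Waited for ... due to client-side throttling" message is emitted by client-go's request.go when its client-side token-bucket rate limiter delays a request; when a rest.Config leaves QPS/Burst at zero, client-go falls back to QPS=5 and Burst=10. Below is a minimal sketch of how that limiter is configured, assuming a plain kubeconfig-based client; the loading path and the raised QPS/Burst numbers are illustrative, not k9s's actual code.

// Minimal sketch, not k9s's actual code: where client-go's client-side
// rate limiter lives. With QPS/Burst left at zero, client-go defaults to
// QPS=5 and Burst=10, and any request past the burst is delayed and logged
// by request.go as "Waited for ... due to client-side throttling".
package main

import (
    "k8s.io/client-go/kubernetes"
    "k8s.io/client-go/tools/clientcmd"
)

func main() {
    // Illustrative kubeconfig loading (~/.kube/config).
    cfg, err := clientcmd.BuildConfigFromFlags("", clientcmd.RecommendedHomeFile)
    if err != nil {
        panic(err)
    }

    // Raising these removes the client-side waits; the numbers are example
    // values, not a recommendation from the k9s maintainers.
    cfg.QPS = 50
    cfg.Burst = 100

    client, err := kubernetes.NewForConfig(cfg)
    if err != nil {
        panic(err)
    }
    _ = client // build views, informers, etc. from here
}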

@derailed
Owner

derailed commented Mar 4, 2024

@Ubiquitine Thank you for this report! Yikes, not what I was expecting after a perf improvement pass ;(
Can you add more specifics here in terms of which cluster env (gke, azk, ...) and which resource/namespace suddenly caused the lags? What is your refresh rate set to? Also, how many resources are we expecting when the lags occurred, i.e. nsX/resY? Including logs would help us zero in on this as well. Tx!
NOTE: I had tested with 10k namespaces/pods and did not see any issues...

@derailed derailed added bug Something isn't working question Further information is requested labels Mar 4, 2024
@Ubiquitine
Author

Ubiquitine commented Mar 5, 2024

Hi, more info:
Cluster: EKS v1.24.17-eks-5e0fdde.
21 nodes, 123 namespaces. In most namespaces there are 2-4 pods.
The issue shows up when I try to switch to the pods of some namespace by typing in the console, e.g. pods <namespace>. But pretty much every resource I try to view takes several seconds to display.
This is what I get in the INFO log during the event (see the sketch after the excerpt):

I0305 10:39:51.001098  118588 request.go:697] Waited for 17.867920295s due to client-side throttling, not priority and fairness, request: GET:https://MASKED.gr7.us-east-1.eks.amazonaws.com/apis/ecr.aws.crossplane.io/v1alpha1?timeout=32s
I0305 10:40:01.001092  118588 request.go:697] Waited for 7.86665376s due to client-side throttling, not priority and fairness, request: GET:https://MASKED.gr7.us-east-1.eks.amazonaws.com/apis/monitoring.coreos.com/v1alpha1?timeout=32s
I0305 10:40:11.199175  118588 request.go:697] Waited for 18.064888002s due to client-side throttling, not priority and fairness, request: GET:https://MASKED.gr7.us-east-1.eks.amazonaws.com/apis/keycloak.org/v1alpha1?timeout=32s
......
......
I0305 10:46:48.582541  118588 request.go:697] Waited for 15.250904406s due to client-side throttling, not priority and fairness, request: GET:https://MASKED.gr7.us-east-1.eks.amazonaws.com/apis/acme.cert-manager.io/v1?timeout=32s
I0305 10:46:58.780758  118588 request.go:697] Waited for 5.463363034s due to client-side throttling, not priority and fairness, request: GET:https://MASKED.gr7.us-east-1.eks.amazonaws.com/apis/discovery.k8s.io/v1beta1?timeout=32s
I0305 10:47:08.781924  118588 request.go:697] Waited for 15.464305047s due to client-side throttling, not priority and fairness, request: GET:https://MASKED.gr7.us-east-1.eks.amazonaws.com/apis/route53.aws.crossplane.io/v1alpha1?timeout=32s
I0305 10:47:18.980168  118588 request.go:697] Waited for 5.666558067s due to client-side throttling, not priority and fairness, request: GET:https://MASKED.gr7.us-east-1.eks.amazonaws.com/apis/autoscaling/v2beta1?timeout=32s
......
......
I0305 11:00:56.732437  118778 request.go:697] Waited for 19.045184644s due to client-side throttling, not priority and fairness, request: GET:https://MASKED.gr7.us-east-1.eks.amazonaws.com/apis/vpcresources.k8s.aws/v1alpha1?timeout=32s
I0305 11:01:06.930518  118778 request.go:697] Waited for 9.261881093s due to client-side throttling, not priority and fairness, request: GET:https://MASKED.gr7.us-east-1.eks.amazonaws.com/apis/redshift.aws.crossplane.io/v1alpha1?timeout=32s
I0305 11:01:18.730249  118778 request.go:697] Waited for 1.068109485s due to client-side throttling, not priority and fairness, request: GET:https://MASKED.gr7.us-east-1.eks.amazonaws.com/apis/events.k8s.io/v1beta1?timeout=32s
I0305 11:01:28.730454  118778 request.go:697] Waited for 11.068214575s due to client-side throttling, not priority and fairness, request: GET:https://MASKED.gr7.us-east-1.eks.amazonaws.com/apis/vpcresources.k8s.aws/v1beta1?timeout=32s
......
......
I0305 11:01:38.929292  118778 request.go:697] Waited for 1.263211789s due to client-side throttling, not priority and fairness, request: GET:https://MASKED.gr7.us-east-1.eks.amazonaws.com/apis/apps/v1?timeout=32s
I0305 11:01:48.929449  118778 request.go:697] Waited for 11.263007094s due to client-side throttling, not priority and fairness, request: GET:https://MASKED.gr7.us-east-1.eks.amazonaws.com/apis/notification.aws.crossplane.io/v1alpha1?timeout=32s
I0305 11:01:59.128108  118778 request.go:697] Waited for 1.221652111s due to client-side throttling, not priority and fairness, request: GET:https://MASKED.gr7.us-east-1.eks.amazonaws.com/apis/authentication.k8s.io/v1?timeout=32s
I0305 11:02:09.328875  118778 request.go:697] Waited for 11.422229987s due to client-side throttling, not priority and fairness, request: GET:https://MASKED.gr7.us-east-1.eks.amazonaws.com/apis/identity.aws.crossplane.io/v1beta1?timeout=32s
I0305 11:02:19.528098  118778 request.go:697] Waited for 1.576175443s due to client-side throttling, not priority and fairness, request: GET:https://MASKED.gr7.us-east-1.eks.amazonaws.com/apis/events.k8s.io/v1beta1?timeout=32s
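
The throttled URLs above are all API group/version discovery endpoints: legacy discovery issues one GET per group/version, so a cluster carrying many CRDs (Crossplane, cert-manager, Keycloak, ...) fans out into well over a hundred requests per pass and quickly drains the default client-side budget. Here is a minimal sketch, assuming a plain kubeconfig-based client (not k9s's actual code), that counts how many such endpoints a full discovery pass would have to fetch:

// Minimal sketch, not k9s's actual code: count the group/version discovery
// endpoints on the cluster. ServerGroups itself only fetches the group list;
// resolving the resources of every group/version afterwards issues one GET
// per entry, which is exactly the kind of request being throttled above.
package main

import (
    "fmt"

    "k8s.io/client-go/discovery"
    "k8s.io/client-go/tools/clientcmd"
)

func main() {
    // Illustrative kubeconfig loading (~/.kube/config).
    cfg, err := clientcmd.BuildConfigFromFlags("", clientcmd.RecommendedHomeFile)
    if err != nil {
        panic(err)
    }
    dc, err := discovery.NewDiscoveryClientForConfig(cfg)
    if err != nil {
        panic(err)
    }

    groups, err := dc.ServerGroups()
    if err != nil {
        panic(err)
    }
    total := 0
    for _, g := range groups.Groups {
        total += len(g.Versions)
    }
    fmt.Printf("a full discovery pass would fetch %d group/version endpoints\n", total)
}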

This is my config.yaml

k9s:
  screenDumpDir: /tmp/k9s-screens
  refreshRate: 2
  maxConnRetry: 5
  readOnly: false
  noExitOnCtrlC: false
  ui:
    enableMouse: false
    headless: false
    logoless: true
    crumbsless: true
    noIcons: true
    skin: transparent
    reactive: false
  skipLatestRevCheck: false
  disablePodCounting: false
  shellPod:
    image: busybox:1.35.0
    namespace: default
    limits:
      cpu: 100m
      memory: 100Mi
  imageScans:
    enable: false
    exclusions:
      namespaces: []
      labels: {}
  liveViewAutoRefresh: false
  logger:
    tail: 100
    buffer: 5000
    sinceSeconds: 300
    fullScreen: false
    textWrap: false
    showTime: false
  thresholds:
    cpu:
      critical: 90
      warn: 70
    memory:
      critical: 90
      warn: 70

UPD: more logs

@seanmuth

seanmuth commented Mar 5, 2024

Also experiencing pretty significant lag/slowness on 0.32.1.

My k9s.log doesn't show any client-side throttling errors, but I'm wondering if I'm looking at the right log?

Happy to attach logs/configs to help debug.

Versions (please complete the following information):

OS: macOS 13.4
K9s: 0.32.1
kubectl: 1.24.6
K8s: multiple, seen on v1.26.13-gke.1052000, v1.25.16-eks-77b1e4e, etc

I've seen this slowness across the board. I work at a SaaS company and we have clusters in all three major clouds, with namespaces in the range of 20-200+ and pod counts from 200-5000+.

config.yaml

k9s:
  liveViewAutoRefresh: false
  refreshRate: 2
  maxConnRetry: 5
  ui:
    skin: nightfox-astro
  # enableMouse: false
  # enableImageScan: false
  # headless: false
  # logoless: false
  # crumbsless: false
  readOnly: false
  noExitOnCtrlC: false
  # noIcons: false
  shellPod:
    image: busybox:1.35.0
    namespace: default
    limits:
      cpu: 100m
      memory: 100Mi
  skipLatestRevCheck: false
  logger:
    tail: 200
    buffer: 5000
    sinceSeconds: 60
    textWrap: false
    showTime: false
  thresholds:
    cpu:
      critical: 90
      warn: 70
    memory:
      critical: 90
      warn: 70
  screenDumpDir: /Users/seanmuth/code/k9s/screendumps
  disablePodCounting: false

views.yaml

# $XDG_CONFIG_HOME/k9s/views.yaml
views:
  # alters the pod view column layout
  v1/pods:
    sortColumn: NAME:desc
    columns:
      - NAMESPACE
      - NAME
      - STATUS
      - READY
      - RESTARTS
      - CPU
      - MEM
      - IP
      - NODE
      - AGE

E: add version info

@mythbai

mythbai commented Mar 5, 2024

Just upgraded from 0.2x to 0.32.1 today, and everything is extremely slow. I work for a telco company remotely through VPN. Installed k9s in Windows 10 Enterprise WSL. There are no relevant lines in the k9s.* log files.

derailed added a commit that referenced this issue Mar 5, 2024
derailed added a commit that referenced this issue Mar 6, 2024
@derailed
Owner

derailed commented Mar 6, 2024

Let's see if we're happier on v0.32.2?

@tasszz2k

tasszz2k commented Mar 6, 2024

I'm seeing the same issue.

@Ubiquitine
Author

Ubiquitine commented Mar 6, 2024

I can confirm that the issue seems to be fixed in v0.32.2, at least in my case.

@Mr-DeWitt

Same, using version 0.32.4
