node_namespace_pod_container:container_cpu_usage_seconds_total:sum_rate #1056

xiaozhangzhang1 · 2021-03-24T09:01:55Z

record: node_namespace_pod_container:container_cpu_usage_seconds_total:sum_rate
expr: sum by(cluster, namespace, pod, container) (rate(container_cpu_usage_seconds_total{container!="POD",image!="",job="kubelet",metrics_path="/metrics/cadvisor"}[5m])) * on(cluster, namespace, pod) group_left(node) topk by(cluster, namespace, pod) (1, max by(cluster, namespace, pod, node) (kube_pod_info{node!=""}))

this record rule is not work in promethues ,if i change on(cluster, namespace, pod) is on( namespace, pod),it works

Prometheus Operator version:
release-0.6
Kubernetes version information:

kubectl version
Client Version: version.Info{Major:"1", Minor:"18", GitVersion:"v1.18.14", GitCommit:"89182bdd065fbcaffefec691908a739d161efc03", GitTreeState:"clean", BuildDate:"2020-12-18T12:11:25Z", GoVersion:"go1.13.15", Compiler:"gc", Platform:"linux/amd64"}
Server Version: version.Info{Major:"1", Minor:"18", GitVersion:"v1.18.14", GitCommit:"89182bdd065fbcaffefec691908a739d161efc03", GitTreeState:"clean", BuildDate:"2020-12-18T12:02:35Z", GoVersion:"go1.13.15", Compiler:"gc", Platform:"linux/amd64"}

The text was updated successfully, but these errors were encountered:

paulfantom · 2021-03-24T09:14:48Z

It looks like you have a cluster label in one metric but not in the other. Does container_cpu_usage_seconds_total{container!="POD",image!="",job="kubelet",metrics_path="/metrics/cadvisor", cluster!=""} and kube_pod_info{node!="",cluster!=""} return any output?

xiaozhangzhang1 · 2021-03-24T09:24:01Z

It looks like you have a cluster label in one metric but not in the other. Does container_cpu_usage_seconds_total{container!="POD",image!="",job="kubelet",metrics_path="/metrics/cadvisor", cluster!=""} and kube_pod_info{node!="",cluster!=""} return any output?

i did, return no data

xiaozhangzhang1 · 2021-03-24T09:30:49Z

It looks like you have a cluster label in one metric but not in the other. Does container_cpu_usage_seconds_total{container!="POD",image!="",job="kubelet",metrics_path="/metrics/cadvisor", cluster!=""} and kube_pod_info{node!="",cluster!=""} return any output?

kube_pod_info{node!="",cluster!=""} return many
kube_pod_info{cluster="",container="kube-rbac-proxy-main",created_by_kind="",created_by_name="",host_ip="",instance="",job="kube-state-metrics",namespace="default",node="master01",pod="netshoot",pod_ip="",uid="ef6d61ac-fed4-4ee3-b757-de912a6863fb"}

xiaozhangzhang1 · 2021-03-24T09:31:16Z

container_cpu_usage_seconds_total{container!="POD",image!="",job="kubelet",metrics_path="/metrics/cadvisor", cluster!=""}
return no data

paulfantom · 2021-03-24T09:43:23Z

It seems that your cluster is not configured correctly and you have cluster label attached to metrics from kube-state-metrics, but not to metrics from kubelet. You need to have it in both places.

xiaozhangzhang1 · 2021-03-24T09:55:32Z

It seems that your cluster is not configured correctly and you have cluster label attached to metrics from kube-state-metrics, but not to metrics from kubelet. You need to have it in both places.

thanks ,i got it ,yes ,i did cluster label to kube-state-metrics, i did the same to kubelet, but it not works ,

ArchiFleKs · 2021-07-20T17:37:18Z

I have the same issue where CPU usage is not working anymore on grafana

rouja · 2021-10-05T17:32:36Z

Hi,

I'm not sure but I think this issue was fixed in :

78a4677

It seems that the record node_namespace_pod_container:container_cpu_usage_seconds_total:sum_rate was replaced by node_namespace_pod_container:container_cpu_usage_seconds_total:sum_irate

SonalJain1707 · 2022-11-16T09:25:57Z

I am also facing same issue

sum(node_namespace_pod_container:container_cpu_usage_seconds_total:sum_irate{cluster="$cluster", namespace="$namespace"}) / sum(kube_pod_container_resource_requests{job="kube-state-metrics", cluster="$cluster", namespace="$namespace", resource="cpu"})

Does not return data

rmn-lux · 2022-12-12T13:15:21Z

+1 the same thing

I am also facing same issue

sum(node_namespace_pod_container:container_cpu_usage_seconds_total:sum_irate{cluster="$cluster", namespace="$namespace"}) / sum(kube_pod_container_resource_requests{job="kube-state-metrics", cluster="$cluster", namespace="$namespace", resource="cpu"})

Does not return data

jeremydescamps · 2022-12-22T09:24:33Z

When querying node_namespace_pod_container:container_cpu_usage_seconds_total:sum_irate on my prometheus instance, that does not return anything.
From what exporter this metric come from ?

fguiet · 2022-12-22T09:49:01Z

Hi there,

Prometheus stack chart : kube-prometheus-stack-43.1.1, App Version: 0.61.1
K8s deployed with Rancher Docker version 2.7 : 1.24.4

To me, it was related to this issue : k3s-io/k3s#5782
As mentioned in the issue, image label is now missing.

Workaround : I removed the image!="" label in all the rules from prometheus-stack-kube-prom-k8s.rules.yaml file and now my grafana dashboard work like a charm

# Extract from file : prometheus-stack-kube-prom-k8s.rules.yaml
- name: k8s.rules
      rules:
        - expr: >-
            sum by (cluster, namespace, pod, container) (
              irate(container_cpu_usage_seconds_total{job="kubelet", metrics_path="/metrics/cadvisor", image!=""}[5m])
            ) * on (cluster, namespace, pod) group_left(node) topk by (cluster,
            namespace, pod) (
              1, max by(cluster, namespace, pod, node) (kube_pod_info{node!=""})
            )
          record: >-
            node_namespace_pod_container:container_cpu_usage_seconds_total:sum_irate

This file has to be modified as well : prometheus-stack-kube-prom-k8s-resources-workload.yaml
Remove : container!="" and image!=""
Don't forget to kill pod : prometheus-stack-grafana so dashboards get updated !

github-actions · 2023-02-21T03:40:38Z

This issue has been automatically marked as stale because it has not had any activity in the last 60 days. Thank you for your contributions.

bmgante · 2023-03-17T18:30:09Z

Hi @fguiet
Got stuck in this problem as well, i am using latest helm chart version.
I am using minikube v1.28.0.

I've already removed the label image!="" from k8s.rules and cpu dashboards started working.

However, i still have issues for memory dashboard which basically use metric container_memory_working_set_bytes.

Example:
sum(container_memory_working_set_bytes{job="kubelet", metrics_path="/metrics/cadvisor", cluster="$cluster", namespace="$namespace", pod="$pod", container!="", image!=""}) by (container)

From prometheus, there is no cluster, container or image labels for this metric. Did you face this issue as well and if yes, how did you fix it?

Similar issue with dashboards using fs metrics (and probably a lot of other metrics from cadvisor):
sum by(container) (rate(container_fs_reads_total{job="kubelet", metrics_path="/metrics/cadvisor", device=~"(/dev/)?(mmcblk.p.+|nvme.+|rbd.+|sd.+|vd.+|xvd.+|dm-.+|md.+|dasd.+)", container!="", cluster="$cluster", namespace="$namespace", pod="$pod"}[$__rate_interval]))

Thanks

sirajkrm · 2023-03-27T13:57:54Z

FWIW this has been addressed in later versions of Rancher Server 2.6.11 along upgrading k8s to 1.24.10-rancher4-1

anthosz · 2023-05-25T07:29:00Z

When querying node_namespace_pod_container:container_cpu_usage_seconds_total:sum_irate on my prometheus instance, that does not return anything. From what exporter this metric come from ?

Same for me, did you found it finally?

Using the last helm chart of prometheus stack

zuchka · 2023-07-12T22:54:05Z

I'm hitting this as well. any ideas?

jpiazza35 · 2023-07-20T15:03:21Z

same issue here

DhruvPatel2647 · 2023-08-01T08:20:39Z

if you have included this in the values of prometheus :

before: kubelet:
serviceMonitor:
https: false

After (This works):
kubelet:
serviceMonitor:
https: true
because Kubelet is responsible for that metrics.

for me I have disabled http in service-monitor for kubelt then I research it and foud that kublelt hhtp shoulbe enabled that is http:true

gustavofbreunig · 2023-09-06T14:48:12Z

I'm facing this same issue, reported here.

Removed container!="" and image!="" from prometheus-stack-kube-prom-k8s.rules.yaml worked.

mohamadkhani · 2023-09-08T11:23:23Z

Thanks to @gustavofbreunig.

Removing all image!="" from charts/kube-prometheus-stack/templates/prometheus/rules-1.14/k8s.rules.yaml file, fixed my problem too.

This is my fork if any one wants to check it.

gustavofbreunig · 2023-09-08T14:28:56Z

Related issue: google/cadvisor#3336

gustavofbreunig · 2023-09-08T15:25:10Z

It was a rancher issue, corrected on v1.24.10-rancher4-1

rancher/rancher#38934

proceed to close the issue

nathanmcgarvey-modopayments · 2023-09-08T23:06:16Z

Related issue: google/cadvisor#3336

Also related details if you are on Docker-Desktop: docker/for-mac#6969

Edit: ....or potentially just using the docker driver for minikube or Docker Desktop or really anything that involves Docker.

github-actions · 2023-11-08T03:37:35Z

This issue has been automatically marked as stale because it has not had any activity in the last 60 days. Thank you for your contributions.

github-actions · 2024-03-08T03:37:27Z

This issue was closed because it has not had any activity in the last 120 days. Please reopen if you feel this is still valid.

xiaozhangzhang1 added the kind/bug label Mar 24, 2021

paulfantom added kind/support and removed kind/bug labels Mar 24, 2021

chrisho mentioned this issue Sep 9, 2022

KubeVirtComponentExceedsRequestedCPU/Memory are not working properly kubevirt/kubevirt#8439

Closed

konstantin-921 mentioned this issue Dec 23, 2022

[rancher-monitoring] CPU and memory metrics for pods do not work rancher/rancher#43475

Closed

jpicara mentioned this issue Jan 5, 2023

[kube-prometheus-stack] No $cluster variable is being taken from any ServiceMonitor prometheus-community/helm-charts#2887

Closed

github-actions bot added the stale label Feb 21, 2023

chrisho mentioned this issue Feb 28, 2023

[BUG] The alerts KubeVirtComponentExceedsRequestedMemory and KubeVirtComponentExceedsRequestedCPU not working harvester/harvester#3562

Closed

github-actions bot removed the stale label Mar 18, 2023

github-actions bot added the stale label Nov 8, 2023

github-actions bot closed this as not planned Won't fix, can't repro, duplicate, stale Mar 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

node_namespace_pod_container:container_cpu_usage_seconds_total:sum_rate #1056

node_namespace_pod_container:container_cpu_usage_seconds_total:sum_rate #1056

xiaozhangzhang1 commented Mar 24, 2021 •

edited by paulfantom

Loading

paulfantom commented Mar 24, 2021

xiaozhangzhang1 commented Mar 24, 2021

xiaozhangzhang1 commented Mar 24, 2021 •

edited

Loading

xiaozhangzhang1 commented Mar 24, 2021

paulfantom commented Mar 24, 2021

xiaozhangzhang1 commented Mar 24, 2021

ArchiFleKs commented Jul 20, 2021

rouja commented Oct 5, 2021 •

edited

Loading

SonalJain1707 commented Nov 16, 2022 •

edited

Loading

rmn-lux commented Dec 12, 2022

jeremydescamps commented Dec 22, 2022 •

edited

Loading

fguiet commented Dec 22, 2022 •

edited

Loading

github-actions bot commented Feb 21, 2023

bmgante commented Mar 17, 2023 •

edited

Loading

sirajkrm commented Mar 27, 2023

anthosz commented May 25, 2023

zuchka commented Jul 12, 2023

jpiazza35 commented Jul 20, 2023

DhruvPatel2647 commented Aug 1, 2023 •

edited

Loading

gustavofbreunig commented Sep 6, 2023

mohamadkhani commented Sep 8, 2023 •

edited

Loading

gustavofbreunig commented Sep 8, 2023

gustavofbreunig commented Sep 8, 2023

nathanmcgarvey-modopayments commented Sep 8, 2023 •

edited

Loading

github-actions bot commented Nov 8, 2023

github-actions bot commented Mar 8, 2024

node_namespace_pod_container:container_cpu_usage_seconds_total:sum_rate #1056

node_namespace_pod_container:container_cpu_usage_seconds_total:sum_rate #1056

Comments

xiaozhangzhang1 commented Mar 24, 2021 • edited by paulfantom Loading

paulfantom commented Mar 24, 2021

xiaozhangzhang1 commented Mar 24, 2021

xiaozhangzhang1 commented Mar 24, 2021 • edited Loading

xiaozhangzhang1 commented Mar 24, 2021

paulfantom commented Mar 24, 2021

xiaozhangzhang1 commented Mar 24, 2021

ArchiFleKs commented Jul 20, 2021

rouja commented Oct 5, 2021 • edited Loading

SonalJain1707 commented Nov 16, 2022 • edited Loading

rmn-lux commented Dec 12, 2022

jeremydescamps commented Dec 22, 2022 • edited Loading

fguiet commented Dec 22, 2022 • edited Loading

github-actions bot commented Feb 21, 2023

bmgante commented Mar 17, 2023 • edited Loading

sirajkrm commented Mar 27, 2023

anthosz commented May 25, 2023

zuchka commented Jul 12, 2023

jpiazza35 commented Jul 20, 2023

DhruvPatel2647 commented Aug 1, 2023 • edited Loading

gustavofbreunig commented Sep 6, 2023

mohamadkhani commented Sep 8, 2023 • edited Loading

gustavofbreunig commented Sep 8, 2023

gustavofbreunig commented Sep 8, 2023

nathanmcgarvey-modopayments commented Sep 8, 2023 • edited Loading

github-actions bot commented Nov 8, 2023

github-actions bot commented Mar 8, 2024

xiaozhangzhang1 commented Mar 24, 2021 •

edited by paulfantom

Loading

xiaozhangzhang1 commented Mar 24, 2021 •

edited

Loading

rouja commented Oct 5, 2021 •

edited

Loading

SonalJain1707 commented Nov 16, 2022 •

edited

Loading

jeremydescamps commented Dec 22, 2022 •

edited

Loading

fguiet commented Dec 22, 2022 •

edited

Loading

bmgante commented Mar 17, 2023 •

edited

Loading

DhruvPatel2647 commented Aug 1, 2023 •

edited

Loading

mohamadkhani commented Sep 8, 2023 •

edited

Loading

nathanmcgarvey-modopayments commented Sep 8, 2023 •

edited

Loading