Unable to see container's metrics about external volumes attached (k8s persistent volumes) #1702
The current scope of cAdvisor is restricted to monitoring containers. It provides the kubelet with container metrics only, and has no concept of volumes, pods, or any other higher-level Kubernetes API objects. The kubelet has a built-in cAdvisor, which monitors containers in Kubernetes. The kubelet takes the container metrics provided by cAdvisor, combines them with volume metrics and Kubernetes-specific metadata (e.g. container->pod mappings), and produces the summary API, which is exposed by the kubelet at :10255/stats/summary. The underlying issue you are raising here is that the kubelet does not expose this via prometheus.
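For reference, a quick way to inspect that summary API from a node (a sketch; 10255 is the kubelet's default read-only port and may be disabled or require authentication on some clusters):

```sh
# Dump the volume stats of the first pod from the kubelet's summary API.
# The jq filter is illustrative; the full document contains node, pod,
# container, and volume stats.
curl -s http://localhost:10255/stats/summary | jq '.pods[0].volume'
```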
Hi @dashpole, so you're saying all the relevant information is already available/provided by cAdvisor, but not yet mapped/interpreted by the kubelet's summary implementation?
No, cAdvisor does not provide volume metrics, and has no concept of volumes or pods, which is why there are no volume metrics exposed via prometheus.
Interesting, thanks for the insight @dashpole. Seems like we should maybe work towards exposing the stats API as Prometheus metrics by the kubelet.
Would just adding volume metrics to the kubelet's prometheus endpoint be sufficient?
Hi @dashpole, yes, the information we are looking for is available in :10255/stats/summary, and as you said, it's not translated into any kind of metric (that was actually the issue for me). I don't want to ask for anything out of scope for any component, so my question is more or less: who do you think should be responsible for translating that into a prometheus metric? Your proposal of "just adding volume metrics to the kubelet's prometheus endpoint" is exactly what I was looking for, but I don't know if that should be done by the kubelet or by something else (controller-manager?). I also fully agree that cadvisor should know nothing about k8s concepts like persistent volumes, but from the container's perspective, if a container has a filesystem mounted, I expect prometheus metrics to be exported about that filesystem, which is actually very important to track. Let me know your view, and thanks in advance!
@brancz so would this be escalated with kubernetes/kubernetes then?
@gnufied told me there is something already in the works, maybe he can comment on where we should go next.
@brancz yes I was talking about this proposal - kubernetes/community#855 cc @jingxu97
@gnufied I don't want to hijack that PR so I'll ask here. Do you think it's reasonable to also add all of those metrics as metrics exposed on the kubelet's Prometheus endpoint?
I think it is reasonable. We should also have a discussion on the relationship between kubernetes and prometheus. Seems odd to have some metrics on the cadvisor port, and others on the kubelet port. I think ideally, the prometheus endpoint should mirror the information provided by the kubelet's http endpoints (e.g. the summary API).
I guess my biggest concern/question is that I don't know how prometheus deals with metrics changing. What about a metric (name, format, labels) can change across a release without causing disruption? Prometheus doesn't appear to have versioning.
Anyone please correct me if I'm wrong - but @dashpole prometheus merely reads what's there - and it's entirely up to the prometheus consumers to deal with changed labels/data... The major usages are basically prometheus alert rules and grafana to visualise based on the metrics.
@dashpole the proposal you mentioned will simply expose the storage metrics - and seems to aim to map/provide them at the PVC level.
or e.g. mirror existing metrics like container_fs_*.
@gnufied: I think it should be :10255/metrics, what other endpoint do you see? @dashpole: when you mention the cadvisor port, what port do you mean? I thought that, in terms of metrics, cadvisor/kubelet were going to expose only one set of metrics. If there are 2 endpoints, let me know which is the other one and I will check if the info we are looking for is already available there. As you mention, the summary API has the correct and complete set of information, so someone (I don't know who) needs to put that information into a "metrics" format. But when you talk about the "prometheus" and "kubelet" relationship I don't completely get you.
Regarding metric names and changes, I agree with @hartmut-pq: there won't be any important disruption in prometheus if that happens, just that all components that read data from prometheus will need to be updated to start digging for the new names. The information we are looking for, from the container's point of view, is the same information you are already providing for the other filesystems of the container (size, usage bytes, free bytes, ...) (container_fs_*). This is the info for a container as reported by the stats/summary endpoint:
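For illustration, the per-pod volume entry in the summary API looks roughly like this (field names from the summary API; the values and volume name are invented):

```json
{
  "volume": [
    {
      "time": "2017-09-01T12:00:00Z",
      "name": "my-persistent-volume",
      "capacityBytes": 8415989760,
      "usedBytes": 512335872,
      "availableBytes": 7903653888,
      "inodes": 2048512,
      "inodesUsed": 36,
      "inodesFree": 2048476
    }
  ]
}
```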
And that volume (container_fs) is completely missing in the kubelet metrics endpoint (which I also thought was the cadvisor endpoint).
@eedugon Thanks for your comment and feedback. I am currently working on a feature to expose storage metrics to users, which will address your issue.
@jingxu97 that sounds great! I think this is something we can discuss further in the proposal. Feel free to tag us once you have something ready. 🙂
@jingxu97: just for curiosity... any progress on this issue? Thanks in advance!
@eedugon sorry that I missed your message. Currently you can use the following ways to get PVC metrics:
Please let me know if you have problems or questions. Thanks~!
@eedugon Kubernetes 1.8 exposes the following volume-related metrics for Prometheus, which can be used to monitor PVC disk usage:

- kubelet_volume_stats_capacity_bytes
- kubelet_volume_stats_available_bytes
- kubelet_volume_stats_used_bytes
- kubelet_volume_stats_inodes
- kubelet_volume_stats_inodes_free
- kubelet_volume_stats_inodes_used
Would love to see a dashboard and/or alerting rules with these! 🙂
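As a starting point, a hedged sketch of a Prometheus alerting rule built on these metrics (the threshold, rule name, and annotation are illustrative):

```yaml
groups:
- name: volume-usage
  rules:
  # Fire when a PVC-backed volume has less than 10% of its space left.
  - alert: PersistentVolumeUsageHigh
    expr: |
      kubelet_volume_stats_available_bytes
        / kubelet_volume_stats_capacity_bytes < 0.10
    for: 5m
    annotations:
      summary: "PVC {{ $labels.persistentvolumeclaim }} is almost full"
```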
@tiloso I run k8s 1.8.3 with vsphere volumes, and I do not have these metrics...
PVC volume-related metrics have been introduced by this commit, which seems to be part of Kubernetes >= 1.8.0. Here's a very simple example of how we use them to get an idea of the disk usage:
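A sketch of the kind of query and legend in question (the exact expression and legend format here are illustrative, assuming the kubelet_volume_stats_* names above):

Grafana query:

```
# Percentage of each PVC-backed volume in use
kubelet_volume_stats_used_bytes / kubelet_volume_stats_capacity_bytes * 100
```

Grafana legend:

```
{{ namespace }}/{{ persistentvolumeclaim }}
```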
@tiloso I see, but I do not have metrics with the name kubelet_volume* (not in prometheus, not when curling the kubelet), but I do have PVCs.
@tiloso awesome! I can definitely see very nice kubelet_volume_stats_* metrics now.
I don't see these metrics either... @f0 did you find a solution?
Yes, these metrics were added in kubernetes/kubernetes#51553, which first became available in 1.8.0.
@cyrus-mc that was fixed recently and we have backported it to 1.9: kubernetes/kubernetes#60013. If you are still on 1.9, you should upgrade to the next version with the fix.
Apologies for joining late on this thread... but I am confused by @dashpole's comment. Regardless of how the kubelet collects and aggregates cAdvisor metrics, your explanation does not make it clear to me why the device in question (/dev/xvdbg in the example above) never shows up in the metrics.

I'm running into a similar issue on our k8s clusters, where we need to gather metrics on volumes mounted by each container. Independently of k8s concepts, I expected cAdvisor to report metrics on every device it finds inside the container. I'm not sure the answer provides an explanation. I'm glad to use the kubelet's stats endpoint if needed.

Anyone else still having issues on this? I can't find a way to get volume metrics for our NFS-backed volumes.
@juliohm1978 You need to query the kubelet's metrics endpoint.
I could be wrong. Isn't this the kubelet endpoint?
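A sketch of the kind of check in question (assuming the kubelet's default read-only port 10255, which may be disabled on newer clusters):

```sh
# List the volume-related series the kubelet exposes for Prometheus.
curl -s http://localhost:10255/metrics | grep kubelet_volume_stats
```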
@juliohm1978 you are not seeing those metrics for NFS volumes because the NFS volume plugin does not implement the necessary metric interface. If you are up to it, see the kubernetes/kubernetes#62644 github issue about how to fix it. The reason some of the volume types don't implement the metric interface is that we haven't had a pressing need until now. We welcome any patches for fixing it though.
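For context, a simplified sketch of the interface in question (modeled on MetricsProvider in k8s.io/kubernetes/pkg/volume; the field set here is trimmed for illustration):

```go
package volume

// Metrics holds capacity/usage data for one mounted volume.
// (The real struct uses resource.Quantity fields and also carries
// inode counts; plain int64 is used here to keep the sketch short.)
type Metrics struct {
	Used      int64 // bytes in use on the volume
	Capacity  int64 // total bytes on the volume
	Available int64 // bytes still free
}

// MetricsProvider is what the kubelet calls when building volume stats
// for the summary API and the kubelet_volume_stats_* series. A plugin
// that returns an "unsupported" error here (as the NFS plugin did)
// simply produces no volume metrics.
type MetricsProvider interface {
	GetMetrics() (*Metrics, error)
}
```

Filesystem-backed plugins typically satisfy this by delegating to a statfs-based helper over the volume's mount path, which is the kind of patch the linked issue asks for.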
Excellent. Thank you!
Strange that I am seeing stale metrics for volumes that are no longer attached to the host. I am running 1.8.11. Has anyone seen this issue? Is upgrading to 1.9.latest the only way to solve this?
@tiloso Actually, I can fetch the kubelet_volume_* metrics, but I get multiple values for a single persistent volume on different nodes. I have a cluster on GKE and claim a PV for a pod in namespace X. When I query for kubelet_volume_stats_used_bytes, I get multiple records for the same persistent volume in the same namespace, with different values on different cluster nodes. Do you have any idea, or have you ever faced this issue?
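One hedged workaround for the duplicate series, assuming the standard namespace and persistentvolumeclaim labels on these metrics, is to aggregate the per-node dimension away, e.g.:

```
max by (namespace, persistentvolumeclaim) (kubelet_volume_stats_used_bytes)
```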
Have the same situation: PV+PVC under an nfs deployment + GCE disk.
Do host-path and nfs volumes not implement the metric interface for now? @gnufied
Is it possible to extend
I am running a k8s cluster on digital ocean; is the do-block-storage volume type supported in prometheus monitoring via the published kubelet_volume* metrics? I cannot access these metrics via grafana (Prometheus Operator setup). Running K8S 1.13+.
I don't see these metrics either... with the latest k8s version...
I'm closing this issue, as it is out of the scope of cAdvisor. If there is a specific kubernetes volume type that doesn't have metrics, or a bug with the existing kubelet volume metrics, that would probably be best tracked with an issue in k/k.
For anyone struggling to find an answer like I did: if you use a CSI plugin for persistent volumes and that plugin doesn't expose metrics, you won't find them. For example: kubernetes-sigs/aws-ebs-csi-driver#223
Hello,
We are having a discussion in the "prometheus-operator" project and, based on @brancz's suggestion, we are raising the topic here, as our problem seems related to cadvisor in some way.
(This is the original thread, just in case: prometheus-operator/prometheus-operator#485)
In a kubernetes + prometheus-operator environment we are not able to get any kind of metric about persistent volumes that are attached to containers in pods, and we don't really understand why.
Leaving aside the question of whether Kubernetes itself should provide information about them (PVs are, after all, K8s resources), we are wondering why we can't find any information in the cadvisor/kubelet-level metrics.
For example, a container might show the following disks (from df -h run directly in a shell inside the container):
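A rough reconstruction of that kind of output (the device names match the description below; sizes and mount points are illustrative):

```
Filesystem      Size  Used Avail Use% Mounted on
/dev/xvda1      154G   40G  114G  26% /etc/hosts
/dev/xvdbg       50G   12G   38G  24% /var/lib/kafka
```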
In that output, the "154G" device is the main disk of the container's owner (the k8s node), and we have no problem finding metrics about that disk at all levels.
But the other device (/dev/xvdbg) is actually the persistent volume that is mounted in the container, and there's no trace of it at all in the metrics.
The container_fs_usage_bytes information for that pod/container has only:
container_fs_usage_bytes{container_name="kafka",device="/dev/xvda1",.....}
Which actually represents the physical disk of the k8s node that owns the container.
But there's nothing else about /dev/xvdbg, and that's what we are looking for...
Do you have any suggestion, idea or explanation for this behavior? Are we missing something at the configuration level to make the kubelet report this?
Note 1: At the node level, lsblk shows the disk of the PV:
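Roughly (device names as above; sizes, minor numbers, and mount points are illustrative):

```
NAME    MAJ:MIN RM  SIZE RO TYPE MOUNTPOINT
xvda    202:0    0  154G  0 disk
└─xvda1 202:1    0  154G  0 part /
xvdbg   202:80   0   50G  0 disk /var/lib/kubelet/pods/<pod-uid>/volumes/...
```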
On the node we have cadvisor/kubelet metrics and also node_exporter... none of them are reporting what we are looking for. I suspect we are missing some kind of mapping or configuration somewhere, because the disk is there...
Thanks very much in advance, any help will be appreciated,
Eduardo
(+ @hartmut-pq)