@@ -179,11 +179,11 @@ flag to expose these alpha stability metrics.

### kubelet Pressure Stall Information (PSI) metrics

{{< feature-state for_k8s_version="v1.33" state="alpha" >}}
{{< feature-state for_k8s_version="v1.34" state="beta" >}}

As an alpha feature, Kubernetes lets you configure kubelet to collect Linux kernel
As a beta feature, Kubernetes lets you configure kubelet to collect Linux kernel
[Pressure Stall Information](https://docs.kernel.org/accounting/psi.html)
(PSI) for CPU, memory and IO usage.
(PSI) for CPU, memory and I/O usage.
The information is collected at node, pod and container level.
The metrics are exposed at the `/metrics/cadvisor` endpoint with the following names:

@@ -196,10 +196,11 @@
container_pressure_io_stalled_seconds_total
container_pressure_io_waiting_seconds_total
```

-You must enable the `KubeletPSI` [feature gate](/docs/reference/command-line-tools-reference/feature-gates/)
-to use this feature. The information is also exposed in the
+This feature is enabled by default through the `KubeletPSI` [feature gate](/docs/reference/command-line-tools-reference/feature-gates/). The information is also exposed in the
[Summary API](/docs/reference/instrumentation/node-metrics#psi).

+You can learn how to interpret the PSI metrics in [Understand PSI Metrics](/docs/reference/instrumentation/understand-psi-metrics/).

#### Requirements

Pressure Stall Information requires:
@@ -9,5 +9,9 @@ stages:
  - stage: alpha
    defaultValue: false
    fromVersion: "1.33"
+    toVersion: "1.33"
+  - stage: beta
+    defaultValue: true
+    fromVersion: "1.34"
---
Enable kubelet to surface Pressure Stall Information (PSI) metrics in the Summary API and Prometheus metrics.
11 changes: 6 additions & 5 deletions content/en/docs/reference/instrumentation/node-metrics.md
@@ -45,16 +45,17 @@ the kubelet [fetches Pod- and container-level metric data using CRI](/docs/refer

## Pressure Stall Information (PSI) {#psi}

{{< feature-state for_k8s_version="v1.33" state="alpha" >}}
{{< feature-state for_k8s_version="v1.34" state="beta" >}}

As an alpha feature, Kubernetes lets you configure kubelet to collect Linux kernel
As a beta feature, Kubernetes lets you configure kubelet to collect Linux kernel
[Pressure Stall Information](https://docs.kernel.org/accounting/psi.html)
(PSI) for CPU, memory and IO usage. The information is collected at node, pod and container level.
(PSI) for CPU, memory, and I/O usage. The information is collected at node, pod and container level.
See [Summary API](/docs/reference/config-api/kubelet-stats.v1alpha1/) for detailed schema.
You must enable the `KubeletPSI` [feature gate](/docs/reference/command-line-tools-reference/feature-gates/)
to use this feature. The information is also exposed in
This feature is enabled by default, by setting the `KubeletPSI` [feature gate](/docs/reference/command-line-tools-reference/feature-gates/). The information is also exposed in
[Prometheus metrics](/docs/concepts/cluster-administration/system-metrics#psi-metrics).

You can learn how to interpret the PSI metrics in [Understand PSI Metrics](/docs/reference/instrumentation/understand-psi-metrics/).

### Requirements

Pressure Stall Information requires:
203 changes: 203 additions & 0 deletions content/en/docs/reference/instrumentation/understand-psi-metrics.md
@@ -0,0 +1,203 @@
---
title: Understand Pressure Stall Information (PSI) Metrics
content_type: reference
weight: 50
description: >-
  Detailed explanation of Pressure Stall Information (PSI) metrics and how to use them to identify resource pressure in Kubernetes.
---

<!-- overview -->

{{< feature-state for_k8s_version="v1.34" state="beta" >}}

As a beta feature, Kubernetes lets you configure the kubelet to collect Linux kernel
[Pressure Stall Information](https://docs.kernel.org/accounting/psi.html)
(PSI) for CPU, memory, and I/O usage. The information is collected at the node, pod, and container level.
This feature is enabled by default through the `KubeletPSI` [feature gate](/docs/reference/command-line-tools-reference/feature-gates/).
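
Because the gate is on by default, no extra configuration is needed to collect PSI data. If you want to turn it off, you can disable the gate, for example in the kubelet configuration file. The following is a minimal sketch showing only the relevant field; merge it into your existing `KubeletConfiguration` rather than replacing it:

```yaml
apiVersion: kubelet.config.k8s.io/v1beta1
kind: KubeletConfiguration
featureGates:
  # Set to false to stop the kubelet from collecting and exposing PSI data
  KubeletPSI: false
```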

PSI metrics are exposed through two different sources:
- The kubelet's [Summary API](/docs/reference/config-api/kubelet-stats.v1alpha1/), which provides PSI data at the node, pod, and container level.
- The `/metrics/cadvisor` endpoint on the kubelet, which exposes PSI metrics in the [Prometheus format](/docs/concepts/cluster-administration/system-metrics#psi-metrics).

### Requirements

Pressure Stall Information requires:

- [Linux kernel versions 4.20 or later](/docs/reference/node/kernel-version-requirements#requirements-psi).
- [cgroup v2](/docs/concepts/architecture/cgroups)
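
You can verify both requirements directly on a node. This is a quick sketch that assumes shell access to the host and the default cgroup mount point:

```shell
# On a kernel built with PSI support, this file exists and shows system-wide CPU pressure
cat /proc/pressure/cpu

# On a node using cgroup v2, this prints "cgroup2fs"
stat -fc %T /sys/fs/cgroup
```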

<!-- body -->

## Understanding PSI Metrics

Pressure Stall Information (PSI) metrics are provided for three resources: CPU, memory, and I/O. They are categorized into two main types of pressure: `some` and `full`.

* **`some`**: This value indicates that some tasks (one or more) are stalled on a resource. For example, if some tasks are waiting for I/O, this metric will increase. This can be an early indicator of resource contention.
* **`full`**: This value indicates that *all* non-idle tasks are stalled on a resource simultaneously. This signifies a more severe resource shortage, where the entire system is unable to make progress.

Each pressure type provides four metrics: `avg10`, `avg60`, `avg300`, and `total`. The `avg` values represent the percentage of wall-clock time that tasks were stalled, as moving averages over the last 10 seconds, 60 seconds, and 300 seconds respectively. The `total` value is a cumulative counter in microseconds showing the total time tasks have been stalled.
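
These values come from the kernel's PSI accounting. For reference, the pressure files that the kubelet reads on a cgroup v2 node (for example, a cgroup's `cpu.pressure` file) use the following format; the numbers here are illustrative:

```
some avg10=1.53 avg60=0.87 avg300=0.22 total=1234567
full avg10=0.00 avg60=0.00 avg300=0.00 total=56789
```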

## Example Scenarios

You can use a simple Pod with a stress-testing tool to generate resource pressure and observe the PSI metrics. The following examples use the `agnhost` container image, which includes the `stress` tool.

### Generating CPU Pressure

Create a Pod that generates CPU pressure using the `stress` utility. This workload will put a heavy load on one CPU core.

Create a file named `cpu-pressure-pod.yaml`:
```yaml
apiVersion: v1
kind: Pod
metadata:
  name: cpu-pressure-pod
spec:
  restartPolicy: Never
  containers:
  - name: cpu-stress
    image: registry.k8s.io/e2e-test-images/agnhost:2.47
    args:
    - "stress"
    - "--cpus"
    - "1"
```

Apply it to your cluster: `kubectl apply -f cpu-pressure-pod.yaml`

#### Observing CPU Pressure

After the Pod is running, you can observe the CPU pressure through either the Summary API or the Prometheus metrics endpoint.

**Using the Summary API:**

Watch the summary stats for your node. In a separate terminal, run:
```shell
# Replace <node-name> with the name of a node in your cluster
kubectl get --raw "/api/v1/nodes/<node-name>/proxy/stats/summary" | jq '.pods[] | select(.podRef.name | contains("cpu-pressure-pod"))'
```
You will see the `some` PSI metrics for CPU increase in the summary API output. The `avg10` value for `some` pressure should rise above zero, indicating that tasks are spending time stalled on the CPU.
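
An illustrative excerpt of the Pod's `cpu` stats is shown below. The values are made up, and the exact field names are defined by the Summary API schema linked above:

```json
"cpu": {
  "psi": {
    "some": { "avg10": 12.5, "avg60": 8.4, "avg300": 2.1, "total": 48151623 },
    "full": { "avg10": 0, "avg60": 0, "avg300": 0, "total": 0 }
  }
}
```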

**Using the Prometheus metrics endpoint:**

Query the `/metrics/cadvisor` endpoint to see the `container_pressure_cpu_waiting_seconds_total` metric.
```shell
# Replace <node-name> with the name of the node where the pod is running
kubectl get --raw "/api/v1/nodes/<node-name>/proxy/metrics/cadvisor" | \
  grep 'container_pressure_cpu_waiting_seconds_total' | grep 'pod="cpu-pressure-pod"'
```
The output should show an increasing value, indicating that the container is spending time stalled waiting for CPU resources.
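
Illustrative output (the label set is abbreviated and the value will differ in your cluster):

```
container_pressure_cpu_waiting_seconds_total{container="cpu-stress",namespace="default",pod="cpu-pressure-pod", ...} 42.17
```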

#### Cleanup

Clean up the Pod when you are finished:
```shell
kubectl delete pod cpu-pressure-pod
```

### Generating Memory Pressure

This example creates a Pod that continuously writes to files in the container's writable layer, causing the kernel's page cache to grow and forcing memory reclamation, which generates pressure.

Create a file named `memory-pressure-pod.yaml`:
```yaml
apiVersion: v1
kind: Pod
metadata:
  name: memory-pressure-pod
spec:
  restartPolicy: Never
  containers:
  - name: memory-stress
    image: registry.k8s.io/e2e-test-images/agnhost:2.47
    command: ["/bin/sh", "-c"]
    args:
    - "i=0; while true; do dd if=/dev/zero of=testfile.$i bs=1M count=50 >/dev/null 2>&1; i=$(((i+1)%5)); sleep 0.1; done"
    resources:
      limits:
        memory: "200M"
      requests:
        memory: "200M"
```

Apply it to your cluster: `kubectl apply -f memory-pressure-pod.yaml`

#### Observing Memory Pressure

**Using the Summary API:**

In the summary output, you will observe an increase in the `full` PSI metrics for memory, indicating that the system is under significant memory pressure.
```shell
# Replace <node-name> with the name of a node in your cluster
kubectl get --raw "/api/v1/nodes/<node-name>/proxy/stats/summary" | jq '.pods[] | select(.podRef.name | contains("memory-pressure-pod"))'
```
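
To narrow the output to just the Pod's memory pressure data, you can extend the `jq` filter. This sketch assumes the pod-level `memory` stats expose a `psi` field, as described in the Summary API schema:

```shell
# Replace <node-name> with the name of a node in your cluster
kubectl get --raw "/api/v1/nodes/<node-name>/proxy/stats/summary" | \
  jq '.pods[] | select(.podRef.name == "memory-pressure-pod") | .memory.psi'
```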

**Using the Prometheus metrics endpoint:**

Query the `/metrics/cadvisor` endpoint to see the `container_pressure_memory_waiting_seconds_total` metric.
```shell
# Replace <node-name> with the name of the node where the pod is running
kubectl get --raw "/api/v1/nodes/<node-name>/proxy/metrics/cadvisor" | \
  grep 'container_pressure_memory_waiting_seconds_total' | grep 'pod="memory-pressure-pod"'
```
In the output, you will observe an increasing value for the metric, indicating that the system is under significant memory pressure.

#### Cleanup

Clean up the Pod when you are finished:
```shell
kubectl delete pod memory-pressure-pod
```

### Generating I/O Pressure

This Pod generates I/O pressure by repeatedly writing a file to disk and using `sync` to flush the data from memory, which creates I/O stalls.

Create a file named `io-pressure-pod.yaml`:
```yaml
apiVersion: v1
kind: Pod
metadata:
  name: io-pressure-pod
spec:
  restartPolicy: Never
  containers:
  - name: io-stress
    image: registry.k8s.io/e2e-test-images/agnhost:2.47
    command: ["/bin/sh", "-c"]
    args:
    - "while true; do dd if=/dev/zero of=testfile bs=1M count=128 >/dev/null 2>&1; sync; rm testfile >/dev/null 2>&1; done"
```

Apply this to your cluster: `kubectl apply -f io-pressure-pod.yaml`

#### Observing I/O Pressure

**Using the Summary API:**

You will see the `some` PSI metrics for I/O increase as the Pod continuously writes to disk.
```shell
# Replace <node-name> with the name of a node in your cluster
kubectl get --raw "/api/v1/nodes/<node-name>/proxy/stats/summary" | jq '.pods[] | select(.podRef.name | contains("io-pressure-pod"))'
```

**Using the Prometheus metrics endpoint:**

Query the `/metrics/cadvisor` endpoint to see the `container_pressure_io_waiting_seconds_total` metric.
```shell
# Replace <node-name> with the name of the node where the pod is running
kubectl get --raw "/api/v1/nodes/<node-name>/proxy/metrics/cadvisor" | \
  grep 'container_pressure_io_waiting_seconds_total' | grep 'pod="io-pressure-pod"'
```
You will see the metric's value increase as the Pod continuously writes to disk.
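
Because the `*_total` counters only ever increase, their rate of change is usually more informative than the raw value. If you scrape the kubelet's cAdvisor endpoint with Prometheus, a query along these lines (adjust the label selectors to your environment) gives the fraction of recent time the container spent stalled on I/O:

```
rate(container_pressure_io_waiting_seconds_total{pod="io-pressure-pod"}[5m])
```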

#### Cleanup

Clean up the Pod when you are finished:
```shell
kubectl delete pod io-pressure-pod
```

## {{% heading "whatsnext" %}}

The task pages for [Troubleshooting Clusters](/docs/tasks/debug/debug-cluster/) discuss
how to use a metrics pipeline that relies on this data.
2 changes: 2 additions & 0 deletions content/en/docs/reference/node/_index.md
@@ -25,3 +25,5 @@ Kubernetes documentation, including:
* [Node Metrics Data](/docs/reference/instrumentation/node-metrics).

* [CRI Pod & Container Metrics](/docs/reference/instrumentation/cri-pod-container-metrics).

+* [Understand Pressure Stall Information (PSI) Metrics](/docs/reference/instrumentation/understand-psi-metrics).