Scheduled_duration metric is rarely sent, and with too many tags #45285
Labels
area:core
area:metrics
kind:bug
This is a clearly a bug
needs-triage
label for new issues that we didn't triage yet
Apache Airflow version
2.10.4
If "Other Airflow 2 version" selected, which one?
No response
What happened?
Hello!
We are running Airflow 2.10.4 on EKS version 1.30.6.
I set up statsd metric collection as described here.
Using datadog agent running as a daemonset on each node to collect the metrics.
I am able to receive and search all metrics successfully in datadog, but task.scheduled_duration is very rarely sent, and when it's sent the numbers and tagging don't make sense to me.
For example, over the last week I only have a single datapoint at 3.26k, and it seems like it's tagged with several task_ids and dag_ids.
What you think should happen instead?
I expect to receive the metric every time a task is scheduled, and tagged correctly with only the relevant task_id and dag_id.
How to reproduce
Deployment details listed below, please let me know if there are any other missing information that might be relevant.
Operating System
Debian GNU/Linux 12 (bookworm)
Versions of Apache Airflow Providers
Deployment
Official Apache Airflow Helm Chart
Deployment details
Airflow 2.10.4 on EKS 1.30, using official helm chart version 1.13.1.
Datadog version -
datadog/agent:7.56.2
Airflow metrics configuration:
Anything else?
Thank you, and please let me know if there are any additional details I can provide to help triage or reproduce this issue.
Are you willing to submit PR?
Code of Conduct
The text was updated successfully, but these errors were encountered: