You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Make queued time metric reset when a job gets requeued (#3875)
* Make queued time metric reset when a job gets requeued
Currently the queue duration metric always uses the jobs created time to determine how long it has been in the queue
When jobs gets requeued, the time in state should start from when it was requeued rather than when the job was created
- As it has only been in the queue since its last run terminated
This was producing large "jumps" in queued time as jobs were tried for 15 minutes, requeued and immediately reporting 15 minutes in the queue - despite only being requeued a few seconds ago
It is reasonable we may want to know when these retries are occurring / time they are costing, but I think they should be separate metrics, i.e:
- Retry count
- Total time queued? Total time attempting to start? Something along this line
Signed-off-by: JamesMurkin <[email protected]>
* Include actual logic change
Signed-off-by: JamesMurkin <[email protected]>
---------
Signed-off-by: JamesMurkin <[email protected]>
0 commit comments