You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
What is the underlying problem you're trying to solve?
We have logs that can help identify context deadline exceeded and context cancel events. But from monitoring and alerting perspective there is no metric today (referring to this list).
Describe the ideal solution
I think it would be good to have metrics dedicated to context cancels, along with a reason tag maybe (check request timed out, http send timed out, context cancelled during X eval, etc)
Describe a "Good Enough" solution
We could skip having a reason tag in the short term, but a basic metric would definitely be helpful
Additional Context
N/A
The text was updated successfully, but these errors were encountered:
The opa-envoy plugin has the option to include performance metrics via prometheus. We could add a counter in there for this. These metrics are then surfaced via the Status API.
I see a TODO here, is this why the error is not propagated to envoy plugin where ideally the complete decision log (including input) should be logged? Any pointers so I can understand this issue better would be helpful. Thanks!
The code you're referred to is old. OPA uses the main branch not master. That code has been removed and we maintain metrics for errors in the decision log plugin. For the specific error in your log, there is currently no counter to track it. So we can add one or like I mentioned previously you can add a counter in the plugin itself.
What is the underlying problem you're trying to solve?
We have logs that can help identify context deadline exceeded and context cancel events. But from monitoring and alerting perspective there is no metric today (referring to this list).
Describe the ideal solution
I think it would be good to have metrics dedicated to context cancels, along with a reason tag maybe (check request timed out, http send timed out, context cancelled during X eval, etc)
Describe a "Good Enough" solution
We could skip having a reason tag in the short term, but a basic metric would definitely be helpful
Additional Context
N/A
The text was updated successfully, but these errors were encountered: