429 (TooManyRequests) Responses from ClearML #492
Comments
We could try spitting out the header as well:
Oh, good point!
It looks like we are making extra calls to
It's happened again since you made that change. Here's the log:
So yes, no
That's true, and an easy fix. Do you think that's sufficient to give us these errors, though?
We should remove the extra calls and increase the polling timeout a bit. Hopefully that will be enough to fix this issue.
I wonder: did you see the messages from IDX on the ClearML channel? It could be that they are the ones causing the issue. Our request may come in right when they are in the middle of DoS'ing the server, and we get the "TooManyRequests" error. It shouldn't happen because we are using different user creds than IDX, but it could still be the case. Either way, we should handle it gracefully.
I don't think I'm on that channel. Could you add me or point me there? Also, @johnml1135, unless you're already actively working on this one, I think I'll go ahead and do this while I'm waiting for review. We've been getting a lot of these errors.
Go for it.
Waiting to see whether the errors stop appearing once PR #504 is on QA.
In recent weeks, we've seen exceptions thrown from the ClearMLMonitorService regarding a failure to parse responses from ClearML. John added improved logging and the issue has recurred, so I peeked at the logs, and the unexpected responses are 429s. Do we need to poll less frequently and/or change our HTTP client retry strategy? The response we're getting from ClearML does not seem to specify a Retry-After header, unfortunately. I haven't yet found any information regarding the rate limiting on ClearML; maybe it's worth reaching out to them?
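For illustration, here is a rough sketch of one possible retry strategy for 429 responses. It is written in Python purely as an example and is not the service's actual code. The names `retry_delay` and `get_with_retry`, the `(status, headers, body)` response shape, and the default base delay, cap, and attempt count are all hypothetical. The idea is to honor Retry-After when the server sends it and otherwise fall back to exponential backoff with jitter, which seems relevant since ClearML does not appear to send that header.

```python
import random
import time
from typing import Callable, Mapping, Tuple

# Hypothetical response shape for this sketch: (status_code, headers, body).
Response = Tuple[int, Mapping[str, str], str]


def retry_delay(headers: Mapping[str, str], attempt: int,
                base: float = 1.0, cap: float = 60.0,
                jitter: Callable[[], float] = random.random) -> float:
    """Return the seconds to wait before retry number `attempt` (0-based)."""
    # Note: a plain dict lookup is case-sensitive; a real HTTP client
    # usually exposes case-insensitive headers.
    retry_after = headers.get("Retry-After")
    if retry_after is not None:
        try:
            # Honor the header when it is a number of seconds.
            return min(cap, max(0.0, float(retry_after)))
        except ValueError:
            pass  # The HTTP-date form is not handled in this sketch.
    # No usable header: exponential backoff with full jitter, capped.
    return jitter() * min(cap, base * (2 ** attempt))


def get_with_retry(fetch: Callable[[], Response], max_attempts: int = 5,
                   sleep: Callable[[float], None] = time.sleep) -> Response:
    """Call `fetch` until it returns a non-429 status or attempts run out."""
    response = fetch()
    for attempt in range(max_attempts - 1):
        status, headers, _ = response
        if status != 429:
            break
        sleep(retry_delay(headers, attempt))
        response = fetch()
    # May still be a 429 if every attempt was rate limited.
    return response
```

The jitter is there so that several pollers hitting the limit at the same moment do not all retry in lockstep. Whether this would be enough on its own, without also polling less often, is an open question.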