Datasets not working from one of our servers: getting timeouts connecting to api.ncbi.nlm.nih.gov #449
Comments
Hi @corneliusroemer, Are you still seeing the same error? -Eric
Hi @ericcox1, (I am part of the same team as Cornelius) We are still seeing the same issue which we now have a better repro for:
we get
which is fairly self-explanatory that you think we are hitting you with too many requests. We are actively working to mirror our own copy of the datasets we use to prevent this. We believe we shouldn't have issued many requests from this IP in recent weeks, so if it is possible to clear the ban we'd be grateful. |
Hi @theosanderson, Thanks for this information. There isn't a block at the datasets level but we're going to reach out to another team to get this ban lifted. -Eric
Thanks @ericcox1 and @theosanderson. I can add that the blocking doesn't just affect Loculus servers; it also affects other teams and individuals we work with, some also on Hetzner servers, some working from totally different university subnets.
Hi @corneliusroemer and @theosanderson We're in the process of unblocking some IPs. However, it's important to note that recent attempts to download the entire NCBI database have significantly strained our systems. These actions have not only slowed down our services but have also impacted access for other users. Thank you for your understanding and cooperation. Nuala Nuala O'Leary, Ph.D. |
Thank you @olearyna , For our part we have never made any attempts to download the entire database. We have made repeated requests for a limited set of organisms and have now worked to de-duplicate these to minimize impact on NCBI. All the best, Theo |
Hi @corneliusroemer and @theosanderson, You should now be unblocked. Thanks for your patience. If you're still having any trouble, could you share the URL and your IP address? We'd be happy to help troubleshoot. Nuala |
Thank you @olearyna, but unfortunately we are still seeing this. Our IP is Our command to reproduce is which returns:
Ok, thank you. We're discussing with our systems team. We'll keep you posted. Nuala
Hi @corneliusroemer and @theosanderson, You should be unblocked now. Can you try again and let us know if you're still having problems? Thanks
Apologies that we were so slow to respond. I have just checked and we still appear to be blocked. This is less of a pressing issue for us as we now have workarounds to mirror the relevant organisms for us - but just to let you know. |
As mentioned in #448, we at Loculus (https://github.com/loculus-project/loculus/actions) have consistently been getting errors using datasets since about 2024-02-04, but only on the Hetzner servers we run our previews on. Note: We can't reproduce locally. Our servers make a fair number of requests, so it's possible we've been rate limited or blocked, though without ever getting a proper HTTP 429 status code.
We noticed that from this server, requests to api.ncbi.nlm.nih.gov time out:

I'm just opening this issue for general awareness, and to help other datasets users who might be getting the same error.
I'm aware of the banner on NCBI websites and will email the help desk:
For debugging purposes, our requests should be coming from 65.108.0.35.
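For anyone else trying to work out whether they are hitting a silent block (timeouts) or explicit rate limiting (HTTP 429), here is a minimal Python sketch using only the standard library. The function names `classify` and `probe` are hypothetical, and the URL you pass in is up to you; nothing here is taken from the datasets tool or from NCBI's documentation.

```python
import socket
import urllib.error
import urllib.request


def classify(status=None, error=None):
    """Map a probe outcome (HTTP status or exception) to a short label."""
    if isinstance(error, (socket.timeout, TimeoutError)):
        return "timeout"            # no answer at all: consistent with a silent block
    if error is not None:
        return "connection-error"   # refused, reset, DNS failure, etc.
    if status == 429:
        return "rate-limited"       # server says so explicitly
    if status is not None and 200 <= status < 400:
        return "ok"
    return "http-error"             # any other 4xx/5xx


def probe(url, timeout=10.0):
    """Issue one GET request and classify what came back (hypothetical helper)."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return classify(status=resp.status)
    except urllib.error.HTTPError as exc:
        # HTTPError must be caught before URLError, which is its parent class.
        return classify(status=exc.code)
    except urllib.error.URLError as exc:
        # Connect-time failures are wrapped; the real cause is in .reason.
        reason = exc.reason if isinstance(exc.reason, Exception) else exc
        return classify(error=reason)
    except (socket.timeout, TimeoutError) as exc:
        # Read-time timeouts are raised directly, not wrapped.
        return classify(error=exc)
```

Calling something like `probe("https://api.ncbi.nlm.nih.gov/")` from both an affected server and a local machine and comparing the labels should show whether the two behave differently; a `"timeout"` on one and anything else on the other would match what is described above.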