-
-
Notifications
You must be signed in to change notification settings - Fork 1.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Timeout issue while Scraping many urls in Smart Scraper MultiGraph #865
Comments
Hi @Qunlexie, great idea, please can yo make the PR? |
PeriniM
added a commit
that referenced
this issue
Jan 6, 2025
Hey @Qunlexie, in the next release you can pass the graph_config = {
"llm": {
"api_key": os.getenv("OPENAI_API_KEY"),
"model": "openai/gpt-4o-mini",
},
"verbose": True,
"headless": False,
"loader_kwargs": {
"retry_limit": 3,
"timeout": 10,
},
} |
Thanks for fixing this faster than I could. Is this also present in the beta or no? |
it should be also in the beta |
github-actions bot
pushed a commit
that referenced
this issue
Jan 6, 2025
## [1.35.0](v1.34.2...v1.35.0) (2025-01-06) ### Features * ⏰added graph timeout and fixed model_tokens param ([#810](#810) [#856](#856) [#853](#853)) ([01a331a](01a331a)) * ⛏️ enhanced contribution and precommit added ([fcbfe78](fcbfe78)) * add codequality workflow ([4380afb](4380afb)) * add timeout and retry_limit in loader_kwargs ([#865](#865) [#831](#831)) ([21147c4](21147c4)) * serper api search ([1c0141f](1c0141f)) ### Bug Fixes * browserbase integration ([752a885](752a885)) * local html handling ([2a15581](2a15581)) ### CI * **release:** 1.34.2-beta.1 [skip ci] ([f383e72](f383e72)), closes [#861](#861) [#861](#861) * **release:** 1.34.2-beta.2 [skip ci] ([93fd9d2](93fd9d2)) * **release:** 1.34.3-beta.1 [skip ci] ([013a196](013a196)), closes [#861](#861) [#861](#861) * **release:** 1.35.0-beta.1 [skip ci] ([c5630ce](c5630ce)), closes [#865](#865) [#831](#831) * **release:** 1.35.0-beta.2 [skip ci] ([f21c586](f21c586)) * **release:** 1.35.0-beta.3 [skip ci] ([cb54d5b](cb54d5b)) * **release:** 1.35.0-beta.4 [skip ci] ([6e375f5](6e375f5)), closes [#810](#810) [#856](#856) [#853](#853)
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Describe the bug
There is a separate timeout issue when using Smart Scraper MuliGraph. It only times out when scraping many links at once usually for >=10. I was able to solve this by increasing the timeout and retry limits.
Scrapegraph-ai/scrapegraphai/docloaders/chromium.py
Line 29 in 6b98d84
I am thinking maybe we can parameterize this in some way so that it is easily configurable.
I can probably make a PR if Mods think it is a worthy fix.
The text was updated successfully, but these errors were encountered: