Timeout issue while Scraping many urls in Smart Scraper MultiGraph #865

Closed
Qunlexie opened this issue Jan 4, 2025 · 5 comments

@Qunlexie

Qunlexie commented Jan 4, 2025

Describe the bug
There is a separate timeout issue when using Smart Scraper MultiGraph. It only times out when scraping many links at once, usually 10 or more. I was able to work around it by increasing the timeout and retry limits.

Maybe we could expose these two values as parameters so that they are easy to configure.

I can probably make a PR if Mods think it is a worthy fix.
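
For readers unfamiliar with the two knobs mentioned above, here is a minimal sketch of how a per-attempt timeout and a retry limit usually interact. It is a generic asyncio illustration, not ScrapeGraphAI's loader code: `fetch_with_retry`, `flaky_fetch`, and `demo` are hypothetical names, and the fake fetch simply sleeps to simulate a slow page.

```python
import asyncio


async def fetch_with_retry(fetch, url, timeout=10.0, retry_limit=3):
    """Await fetch(url) up to retry_limit times, bounding each attempt by timeout seconds."""
    last_error = None
    for _attempt in range(retry_limit):
        try:
            # wait_for cancels the pending attempt once the timeout expires
            return await asyncio.wait_for(fetch(url), timeout=timeout)
        except (asyncio.TimeoutError, OSError) as exc:
            last_error = exc  # remember the failure and try again
    raise RuntimeError(f"gave up on {url} after {retry_limit} attempts") from last_error


async def demo():
    """Simulate a page that is too slow twice and then responds in time."""
    calls = {"n": 0}

    async def flaky_fetch(url):  # hypothetical stand-in for a real page load
        calls["n"] += 1
        await asyncio.sleep(0.2 if calls["n"] < 3 else 0)
        return f"<html>{url}</html>"

    html = await fetch_with_retry(
        flaky_fetch, "https://example.com", timeout=0.05, retry_limit=3
    )
    return html, calls["n"]


if __name__ == "__main__":
    print(asyncio.run(demo()))
```

With many URLs in flight, individual page loads tend to get slower, so a timeout that is comfortable for one page can be too tight for ten; raising either value trades total run time for fewer failed pages.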

@VinciGit00
Collaborator

Hi @Qunlexie, great idea! Could you please make the PR?

@PeriniM
Collaborator

PeriniM commented Jan 6, 2025

Hey @Qunlexie, in the next release you can pass the retry_limit and timeout in the loader_kwargs settings!

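import os

graph_config = {
    "llm": {
        "api_key": os.getenv("OPENAI_API_KEY"),
        "model": "openai/gpt-4o-mini",
    },
    "verbose": True,
    "headless": False,
    # Options forwarded to the page loader
    "loader_kwargs": {
        "retry_limit": 3,  # how many times the loader tries a page before giving up
        "timeout": 10,  # loader timeout for a page fetch
    },
}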
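
Since the original request was to make these values easy to configure, one possible way to do that on the caller's side is a small helper that builds the config above with adjustable values. This is only a sketch: `build_graph_config` is a hypothetical helper, not part of the library, and the validation rules are an assumption about what sensible values look like.

```python
import os
from typing import Any, Dict


def build_graph_config(
    timeout: int = 10,
    retry_limit: int = 3,
    model: str = "openai/gpt-4o-mini",
) -> Dict[str, Any]:
    """Return a graph config dict with the loader timeout and retry limit filled in."""
    if timeout <= 0:
        raise ValueError("timeout must be positive")
    if retry_limit < 1:
        raise ValueError("retry_limit must be at least 1")
    return {
        "llm": {
            "api_key": os.getenv("OPENAI_API_KEY"),
            "model": model,
        },
        "verbose": True,
        "headless": False,
        "loader_kwargs": {
            "retry_limit": retry_limit,
            "timeout": timeout,
        },
    }


# For a large batch of URLs, allow more time and more attempts per page.
config = build_graph_config(timeout=30, retry_limit=5)
```

The resulting dict is meant to be passed wherever `graph_config` would be, so the only thing that changes between a small and a large scrape is the two arguments.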

@Qunlexie
Author

Qunlexie commented Jan 6, 2025

Thanks for fixing this faster than I could. Is this also present in the beta or no?

@VinciGit00
Collaborator

VinciGit00 commented Jan 6, 2025

It should also be in the beta.

github-actions bot pushed a commit that referenced this issue Jan 6, 2025
## [1.35.0-beta.1](v1.34.3-beta.1...v1.35.0-beta.1) (2025-01-06)

### Features

* ⛏️ enhanced contribution and precommit added ([fcbfe78](fcbfe78))
* add timeout and retry_limit in loader_kwargs ([#865](#865) [#831](#831)) ([21147c4](21147c4))

### Bug Fixes

* local html handling ([2a15581](2a15581))

github-actions bot pushed a commit that referenced this issue Jan 6, 2025
## [1.35.0](v1.34.2...v1.35.0) (2025-01-06)

### Features

* ⏰added graph timeout and fixed model_tokens param ([#810](#810) [#856](#856) [#853](#853)) ([01a331a](01a331a))
* ⛏️ enhanced contribution and precommit added ([fcbfe78](fcbfe78))
* add codequality workflow ([4380afb](4380afb))
* add timeout and retry_limit in loader_kwargs ([#865](#865) [#831](#831)) ([21147c4](21147c4))
* serper api search ([1c0141f](1c0141f))

### Bug Fixes

* browserbase integration ([752a885](752a885))
* local html handling ([2a15581](2a15581))

### CI

* **release:** 1.34.2-beta.1 [skip ci] ([f383e72](f383e72)), closes [#861](#861) [#861](#861)
* **release:** 1.34.2-beta.2 [skip ci] ([93fd9d2](93fd9d2))
* **release:** 1.34.3-beta.1 [skip ci] ([013a196](013a196)), closes [#861](#861) [#861](#861)
* **release:** 1.35.0-beta.1 [skip ci] ([c5630ce](c5630ce)), closes [#865](#865) [#831](#831)
* **release:** 1.35.0-beta.2 [skip ci] ([f21c586](f21c586))
* **release:** 1.35.0-beta.3 [skip ci] ([cb54d5b](cb54d5b))
* **release:** 1.35.0-beta.4 [skip ci] ([6e375f5](6e375f5)), closes [#810](#810) [#856](#856) [#853](#853)

@PeriniM
Collaborator

PeriniM commented Jan 6, 2025

@Qunlexie now it is in the new release! v1.35.0 #869

@PeriniM PeriniM closed this as completed Jan 6, 2025