-
Notifications
You must be signed in to change notification settings - Fork 173
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: Updated ADAPTER_LLMW_MAX_POLLS to 120 for 1 hour extraction #904
base: main
Are you sure you want to change the base?
Conversation
Quality Gate passedIssues Measures |
|
# 500 mins to allow 1500 (max pages limit) * 20 (approx time in sec to process a page) | ||
ADAPTER_LLMW_MAX_POLLS=1000 | ||
# ~60 mins (assuming it'll be enough to process 1500 pages with LLMW v2) | ||
ADAPTER_LLMW_MAX_POLLS=120 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@chandrasekharan-zipstack but what about for the V1? Doesn't it use the same ENV?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@ritwik-g you're right, v1 uses the same envs. Shall we discourage support for v1 by changing this env (reducing the time / max possible pages) and instead enforce support with v2 for more number of pages?
Realistically speaking - I doubt if any user has such large extraction times.
Worst case,
- either we'll have to let this env be and take action after we sunset v1
- or introduce a new set of envs for v2 and update that (involves changes in the SDK, so I'm not a fan of this)
# 500 mins to allow 1500 (max pages limit) * 20 (approx time in sec to process a page) | ||
ADAPTER_LLMW_MAX_POLLS=1000 | ||
# ~60 mins (assuming it'll be enough to process 1500 pages with LLMW v2) | ||
ADAPTER_LLMW_MAX_POLLS=120 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@chandrasekharan-zipstack Will this not cause timeout if it exceeds max 15min time which we have set in gunicorn ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@harini-venkataraman the timeout exists only for extractions that happens with a web UI. This large setting is mainly ideal for async pipeline based extractions where such gunicorn timeouts will not play a role
What
Why
Can this PR break any existing features. If yes, please list possible items. If no, please explain why. (PS: Admins do not merge the PR without this section filled)
Env Config
Related Issues or PRs
Notes on Testing
Checklist
I have read and understood the Contribution Guidelines.