-
Notifications
You must be signed in to change notification settings - Fork 5.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
MagenticOne OCR Error #4482
Comments
Can you provide your full conversation log? |
@ekzhu it's very long, but I'm just testing the first question in GAIA (https://huggingface.co/datasets/gaia-benchmark/GAIA?row=0); here is the log from the start all the way until the first OCR error:
|
@skychwang We've been noticing that on our end too, with the OpenAI endpoint (we ran our evaluations on Azure). though model versions are the same, filters etc may diverge. It's possible we can tweak the prompt a little to minimize the issue, but we might also need to seek another source of OCR. (Previously we were using EasyOCR but it had many errors -- GPT was quick, easy, apparently reliable, and we were using it already anyways). |
@afourney I had a similar issue (dev8 version), If I recall I haven't noticed it in dev 6 version, any suggestions? |
@yanivvak Yeah, that part of the code hasn't changed from dev6 to dev8 (or really even dev1). The model providers have changed something on their end regarding filtering -- as far as I can tell. We're investigating some remediations:
My recommendation right now, if feasible, is to use multi-model models in all agents, and just remove the OCR output for now. |
OCR is not strictly necessary, and can now be disabled via the constructor. Recommend mitigating this issue by disabling it for now. |
What happened?
Running a query by executing
python examples/example.py --logs_dir ./logs
, I sometimes get the following:What did you expect to happen?
Whenever OCR is attempted, it is broken. Should this be happening?
How can we reproduce it (as minimally and precisely as possible)?
I'm assuming that this is GPT4o refusing to OCR? Doing a web search query and this happens all the time.
AutoGen version
Newest
Which package was this bug in
Core
Model used
gpt-4o-2024-05-13
Python version
No response
Operating system
No response
Any additional info you think would be helpful for fixing this bug
No response
The text was updated successfully, but these errors were encountered: