Erroneous Text output for IE task #21

Closed
riteshKumarUMass opened this issue Aug 12, 2022 · 2 comments

Comments

@riteshKumarUMass

Hi,
I tried fine-tuning the model on a custom receipt dataset for the IE task and noticed issues with the text extracted for the given set of keys. The output either drops 1-2 characters or adds 1-2 extra characters relative to the actual text in the document, and this pattern is very frequent. I am using the default input_size: [1280, 960]. The images are very clear, and any other off-the-shelf OCR model extracts the text from them with no errors. I fine-tuned the model on 400 images with 15 keys and tested it on 100 samples. Has anyone encountered this issue?

@gwkrsrch
Collaborator

Hi,
It will depend on the data/task. If you can share the input image with errors (and any additional helpful information), we may find a reason/solution faster. Given the limited information, I can give you a basic checklist:

Plus, there is a hands-on tutorial at this link that might be helpful to you. In addition, checking other reported/resolved issues in this repository will also be useful. Hope this helps :)
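
When sharing failing samples, a per-key character-level comparison can make the "1-2 characters missing or extra" pattern easier to see. Below is a minimal sketch, assuming the predictions and the ground truth are available as flat dicts mapping each key to a string. The names `edit_distance`, `char_diff_report`, `pred`, and `gold` are hypothetical and not part of this repository.

```python
from difflib import SequenceMatcher


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance computed with a two-row dynamic program."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]


def char_diff_report(pred: dict, gold: dict) -> dict:
    """For every key whose prediction differs from the ground truth,
    return the edit distance and the non-matching character spans."""
    report = {}
    for key, expected in gold.items():
        predicted = pred.get(key, "")  # a missing key counts as empty output
        if predicted == expected:
            continue
        matcher = SequenceMatcher(None, expected, predicted, autojunk=False)
        ops = [
            (tag, expected[i1:i2], predicted[j1:j2])
            for tag, i1, i2, j1, j2 in matcher.get_opcodes()
            if tag != "equal"
        ]
        report[key] = {
            "distance": edit_distance(predicted, expected),
            "ops": ops,
        }
    return report


if __name__ == "__main__":
    # Hypothetical sample: one extra trailing character in "total".
    gold = {"total": "12.50", "date": "2022-08-12"}
    pred = {"total": "12.500", "date": "2022-08-12"}
    for key, info in char_diff_report(pred, gold).items():
        print(key, info)
```

Running this over the test samples and grouping the differing spans by key and by position (start, middle, end of the string) may show whether the errors cluster around particular characters or field boundaries, which is the kind of detail that helps with debugging.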

@gwkrsrch
Collaborator

Closing this issue since there has been no update for a long time. Feel free to reopen it if you have anything new to share for debugging :)
