You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This commit was created on GitHub.com and signed with GitHub’s verified signature.
0.14.10
Enhancements
Update unstructured-client dependency Change unstructured-client dependency pin back to
greater than min version and updated tests that were failing given the update.
.doc files are now supported in the arm64 image.. libreoffice24 is added to the arm64 image, meaning .doc files are now supported. We have follow on work planned to investigate adding .ppt support for arm64 as well.
Add table detection metrics: recall, precision and f1
Remove unused _with_spans metrics
Features
Fixes
Fix counting false negatives and false positives in table structure evaluation
Fix Slack CI test Change channel that Slack test is pointing to because previous test bot expired
Remove NLTK download Removes nltk.download in favor of downloading from an S3 bucket we host to mitigate CVE-2024-39705