Skip to content

0.14.10

Compare
Choose a tag to compare
@MthwRobinson MthwRobinson released this 09 Jul 11:33
· 190 commits to main since this release
7b25dfc

0.14.10

Enhancements

  • Update unstructured-client dependency Change unstructured-client dependency pin back to
    greater than min version and updated tests that were failing given the update.
  • .doc files are now supported in the arm64 image.. libreoffice24 is added to the arm64 image, meaning .doc files are now supported. We have follow on work planned to investigate adding .ppt support for arm64 as well.
  • Add table detection metrics: recall, precision and f1
  • Remove unused _with_spans metrics

Features

Fixes

  • Fix counting false negatives and false positives in table structure evaluation
  • Fix Slack CI test Change channel that Slack test is pointing to because previous test bot expired
  • Remove NLTK download Removes nltk.download in favor of downloading from an S3 bucket we host to mitigate CVE-2024-39705