Releases: CogStack/tika-service
v0.5.3
v0.5.2
Minor release:
- Fixed an issue where some documents were not buffered into the stream properly, resulting in zero-length errors.
Full Changelog: v0.5.1...v0.5.2
v0.5.1
This is a minor release. Summary of changes:
- Changed the default Docker base image from openjdk to ubuntu:kinetic after observing substantial performance improvements (the image now performs closer to running the .JAR file directly on the host system).
- Possible bug fix for RenderState errors.
Full Changelog: 0.5.0...v0.5.1
v0.5.0
Updates:
- Tika updated to 2.5.0
- Dependencies updated to latest available versions
- Added extra configurable options for PDF OCR extractions
- Encoding output issue fixes
- Docker service Tesseract OCR environment variable updates (a dynamic number of threads is now enabled; do not change this unless necessary).
Full Changelog: 0.4.0...0.5.0
v0.4.0
0.3.1
Tika 2.2.1
Various application configuration changes (see README.md) that allow more granular control over how images are processed (DPI, resizing, etc.).
Batch processing improvements.
Full Changelog: https://github.com/CogStack/tika-service/commits/0.3.1
Version 0.2.0
Key changes include:
- Added threading control for Tomcat serving endpoint
- Updated Tika to version 1.24
- Updated Tesseract to version 4.1.1
Initial version 0.1.0
Initial stable version release