A minimal Apache Tika Docker image.
The image is based on openjdk:11.0.8-jre-slim and no additional dependencies are installed to keep the image size to minimum.
docker pull asandeepin/docker-tikaapp:latest
docker run asandeepin/docker-tikaapp:latest tika [option...] [file]
docker run -itd asandeepin/docker-tikaapp:latest /bin/bash
docker exec -it <container_name/id> tika [option...] [file]
Make sure to mount volumes to source input files from host machine inside container.
For more command line options and how to use batch operations refer to official getting started guide.
To push images for new version to Docker hub, follow below steps.
-
Clone the repository.
git clone [email protected]:asandeep/docker-tikaapp.git
-
To build images locally and play around, run below command:
./build.sh <TIKA_VERSION>
-
Once satisfied, update the version string in
current_version
file. -
Create a PR to merge changes to master branch. Once PR is merged, autobuild will automatically push the latest image with relevant tags to Docker hub.