Tim Allison created TIKA-4667:
---------------------------------
Summary: Add tess4j wrapper in 4.x
Key: TIKA-4667
URL: https://issues.apache.org/jira/browse/TIKA-4667
Project: Tika
Issue Type: New Feature
Reporter: Tim Allison
A long while ago we declined the contribution of a tess4j wrapper. The reason
was that we didn't want to be responsible for getting tess4j working on
everyone's various OS.
I still don't think we want this responsibility, but I think we should make it
available for testing and evaluation. Given we know the OS of the docker image,
if it is substantially better than shelling out to tesseract, we could put it
in our server docker image.
The other thing that's changes is tika-pipes. I had concerns about native code.
That would now be isolated into the pipes forked process so we don't have to
worry as much about damaging the main jvm.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)