Tim Allison created TIKA-4667:
---------------------------------

             Summary: Add tess4j wrapper in 4.x
                 Key: TIKA-4667
                 URL: https://issues.apache.org/jira/browse/TIKA-4667
             Project: Tika
          Issue Type: New Feature
            Reporter: Tim Allison


A long while ago we declined the contribution of a tess4j wrapper. The reason 
was that we didn't want to be responsible for getting tess4j working on 
everyone's various operating systems.

I still don't think we want this responsibility, but I think we should make it 
available for testing and evaluation. Given that we know the OS of the Docker 
image, if tess4j is substantially better than shelling out to tesseract, we 
could put it in our server Docker image.

The other thing that's changed is tika-pipes. I had concerns about native code. 
That code would now be isolated in the pipes forked process, so we don't have 
to worry as much about damaging the main JVM.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
