[
https://issues.apache.org/jira/browse/TIKA-2995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16979298#comment-16979298
]
Tim Allison commented on TIKA-2995:
---
I'm happy to bump the markLimit. What do others think?
You
Tim Barrett created TIKA-2995:
-
Summary: markLimit too small in
org.apache.tika.parser.microsoft.POIFSContainerDetector
Key: TIKA-2995
URL: https://issues.apache.org/jira/browse/TIKA-2995
Project: Tika
K. Sounds like an example Docker file will meet your needs, Eric?
Users can currently build their own images with the Docker file in
tika-server, and there's logical-spark.
As noted, there are complexities with distributing an image.
Between those two options, folks should basically be ok.
That makes sense. Having a robust Dockerfile, even if it isn’t published, is
a great way of modeling best practices in running Tika in server mode.
> On Nov 21, 2019, at 3:26 AM, Nick Burch wrote:
>
> On Thu, 21 Nov 2019, Oleg Tikhonov wrote:
>> My question is more pragmatic.
>> What we
On Thu, 21 Nov 2019, Oleg Tikhonov wrote:
My question is more pragmatic.
What we put inside the Dockerfile, on which image it will be based on (say
Ubuntu) ...
What will contain an entrypoint? Tika Server? Should we "install" a
tesseract? Anything more?
If we want to be trendy, then Sergey
My question is more pragmatic.
What we put inside the Dockerfile, on which image it will be based on (say
Ubuntu) ...
What will contain an entrypoint? Tika Server? Should we "install" a
tesseract? Anything more?
Thanks,
Oleg
On Thu, Nov 21, 2019 at 4:46 AM Chris Mattmann wrote:
> Yeah