[jira] [Commented] (TIKA-2995) markLimit too small in org.apache.tika.parser.microsoft.POIFSContainerDetector

2019-11-21 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-2995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16979298#comment-16979298 ] Tim Allison commented on TIKA-2995: --- I'm happy to bump the markLimit. What do others think? You

[jira] [Created] (TIKA-2995) markLimit too small in org.apache.tika.parser.microsoft.POIFSContainerDetector

2019-11-21 Thread Tim Barrett (Jira)
Tim Barrett created TIKA-2995: - Summary: markLimit too small in org.apache.tika.parser.microsoft.POIFSContainerDetector Key: TIKA-2995 URL: https://issues.apache.org/jira/browse/TIKA-2995 Project: Tika

Re: [EXTERNAL] Docker image along with 1.23?

2019-11-21 Thread Tim Allison
K. Sounds like an example Docker file will meet your needs, Eric? Users can currently build their own images with the Docker file in tika-server, and there's logical-spark. As noted, there are complexities with distributing an image. Between those two options, folks should basically be ok.

Re: [EXTERNAL] Docker image along with 1.23?

2019-11-21 Thread Eric Pugh
That makes sense. Having a robust Dockerfile, even if it isn’t published, is a great way of modeling best practices in running Tika in server mode. > On Nov 21, 2019, at 3:26 AM, Nick Burch wrote: > > On Thu, 21 Nov 2019, Oleg Tikhonov wrote: >> My question is more pragmatic. >> What we

Re: [EXTERNAL] Docker image along with 1.23?

2019-11-21 Thread Nick Burch
On Thu, 21 Nov 2019, Oleg Tikhonov wrote: My question is more pragmatic. What we put inside the Dockerfile, on which image it will be based on (say Ubuntu) ... What will contain an entrypoint? Tika Server? Should we "install" a tesseract? Anything more? If we want to be trendy, then Sergey

Re: [EXTERNAL] Docker image along with 1.23?

2019-11-21 Thread Oleg Tikhonov
My question is more pragmatic. What we put inside the Dockerfile, on which image it will be based on (say Ubuntu) ... What will contain an entrypoint? Tika Server? Should we "install" a tesseract? Anything more? Thanks, Oleg On Thu, Nov 21, 2019 at 4:46 AM Chris Mattmann wrote: > Yeah