RE: Disabling Zip bomb detection in Tika

2016-09-22 Thread Allison, Timothy B.
MostlyPassThroughHtmlMapper passes through, well, mostly everything. -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: Thursday, September 22, 2016 12:47 PM To: solr-user <solr-user@lucene.apache.org> Subject: Re: Disabling Zip bomb detection in Tika So far a Tika JIRA seem

Re: Disabling Zip bomb detection in Tika

2016-09-22 Thread Erick Erickson
So far a Tika JIRA seems like the right thing. Tim is "a well known entity" in Solr though so I'm sure he'll move it over to Solr if appropriate. Erick On Thu, Sep 22, 2016 at 9:43 AM, Rodrigo Rosenfeld Rosas wrote: > Here it is. Not sure if it's clear enough

Re: Disabling Zip bomb detection in Tika

2016-09-22 Thread Rodrigo Rosenfeld Rosas
Here it is. Not sure if it's clear enough though: https://issues.apache.org/jira/browse/TIKA-2091 Or should I have created the ticket in the Solr project instead? Em 22-09-2016 13:32, Rodrigo Rosenfeld Rosas escreveu: This is one of the documents:

RE: Disabling Zip bomb detection in Tika

2016-09-22 Thread Allison, Timothy B.
with the results of that, go for it. -Original Message- From: Rodrigo Rosenfeld Rosas [mailto:rr_ro...@yahoo.com.br.INVALID] Sent: Thursday, September 22, 2016 12:27 PM To: solr-user@lucene.apache.org Subject: Re: Disabling Zip bomb detection in Tika Great, thanks for the URL, I'll check

Re: Disabling Zip bomb detection in Tika

2016-09-22 Thread Rodrigo Rosenfeld Rosas
This is one of the documents: https://www.sec.gov/Archives/edgar/data/1472033/000119380513001310/e611133_f6ef-eutelsat.htm I'll try to create a ticket for this on Jira if I find its location but feel free to open it yourself if you prefer, just let me know. Em 22-09-2016 12:33, Allison,

Re: Disabling Zip bomb detection in Tika

2016-09-22 Thread Rodrigo Rosenfeld Rosas
he.org' <u...@tika.apache.org> Subject: RE: Disabling Zip bomb detection in Tika I don't think that's configurable at the moment. Tika-colleagues, any recommendations? If you're able to share the file on Tika's jira, we'd be happy to take a look. You shouldn't be getting the zip

RE: Disabling Zip bomb detection in Tika

2016-09-22 Thread Allison, Timothy B.
> I'll try to get a sample HTML yielding to this problem and attach it to Jira. Great! Tika 1.14 is around the corner...if this is an easy fix ... :) Thank you.

Re: Disabling Zip bomb detection in Tika

2016-09-22 Thread Erick Erickson
looks like Nick (gagravarr) has answered on SO -- can't do it in Tika >>> currently. >>> >>> -Original Message- >>> From: Allison, Timothy B. [mailto:talli...@mitre.org] >>> Sent: Thursday, September 22, 2016 10:42 AM >>> To: solr-use

Re: Disabling Zip bomb detection in Tika

2016-09-22 Thread Rodrigo Rosenfeld Rosas
org> Subject: RE: Disabling Zip bomb detection in Tika I don't think that's configurable at the moment. Tika-colleagues, any recommendations? If you're able to share the file on Tika's jira, we'd be happy to take a look. You shouldn't be getting the zip bomb unless there is a mismatch between o

Re: Disabling Zip bomb detection in Tika

2016-09-22 Thread Rodrigo Rosenfeld Rosas
) has answered on SO -- can't do it in Tika currently. -Original Message- From: Allison, Timothy B. [mailto:talli...@mitre.org] Sent: Thursday, September 22, 2016 10:42 AM To: solr-user@lucene.apache.org Cc: 'u...@tika.apache.org' <u...@tika.apache.org> Subject: RE: Disabling Zi

RE: Disabling Zip bomb detection in Tika

2016-09-22 Thread Allison, Timothy B.
org> Subject: RE: Disabling Zip bomb detection in Tika I don't think that's configurable at the moment. Tika-colleagues, any recommendations? If you're able to share the file on Tika's jira, we'd be happy to take a look. You shouldn't be getting the zip bomb unless there is a mismatch between o

RE: Disabling Zip bomb detection in Tika

2016-09-22 Thread Allison, Timothy B.
I don't think that's configurable at the moment. Tika-colleagues, any recommendations? If you're able to share the file on Tika's jira, we'd be happy to take a look. You shouldn't be getting the zip bomb unless there is a mismatch between opening and closing tags (which could point to a bug