[
https://issues.apache.org/jira/browse/TIKA-2676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16518427#comment-16518427
]
Slava G commented on TIKA-2676:
---
Hi Tim,
I would like to share more of the stacktrace, but here's a problem,
Slava G created TIKA-2676:
-
Summary: After switching to TIKA 1.18 from !.17 started to get
exception
Key: TIKA-2676
URL: https://issues.apache.org/jira/browse/TIKA-2676
Project: Tika
Issue Type:
[
https://issues.apache.org/jira/browse/TIKA-2676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Slava G updated TIKA-2676:
--
Description:
I recently switched from TIKA 1.17 to TIKA 1.18 (I'm using tika to parse
emails).
And I started
[
https://issues.apache.org/jira/browse/TIKA-2676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Slava G updated TIKA-2676:
--
Summary: After switching to TIKA 1.18 from 1.17 started to get exception
(was: After switching to TIKA 1.18
[
https://issues.apache.org/jira/browse/TIKA-2727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16632536#comment-16632536
]
Slava G commented on TIKA-2727:
---
Hi Tim,
Sorry, I didn't get you. Which email do you mean?
Thanks
> Parsing and
[
https://issues.apache.org/jira/browse/TIKA-2727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16625985#comment-16625985
]
Slava G commented on TIKA-2727:
---
Thanks, will look.
Could it be that the solution in 1.19 does not always work?
[
https://issues.apache.org/jira/browse/TIKA-2727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16626408#comment-16626408
]
Slava G commented on TIKA-2727:
---
Tried to reproduce; after a few hundred XML files that were transferred to TIKA for
[
https://issues.apache.org/jira/browse/TIKA-2727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16626408#comment-16626408
]
Slava G edited comment on TIKA-2727 at 9/24/18 8:34 PM:
Tried to reproduce, after
[
https://issues.apache.org/jira/browse/TIKA-2727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16625041#comment-16625041
]
Slava G commented on TIKA-2727:
---
Hi,
Testing 1.19, and it seems that on some files where it was stuck (1.17)
[
https://issues.apache.org/jira/browse/TIKA-2727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16625041#comment-16625041
]
Slava G edited comment on TIKA-2727 at 9/23/18 9:06 PM:
Hi,
Testing the 1.19 and
[
https://issues.apache.org/jira/browse/TIKA-2727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16626437#comment-16626437
]
Slava G edited comment on TIKA-2727 at 9/24/18 8:41 PM:
10 iterations inside for
[
https://issues.apache.org/jira/browse/TIKA-2727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16626437#comment-16626437
]
Slava G commented on TIKA-2727:
---
10 iterations inside a for loop (same thread), file
[
https://issues.apache.org/jira/browse/TIKA-2727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Slava G updated TIKA-2727:
--
Attachment: 1_6e4b115e-7d2d-45f1-a842-35b5ad7ba559
> Parsing and detect mime type of XML file stuck in infinite
[
https://issues.apache.org/jira/browse/TIKA-2727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16626445#comment-16626445
]
Slava G commented on TIKA-2727:
---
OK, thanks. I hope you'll be able to fix this quickly.
Thanks a lot
>
Slava G created TIKA-2727:
-
Summary: Parsing and detect mime type of XML file stuck in
infinite loop
Key: TIKA-2727
URL: https://issues.apache.org/jira/browse/TIKA-2727
Project: Tika
Issue Type:
[
https://issues.apache.org/jira/browse/TIKA-2727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16617833#comment-16617833
]
Slava G commented on TIKA-2727:
---
I'm using TIKA directly in my code,
Does version 1.19 solve this issue
[
https://issues.apache.org/jira/browse/TIKA-2727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16617833#comment-16617833
]
Slava G edited comment on TIKA-2727 at 9/17/18 5:23 PM:
I'm using TIKA directly in
[
https://issues.apache.org/jira/browse/TIKA-2727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16617928#comment-16617928
]
Slava G commented on TIKA-2727:
---
Will definitely work to provide as much information as possible to solve
[
https://issues.apache.org/jira/browse/TIKA-2676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16619646#comment-16619646
]
Slava G commented on TIKA-2676:
---
We're passing null as a content type, in this case. All other cases where
[
https://issues.apache.org/jira/browse/TIKA-2676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16619714#comment-16619714
]
Slava G commented on TIKA-2676:
---
Well, the flow is as follows: we're parsing the MimeMessage, taking just the body as
[
https://issues.apache.org/jira/browse/TIKA-2676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16619727#comment-16619727
]
Slava G commented on TIKA-2676:
---
Well, I'm not using ActivationDataFlavor directly in any part of this flow.
[
https://issues.apache.org/jira/browse/TIKA-2676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16620204#comment-16620204
]
Slava G commented on TIKA-2676:
---
I wish I knew :) but it's 99% coming from that area. I'll change log
[
https://issues.apache.org/jira/browse/TIKA-2727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16617516#comment-16617516
]
Slava G commented on TIKA-2727:
---
Great !!! Thanks.
Is the jdk.xml.entityExpansionLimit relevant only for
[
https://issues.apache.org/jira/browse/TIKA-2727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16617516#comment-16617516
]
Slava G edited comment on TIKA-2727 at 9/17/18 1:22 PM:
Great !!! Thanks.
Is the
[
https://issues.apache.org/jira/browse/TIKA-2727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16618986#comment-16618986
]
Slava G commented on TIKA-2727:
---
The problem is that the issue is not always reproducible; I can't point to some
[
https://issues.apache.org/jira/browse/TIKA-2676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16619160#comment-16619160
]
Slava G commented on TIKA-2676:
---
I wish it were that simple, but the only version is mime4j 0.8.1
And
Slava G created TIKA-2832:
-
Summary: Very slow large PDF text extraction
Key: TIKA-2832
URL: https://issues.apache.org/jira/browse/TIKA-2832
Project: Tika
Issue Type: Bug
Components:
[
https://issues.apache.org/jira/browse/TIKA-2832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16777982#comment-16777982
]
Slava G commented on TIKA-2832:
---
Thanks, I will, but isn't such slow parsing a bug? To me it looks like
[
https://issues.apache.org/jira/browse/TIKA-2832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16782886#comment-16782886
]
Slava G commented on TIKA-2832:
---
Thanks for fixing it so fast.
> Very slow large PDF text extraction
>
29 matches