[
https://issues.apache.org/jira/browse/TIKA-1739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14903340#comment-14903340
]
Nick Burch commented on TIKA-1739:
--
I'm not sure that the cTAKES parser should be creating an
[
https://issues.apache.org/jira/browse/TIKA-1739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14903252#comment-14903252
]
Chris A. Mattmann commented on TIKA-1739:
-
OK [~totaro] I implemented your solution (see attached
[
https://issues.apache.org/jira/browse/TIKA-1739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14903252#comment-14903252
]
Chris A. Mattmann edited comment on TIKA-1739 at 9/22/15 7:13 PM:
--
OK
[
https://issues.apache.org/jira/browse/TIKA-1739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14903188#comment-14903188
]
Chris A. Mattmann commented on TIKA-1739:
-
Thanks Giuseppe, so I will try this fix now and update
Lewis John McGibbney created TIKA-1741:
--
Summary: Include CTAKESConfig.properties within tika-parsers
resources by default
Key: TIKA-1741
URL: https://issues.apache.org/jira/browse/TIKA-1741
[
https://issues.apache.org/jira/browse/TIKA-1737?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14902528#comment-14902528
]
Tim Allison edited comment on TIKA-1737 at 9/22/15 4:16 PM:
bq. there were
[
https://issues.apache.org/jira/browse/TIKA-1739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14903123#comment-14903123
]
Giuseppe Totaro commented on TIKA-1739:
---
Hi [~chrismattmann], Hi [~gagravarr],
I looked at the last
[
https://issues.apache.org/jira/browse/TIKA-1739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14903389#comment-14903389
]
Nick Burch commented on TIKA-1739:
--
We explicitly don't let you set an {{AutoDetectParser}} in the config,
[
https://issues.apache.org/jira/browse/TIKA-1737?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14903422#comment-14903422
]
Alan Burlison commented on TIKA-1737:
-
bq. Re the ArrayIndexOutOfBoundsException - are you using
[
https://issues.apache.org/jira/browse/TIKA-1739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14903391#comment-14903391
]
Nick Burch commented on TIKA-1739:
--
We explicitly don't let you set an {{AutoDetectParser}} in the config,
[
https://issues.apache.org/jira/browse/TIKA-1739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14903390#comment-14903390
]
Nick Burch commented on TIKA-1739:
--
We explicitly don't let you set an {{AutoDetectParser}} in the config,
Nathan Dire created TIKA-1742:
-
Summary: StackOverflowError parsing a PDF with
ExtractInlineImages=true
Key: TIKA-1742
URL: https://issues.apache.org/jira/browse/TIKA-1742
Project: Tika
Issue
[
https://issues.apache.org/jira/browse/TIKA-1744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yaniv Kunda updated TIKA-1744:
--
Attachment: TIKA-1744.patch
> Use java.nio.file.Path in TikaInputStream
>
[
https://issues.apache.org/jira/browse/TIKA-1739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Burch updated TIKA-1739:
-
Comment: was deleted
(was: We explicitly don't let you set an {{AutoDetectParser}} in the config,
it's
Bob Paulin created TIKA-1743:
Summary: NetworkParser can create Unbounded Number of Threads
Key: TIKA-1743
URL: https://issues.apache.org/jira/browse/TIKA-1743
Project: Tika
Issue Type: Bug
[
https://issues.apache.org/jira/browse/TIKA-1737?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14903537#comment-14903537
]
Tilman Hausherr commented on TIKA-1737:
---
No, PDFBOX-2987 is another one I fixed for you. The NPE in
[
https://issues.apache.org/jira/browse/TIKA-1739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Burch updated TIKA-1739:
-
Comment: was deleted
(was: We explicitly don't let you set an {{AutoDetectParser}} in the config,
it's
[
https://issues.apache.org/jira/browse/TIKA-1742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nathan Dire updated TIKA-1742:
--
Description:
Here's the file:
http://nlp.stanford.edu/~socherr/EMNLP2013_RNTN.pdf
Code to repro
[
https://issues.apache.org/jira/browse/TIKA-1743?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14903878#comment-14903878
]
Tyler Palsulich commented on TIKA-1743:
---
[Copied from the list]
This sounds like a great idea! We
This sounds like a great idea! We should make the size of the pool
configurable with TikaConfig.
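
For illustration only, a bounded pool along these lines could be sketched with the standard java.util.concurrent API. This is not the NetworkParser code and not a TikaConfig setting: the class name BoundedPoolSketch, the helper maxConcurrent and the poolSize parameter are hypothetical, and Java 8+ is assumed.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.atomic.AtomicInteger;

public class BoundedPoolSketch {

    // Hypothetical helper: runs `tasks` short jobs on a pool capped at
    // `poolSize` threads and returns the highest number of jobs that were
    // observed running at the same time.
    static int maxConcurrent(int poolSize, int tasks) throws Exception {
        // poolSize stands in for a value that would be read from configuration.
        ExecutorService pool = Executors.newFixedThreadPool(poolSize);
        AtomicInteger running = new AtomicInteger();
        AtomicInteger peak = new AtomicInteger();
        List<Future<?>> futures = new ArrayList<>();
        try {
            for (int i = 0; i < tasks; i++) {
                futures.add(pool.submit(() -> {
                    int now = running.incrementAndGet();
                    peak.accumulateAndGet(now, Math::max);
                    try {
                        Thread.sleep(10); // stand-in for the real work
                    } catch (InterruptedException e) {
                        Thread.currentThread().interrupt();
                    }
                    running.decrementAndGet();
                }));
            }
            // Wait for every job; extra jobs queue instead of spawning threads.
            for (Future<?> f : futures) {
                f.get();
            }
        } finally {
            pool.shutdown();
        }
        return peak.get();
    }

    public static void main(String[] args) throws Exception {
        System.out.println("peak concurrency: " + maxConcurrent(4, 32));
    }
}
```

With a fixed pool the number of worker threads never exceeds poolSize, whatever the number of submitted jobs; the remaining jobs wait in the executor's queue.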
On Tue, Sep 22, 2015, 3:04 PM Bob Paulin (JIRA) wrote:
> Bob Paulin created TIKA-1743:
>
>
> Summary: NetworkParser can create
[
https://issues.apache.org/jira/browse/TIKA-1745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yaniv Kunda updated TIKA-1745:
--
Attachment: TIKA-1745.patch
> Add methods accepting java.nio.file.Path to org.apache.tika.Tika and
>
[
https://issues.apache.org/jira/browse/TIKA-1739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14903670#comment-14903670
]
Chris A. Mattmann commented on TIKA-1739:
-
So, I'm going to take this to the list, but here is the
[
https://issues.apache.org/jira/browse/TIKA-1739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chris A. Mattmann resolved TIKA-1739.
-
Resolution: Won't Fix
Nick suggested a work-around, works fine.
> cTAKESParser doesn't
[
https://issues.apache.org/jira/browse/TIKA-1746?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yaniv Kunda updated TIKA-1746:
--
Attachment: TIKA-1746.patch
> modify TikaFileTypeDetector to use new detect method accepting
>
[
https://issues.apache.org/jira/browse/TIKA-1740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14902592#comment-14902592
]
Tim Allison commented on TIKA-1740:
---
Oops. Nick beat me to it. That was plan B.
[~gagravarr], do you
[
https://issues.apache.org/jira/browse/TIKA-1737?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14902657#comment-14902657
]
Alan Burlison edited comment on TIKA-1737 at 9/22/15 1:51 PM:
--
bq. Could we
[
https://issues.apache.org/jira/browse/TIKA-1734?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14902668#comment-14902668
]
Hudson commented on TIKA-1734:
--
SUCCESS: Integrated in tika-trunk-jdk1.7 #852 (See
[
https://issues.apache.org/jira/browse/TIKA-1739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14902601#comment-14902601
]
Nick Burch commented on TIKA-1739:
--
I can't actually use the cTAKES parser on my machine - I tried
[
https://issues.apache.org/jira/browse/TIKA-1734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison resolved TIKA-1734.
---
Resolution: Fixed
r1704620.
Thank you, [~kunda]!
> Use java.nio.file.Path in TemporaryResources
>
[
https://issues.apache.org/jira/browse/TIKA-1726?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14902613#comment-14902613
]
Tim Allison commented on TIKA-1726:
---
Thank you, [~kkrugler]. [~kunda], is there enough consensus on this
[
https://issues.apache.org/jira/browse/TIKA-1737?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14902622#comment-14902622
]
Tim Allison commented on TIKA-1737:
---
Could we have done something at the Tika level to cause this...I
[
https://issues.apache.org/jira/browse/TIKA-1737?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14902659#comment-14902659
]
Tim Allison commented on TIKA-1737:
---
bq. dating back as far as 1992
Y, I just confirmed that I can't
[
https://issues.apache.org/jira/browse/TIKA-1737?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14902657#comment-14902657
]
Alan Burlison commented on TIKA-1737:
-
bq. Could we have done something at the Tika level to cause
[
https://issues.apache.org/jira/browse/TIKA-1740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14902694#comment-14902694
]
Andrea commented on TIKA-1740:
--
Thanks for your reply. Of course I can create my own Recursive parser, but it
Yes, using getPath() for the getFile() counterpart.
I'll prepare patches in a few hours.
On Sep 22, 2015 4:35 PM, "Tim Allison (JIRA)" wrote:
>
> [
>
Thank _you_ for all of your work in modernizing us. With your efforts, we'll
be able to deprecate TikaInputStream#get(PunchCard pc) soon. :)
>>Regarding FilenameUtils.getName() - I believe that its functionality can be
>>replaced by Path.getFileName() - and in a platform-aware manner, as each
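
As a rough sketch of the replacement mentioned above, using only java.nio.file: the class name FileNameSketch and the helper baseName are hypothetical and are not part of Tika or commons-io. Behaviour can differ from FilenameUtils.getName() in edge cases, so treat this as an assumption to verify rather than a drop-in equivalent.

```java
import java.nio.file.Path;
import java.nio.file.Paths;

public class FileNameSketch {

    // Hypothetical helper: returns the last element of the path as a string.
    // Path.getFileName() returns null for a path with no elements (such as
    // a root), so that case is mapped to the empty string here.
    static String baseName(String raw) {
        Path name = Paths.get(raw).getFileName();
        return name == null ? "" : name.toString();
    }

    public static void main(String[] args) {
        System.out.println(baseName("a/b/c.txt"));
    }
}
```

Paths.get() parses the string with the default file system's rules, which is where the platform awareness comes from.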
[
https://issues.apache.org/jira/browse/TIKA-1737?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14902528#comment-14902528
]
Tim Allison commented on TIKA-1737:
---
bq. there were many more that just had a single line of error
Try
[
https://issues.apache.org/jira/browse/TIKA-1737?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14902580#comment-14902580
]
Alan Burlison commented on TIKA-1737:
-
The heap dump is huge and the profiler struggles to cope so I
[
https://issues.apache.org/jira/browse/TIKA-1737?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14902522#comment-14902522
]
Tim Allison commented on TIKA-1737:
---
Thank you, [~tilman]!
> PDFBox 1.8.10 is still a basket case
>
[
https://issues.apache.org/jira/browse/TIKA-1734?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14902567#comment-14902567
]
Tim Allison commented on TIKA-1734:
---
About to commit, unless you'd like to. :)
> Use java.nio.file.Path
[
https://issues.apache.org/jira/browse/TIKA-1740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14902585#comment-14902585
]
Nick Burch commented on TIKA-1740:
--
You might be better off writing your own Recursion handler. Take a
Andrea created TIKA-1740:
Summary: RecursiveParserWrapper returning ContentHandler-s
Key: TIKA-1740
URL: https://issues.apache.org/jira/browse/TIKA-1740
Project: Tika
Issue Type: Wish
[
https://issues.apache.org/jira/browse/TIKA-1734?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14902553#comment-14902553
]
Bob Paulin commented on TIKA-1734:
--
+1 from me on this [~kunda]
> Use java.nio.file.Path in
[
https://issues.apache.org/jira/browse/TIKA-1740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14902591#comment-14902591
]
Tim Allison commented on TIKA-1740:
---
How about we store a list of pairs instead of
[
https://issues.apache.org/jira/browse/TIKA-1739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14902823#comment-14902823
]
Chris A. Mattmann commented on TIKA-1739:
-
Nick I wonder if the approval got lost in email or in
[
https://issues.apache.org/jira/browse/TIKA-1737?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14902835#comment-14902835
]
Tim Allison commented on TIKA-1737:
---
See PDFBOX-2986 for a resource leak discovered through testing
46 matches