[jira] [Commented] (TIKA-2743) Replace com.sun.xml.bind:jaxb-impl and jaxb-core by org.glassfish.jaxb:jaxb-runtime and jaxb-core

2018-11-09 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2743?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16681602#comment-16681602 ] Uwe Schindler commented on TIKA-2743: - bq. Tim Allison shouldn't jaxb-runtime have runtime, rather

[jira] [Commented] (TIKA-2722) Don't call Date.toString (Possible issue with JDK 11)

2018-09-05 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16604643#comment-16604643 ] Uwe Schindler commented on TIKA-2722: - bq. I reported it to Oracle using their normal channel for

[jira] [Commented] (TIKA-2722) Don't call Date.toString (Possible issue with JDK 11)

2018-09-05 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16604641#comment-16604641 ] Uwe Schindler commented on TIKA-2722: - Cool thanks for the reproducer. That's indeed a bug, as you

[jira] [Commented] (TIKA-2722) Don't call Date.toString (Possible issue with JDK 11)

2018-09-05 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16604516#comment-16604516 ] Uwe Schindler commented on TIKA-2722: - [~dsmiley]: I think this is a bug in Java 11. I know there were

[jira] [Comment Edited] (TIKA-2667) Upgrade jmatio to 1.4

2018-06-20 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16518490#comment-16518490 ] Uwe Schindler edited comment on TIKA-2667 at 6/20/18 7:04 PM: -- It's OK

[jira] [Comment Edited] (TIKA-2667) Upgrade jmatio to 1.4

2018-06-20 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16518490#comment-16518490 ] Uwe Schindler edited comment on TIKA-2667 at 6/20/18 7:01 PM: -- It's OK

[jira] [Comment Edited] (TIKA-2667) Upgrade jmatio to 1.4

2018-06-20 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16518490#comment-16518490 ] Uwe Schindler edited comment on TIKA-2667 at 6/20/18 7:00 PM: -- It's OK

[jira] [Commented] (TIKA-2667) Upgrade jmatio to 1.4

2018-06-20 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16518490#comment-16518490 ] Uwe Schindler commented on TIKA-2667: - It's OK because it wont fail, but I dont understand the need to

[jira] [Commented] (TIKA-2667) Upgrade jmatio to 1.3

2018-06-14 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16512016#comment-16512016 ] Uwe Schindler commented on TIKA-2667: - Hi, I just looked at your code change in jmatio. The

[jira] [Commented] (TIKA-1830) Upgrade to PDFBox 1.8.11 when available

2016-01-13 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1830?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096384#comment-15096384 ] Uwe Schindler commented on TIKA-1830: - It would be good to update to 1.8.11 as soon as it is out,

[jira] [Commented] (TIKA-1830) Upgrade to PDFBox 1.8.11 when available

2016-01-13 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1830?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096663#comment-15096663 ] Uwe Schindler commented on TIKA-1830: - bq. Speaking of integration with Solr, would you have a

[jira] [Commented] (TIKA-1824) Tika 2.0 - Create Initial Parser Modules

2016-01-13 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15096668#comment-15096668 ] Uwe Schindler commented on TIKA-1824: - Hi, as invited on TIKA-1830, here some comments from Apache

[jira] [Updated] (TIKA-1758) BatchCommandLineBuilder fails on systems with whitespace in path

2015-09-30 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated TIKA-1758: Description: All tests for CLI module fail with errors like that: {noformat} Tests run: 6,

[jira] [Commented] (TIKA-1757) tika-batch tests fail on systems with whitespace or special chars in folder name

2015-09-30 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1757?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14938915#comment-14938915 ] Uwe Schindler commented on TIKA-1757: - The other issue is different, I opened TIKA-1758 > tika-batch

[jira] [Created] (TIKA-1757) tika-batch tests fail on systems with whitespace or special chars in folder name

2015-09-30 Thread Uwe Schindler (JIRA)
Uwe Schindler created TIKA-1757: --- Summary: tika-batch tests fail on systems with whitespace or special chars in folder name Key: TIKA-1757 URL: https://issues.apache.org/jira/browse/TIKA-1757 Project:

[jira] [Created] (TIKA-1756) Update forbiddenapis to v2.0

2015-09-30 Thread Uwe Schindler (JIRA)
Uwe Schindler created TIKA-1756: --- Summary: Update forbiddenapis to v2.0 Key: TIKA-1756 URL: https://issues.apache.org/jira/browse/TIKA-1756 Project: Tika Issue Type: Improvement

[jira] [Created] (TIKA-1758) BatchCommandLineBuilder fails on systems with whitespace in path

2015-09-30 Thread Uwe Schindler (JIRA)
Uwe Schindler created TIKA-1758: --- Summary: BatchCommandLineBuilder fails on systems with whitespace in path Key: TIKA-1758 URL: https://issues.apache.org/jira/browse/TIKA-1758 Project: Tika

[jira] [Commented] (TIKA-1757) tika-batch tests fail on systems with whitespace or special chars in folder name

2015-09-30 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1757?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14938906#comment-14938906 ] Uwe Schindler commented on TIKA-1757: - Please wait with committing there are more tests failing with

[jira] [Updated] (TIKA-1756) Update forbiddenapis to v2.0

2015-09-30 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated TIKA-1756: Attachment: TIKA-1756.patch > Update forbiddenapis to v2.0 > > >

[jira] [Commented] (TIKA-1756) Update forbiddenapis to v2.0

2015-09-30 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1756?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14938879#comment-14938879 ] Uwe Schindler commented on TIKA-1756: - While testing this I found out that TIKA's test break when

[jira] [Updated] (TIKA-1757) tika-batch tests fail on systems with whitespace or special chars in folder name

2015-09-30 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated TIKA-1757: Attachment: TIKA-1757.patch Patch for broken test. > tika-batch tests fail on systems with

[jira] [Commented] (TIKA-1757) tika-batch tests fail on systems with whitespace or special chars in folder name

2015-09-30 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1757?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14938917#comment-14938917 ] Uwe Schindler commented on TIKA-1757: - bq. If one needs a java.nio.file.Path, Paths.get(url.toURI())

[jira] [Commented] (TIKA-1714) Consider making default host for Tika Server 0.0.0.0 instead of localhost

2015-08-18 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14701435#comment-14701435 ] Uwe Schindler commented on TIKA-1714: - If you want to bind for all, don't use 0.0.0.0,

[jira] [Commented] (TIKA-1714) Consider making default host for Tika Server 0.0.0.0 instead of localhost

2015-08-18 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14701442#comment-14701442 ] Uwe Schindler commented on TIKA-1714: - In any case, I agree with Nick, we should not do

[jira] [Commented] (TIKA-1706) Bring back commons-io to tika-core

2015-08-15 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14698313#comment-14698313 ] Uwe Schindler commented on TIKA-1706: - Yes, you can add the maven property

[jira] [Commented] (TIKA-1706) Bring back commons-io to tika-core

2015-08-14 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14697961#comment-14697961 ] Uwe Schindler commented on TIKA-1706: - If you bring in commons-io, you should also add

[jira] [Updated] (TIKA-1705) Update ASM dependency to 5.0.4

2015-08-11 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1705?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated TIKA-1705: Attachment: TIKA-1705-2.patch Sorry for a second patch. I just noticed that you were using

[jira] [Reopened] (TIKA-1705) Update ASM dependency to 5.0.4

2015-08-11 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1705?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler reopened TIKA-1705: - Reopen for 2nd patch. Update ASM dependency to 5.0.4 --

[jira] [Commented] (TIKA-1705) Update ASM dependency to 5.0.4

2015-08-11 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14681759#comment-14681759 ] Uwe Schindler commented on TIKA-1705: - The question about this: This will not fail

[jira] [Created] (TIKA-1705) Update ASM dependency to 5.0.4

2015-08-10 Thread Uwe Schindler (JIRA)
Uwe Schindler created TIKA-1705: --- Summary: Update ASM dependency to 5.0.4 Key: TIKA-1705 URL: https://issues.apache.org/jira/browse/TIKA-1705 Project: Tika Issue Type: Task Affects

[jira] [Updated] (TIKA-1705) Update ASM dependency to 5.0.4

2015-08-10 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1705?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated TIKA-1705: Attachment: TIKA-1705.patch Simple patch. All tests pass. Update ASM dependency to 5.0.4

[jira] [Commented] (TIKA-1675) please avoid xmlbeans dependency

2015-07-07 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1675?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14617578#comment-14617578 ] Uwe Schindler commented on TIKA-1675: - There was already an issue/discussion open on

[jira] [Comment Edited] (TIKA-1675) please avoid xmlbeans dependency

2015-07-07 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1675?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14617578#comment-14617578 ] Uwe Schindler edited comment on TIKA-1675 at 7/7/15 10:53 PM: --

[jira] [Commented] (TIKA-1675) please avoid xmlbeans dependency

2015-07-07 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1675?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14617588#comment-14617588 ] Uwe Schindler commented on TIKA-1675: - kiwiwings kiwiwi...@apache.org already proposed

[jira] [Commented] (TIKA-1637) Oracle internal API jdeps request for information

2015-05-25 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1637?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14558087#comment-14558087 ] Uwe Schindler commented on TIKA-1637: - Hi Dave, forbidden-apis already forbids use of

[jira] [Comment Edited] (TIKA-1637) Oracle internal API jdeps request for information

2015-05-25 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1637?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14558087#comment-14558087 ] Uwe Schindler edited comment on TIKA-1637 at 5/25/15 10:18 AM:

[jira] [Commented] (TIKA-1628) ExternalParser.check should return false if it hits SecurityException

2015-05-12 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14539838#comment-14539838 ] Uwe Schindler commented on TIKA-1628: - +1 to the patch. I don't think we need a test!

[jira] [Comment Edited] (TIKA-1582) Mime Detection based on neural networks with Byte-frequency-histogram

2015-05-02 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1582?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14525112#comment-14525112 ] Uwe Schindler edited comment on TIKA-1582 at 5/2/15 7:35 AM: -

[jira] [Commented] (TIKA-1582) Mime Detection based on neural networks with Byte-frequency-histogram

2015-05-02 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1582?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14525112#comment-14525112 ] Uwe Schindler commented on TIKA-1582: - Hi Chris, there is already forbidden-apis 1.8

[jira] [Commented] (TIKA-1511) Create a parser for SQLite3

2015-03-29 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14385803#comment-14385803 ] Uwe Schindler commented on TIKA-1511: - Solr uses ANT + IVY to build. We don't use

[jira] [Commented] (TIKA-1558) Create a Parser Blacklist

2015-02-23 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1558?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14333400#comment-14333400 ] Uwe Schindler commented on TIKA-1558: - Hi, Lucene uses SPI for its index codecs, so we

[jira] [Comment Edited] (TIKA-1558) Create a Parser Blacklist

2015-02-23 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1558?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14333400#comment-14333400 ] Uwe Schindler edited comment on TIKA-1558 at 2/23/15 4:06 PM: --

[jira] [Commented] (TIKA-1526) ExternalParser should trap/ignore/workarround JDK-8047340 JDK-8055301 so Turkish Tika users can still use non-external parsers

2015-02-23 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14333628#comment-14333628 ] Uwe Schindler commented on TIKA-1526: - Thanks David! ExternalParser should

[jira] [Commented] (TIKA-1557) Create TesseractOCR Option to Never Run

2015-02-20 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14329523#comment-14329523 ] Uwe Schindler commented on TIKA-1557: - I would not make this a special option only for

[jira] [Comment Edited] (TIKA-1557) Create TesseractOCR Option to Never Run

2015-02-20 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14329523#comment-14329523 ] Uwe Schindler edited comment on TIKA-1557 at 2/20/15 9:05 PM: --

[jira] [Comment Edited] (TIKA-1557) Create TesseractOCR Option to Never Run

2015-02-20 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14329523#comment-14329523 ] Uwe Schindler edited comment on TIKA-1557 at 2/20/15 8:42 PM: --

[jira] [Commented] (TIKA-1555) posix_spawn is not a supported process launch mechanism on this platform

2015-02-20 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14329276#comment-14329276 ] Uwe Schindler commented on TIKA-1555: - Also, this issue in the JDK is already fixed in

[jira] [Commented] (TIKA-1555) posix_spawn is not a supported process launch mechanism on this platform

2015-02-20 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14329282#comment-14329282 ] Uwe Schindler commented on TIKA-1555: - @UweSays:

[jira] [Commented] (TIKA-1555) posix_spawn is not a supported process launch mechanism on this platform

2015-02-20 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14329272#comment-14329272 ] Uwe Schindler commented on TIKA-1555: - This is a duplicate of TIKA-1526. posix_spawn

[jira] [Commented] (TIKA-1526) ExternalParser should trap/ignore/workarround JDK-8047340 JDK-8055301 so Turkish Tika users can still use non-external parsers

2015-02-20 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14329350#comment-14329350 ] Uwe Schindler commented on TIKA-1526: - I was not able to test this, because I have no

[jira] [Commented] (TIKA-1555) posix_spawn is not a supported process launch mechanism on this platform

2015-02-20 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14329344#comment-14329344 ] Uwe Schindler commented on TIKA-1555: - Hi David, can you try to compile Tika from

[jira] [Commented] (TIKA-1555) posix_spawn is not a supported process launch mechanism on this platform

2015-02-20 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14329364#comment-14329364 ] Uwe Schindler commented on TIKA-1555: - bq. BTW I wonder if we could add a setting which

[jira] [Commented] (TIKA-1555) posix_spawn is not a supported process launch mechanism on this platform

2015-02-20 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14329474#comment-14329474 ] Uwe Schindler commented on TIKA-1555: - bq. You can also disable OCR by setting the

[jira] [Commented] (TIKA-1526) ExternalParser should trap/ignore/workarround JDK-8047340 JDK-8055301 so Turkish Tika users can still use non-external parsers

2015-01-23 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14289125#comment-14289125 ] Uwe Schindler commented on TIKA-1526: - [~grossws]: This bug is not in Maven itsself,

[jira] [Commented] (TIKA-1526) ExternalParser should trap/ignore/workarround JDK-8047340 JDK-8055301 so Turkish Tika users can still use non-external parsers

2015-01-23 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14288963#comment-14288963 ] Uwe Schindler commented on TIKA-1526: - I tried it with maven, but this is all too

[jira] [Commented] (TIKA-1529) Turn forbidden-apis back on

2015-01-23 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14289438#comment-14289438 ] Uwe Schindler commented on TIKA-1529: - If you just check for ASCII chars in some string

[jira] [Comment Edited] (TIKA-1526) ExternalParser should trap/ignore/workarround JDK-8047340 JDK-8055301 so Turkish Tika users can still use non-external parsers

2015-01-23 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14289182#comment-14289182 ] Uwe Schindler edited comment on TIKA-1526 at 1/23/15 12:32 PM:

[jira] [Commented] (TIKA-1526) ExternalParser should trap/ignore/workarround JDK-8047340 JDK-8055301 so Turkish Tika users can still use non-external parsers

2015-01-23 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14289182#comment-14289182 ] Uwe Schindler commented on TIKA-1526: - To work around this bug you can in fact do this.

[jira] [Commented] (TIKA-1526) ExternalParser should trap/ignore/workarround JDK-8047340 JDK-8055301 so Turkish Tika users can still use non-external parsers

2015-01-22 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14288444#comment-14288444 ] Uwe Schindler commented on TIKA-1526: - Hi Tylor: The problem is explained above. To

[jira] [Comment Edited] (TIKA-1526) ExternalParser should trap/ignore/workarround JDK-8047340 JDK-8055301 so Turkish Tika users can still use non-external parsers

2015-01-22 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14288444#comment-14288444 ] Uwe Schindler edited comment on TIKA-1526 at 1/22/15 11:29 PM:

[jira] [Commented] (TIKA-1526) ExternalParser should trap/ignore/workarround JDK-8047340 JDK-8055301 so Turkish Tika users can still use non-external parsers

2015-01-22 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14287820#comment-14287820 ] Uwe Schindler commented on TIKA-1526: - FYI: The underlying bug in the JVM will never be

[jira] [Commented] (TIKA-1526) ExternalParser should trap/ignore/workarround JDK-8047340 JDK-8055301 so Turkish Tika users can still use non-external parsers

2015-01-22 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14287824#comment-14287824 ] Uwe Schindler commented on TIKA-1526: - Tim: Linux does not use posis spawn. You ned

[jira] [Comment Edited] (TIKA-1526) ExternalParser should trap/ignore/workarround JDK-8047340 JDK-8055301 so Turkish Tika users can still use non-external parsers

2015-01-22 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14287824#comment-14287824 ] Uwe Schindler edited comment on TIKA-1526 at 1/22/15 5:36 PM: --

[jira] [Commented] (TIKA-1526) ExternalParser should trap/ignore/workarround JDK-8047340 JDK-8055301 so Turkish Tika users can still use non-external parsers

2015-01-22 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14287850#comment-14287850 ] Uwe Schindler commented on TIKA-1526: - There is also a second problem: The bug is in

[jira] [Commented] (TIKA-1435) Update rome dependency to 1.5

2015-01-20 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283723#comment-14283723 ] Uwe Schindler commented on TIKA-1435: - Indeed this confused me while doing the Apache

[jira] [Commented] (TIKA-1523) metadata extractor gets the wrong number of pages of some documents Microsoft Word 9.0

2015-01-19 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283148#comment-14283148 ] Uwe Schindler commented on TIKA-1523: - Hi, I did some recherche: This is a bug in Word

[jira] [Comment Edited] (TIKA-1523) metadata extractor gets the wrong number of pages of some documents Microsoft Word 9.0

2015-01-19 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283116#comment-14283116 ] Uwe Schindler edited comment on TIKA-1523 at 1/19/15 10:50 PM:

[jira] [Updated] (TIKA-1523) metadata extractor gets the wrong number of pages of some documents Microsoft Word 9.0

2015-01-19 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated TIKA-1523: Attachment: screenshot-2.png metadata extractor gets the wrong number of pages of some documents

[jira] [Updated] (TIKA-1523) metadata extractor gets the wrong number of pages of some documents Microsoft Word 9.0

2015-01-19 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated TIKA-1523: Attachment: (was: screenshot-2.png) metadata extractor gets the wrong number of pages of some

[jira] [Updated] (TIKA-1523) metadata extractor gets the wrong number of pages of some documents Microsoft Word 9.0

2015-01-19 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated TIKA-1523: Attachment: screenshot-2.png metadata extractor gets the wrong number of pages of some documents

[jira] [Commented] (TIKA-1523) metadata extractor gets the wrong number of pages of some documents Microsoft Word 9.0

2015-01-19 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283116#comment-14283116 ] Uwe Schindler commented on TIKA-1523: - Yes. I extracts just the metadata. So I think

[jira] [Comment Edited] (TIKA-1523) metadata extractor gets the wrong number of pages of some documents Microsoft Word 9.0

2015-01-19 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283148#comment-14283148 ] Uwe Schindler edited comment on TIKA-1523 at 1/19/15 11:16 PM:

[jira] [Updated] (TIKA-1523) metadata extractor gets the wrong number of pages of some documents Microsoft Word 9.0

2015-01-19 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated TIKA-1523: Attachment: screenshot-1.png metadata extractor gets the wrong number of pages of some documents

[jira] [Commented] (TIKA-1523) metadata extractor gets the wrong number of pages of some documents Microsoft Word 9.0

2015-01-19 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14283092#comment-14283092 ] Uwe Schindler commented on TIKA-1523: - If I save the file with Office 2010, the page

[jira] [Commented] (TIKA-1457) NullPointerException in tika-app, parsing PDF content

2014-10-28 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14186533#comment-14186533 ] Uwe Schindler commented on TIKA-1457: - Hi, the next version of Solr with TIKA 1.6 will

[jira] [Comment Edited] (TIKA-1457) NullPointerException in tika-app, parsing PDF content

2014-10-28 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14186533#comment-14186533 ] Uwe Schindler edited comment on TIKA-1457 at 10/28/14 7:50 AM:

[jira] [Commented] (TIKA-1387) Add forbidden-apis checker to TIKA build

2014-10-25 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14184073#comment-14184073 ] Uwe Schindler commented on TIKA-1387: - I think this is already committed an working. I

[jira] [Commented] (TIKA-1387) Add forbidden-apis checker to TIKA build

2014-08-13 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14095798#comment-14095798 ] Uwe Schindler commented on TIKA-1387: - I think, for messages written in english

[jira] [Commented] (TIKA-1387) Add forbidden-apis checker to TIKA build

2014-08-13 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14095853#comment-14095853 ] Uwe Schindler commented on TIKA-1387: - Nick: in ImageMetadataExtractor.java, the date

[jira] [Created] (TIKA-1387) Add forbidden-apis checker to TIKA build

2014-08-06 Thread Uwe Schindler (JIRA)
Uwe Schindler created TIKA-1387: --- Summary: Add forbidden-apis checker to TIKA build Key: TIKA-1387 URL: https://issues.apache.org/jira/browse/TIKA-1387 Project: Tika Issue Type: Improvement

[jira] [Created] (TIKA-1386) Add forbidden-apis checker to TIKA build

2014-08-06 Thread Uwe Schindler (JIRA)
Uwe Schindler created TIKA-1386: --- Summary: Add forbidden-apis checker to TIKA build Key: TIKA-1386 URL: https://issues.apache.org/jira/browse/TIKA-1386 Project: Tika Issue Type: Improvement

[jira] [Closed] (TIKA-1386) Add forbidden-apis checker to TIKA build

2014-08-06 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1386?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler closed TIKA-1386. --- Resolution: Duplicate JIRA hung and created the issue 2 times. Add forbidden-apis checker to TIKA

[jira] [Updated] (TIKA-1387) Add forbidden-apis checker to TIKA build

2014-08-06 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1387?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated TIKA-1387: Attachment: TIKA-1387.patch This patch refactors the tika-java7 module a bit, so the forbidden-api

[jira] [Commented] (TIKA-1387) Add forbidden-apis checker to TIKA build

2014-08-06 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14087489#comment-14087489 ] Uwe Schindler commented on TIKA-1387: - One suggestion: The official name of the

[jira] [Updated] (TIKA-1387) Add forbidden-apis checker to TIKA build

2014-08-06 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1387?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated TIKA-1387: Attachment: TIKA-1387.patch Patch with renamed properties to conform to Maven standards. Add

[jira] [Commented] (TIKA-1387) Add forbidden-apis checker to TIKA build

2014-08-06 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1387?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14088088#comment-14088088 ] Uwe Schindler commented on TIKA-1387: - Hi I left a comment in the review. Was out for

[jira] [Reopened] (TIKA-1387) Add forbidden-apis checker to TIKA build

2014-08-06 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1387?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler reopened TIKA-1387: - I disagree wth some fixes, because they just workaround the forbidden-checks by still using system

[jira] [Commented] (TIKA-1252) Tika is not indexing all authors of a PDF

2014-03-03 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13918634#comment-13918634 ] Uwe Schindler commented on TIKA-1252: - This could be a problem in Solr's

[jira] [Commented] (TIKA-1252) Tika is not indexing all authors of a PDF

2014-03-03 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13918643#comment-13918643 ] Uwe Schindler commented on TIKA-1252: - I did a quick check in

[jira] [Comment Edited] (TIKA-1252) Tika is not indexing all authors of a PDF

2014-03-03 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13918643#comment-13918643 ] Uwe Schindler edited comment on TIKA-1252 at 3/3/14 10:17 PM: --

[jira] [Created] (TIKA-1211) OpenDocument (ODF) parser produces multipe startDocument() events

2013-12-17 Thread Uwe Schindler (JIRA)
Uwe Schindler created TIKA-1211: --- Summary: OpenDocument (ODF) parser produces multipe startDocument() events Key: TIKA-1211 URL: https://issues.apache.org/jira/browse/TIKA-1211 Project: Tika

[jira] [Updated] (TIKA-1211) OpenDocument (ODF) parser produces multiple startDocument() events

2013-12-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated TIKA-1211: Summary: OpenDocument (ODF) parser produces multiple startDocument() events (was: OpenDocument

[jira] [Commented] (TIKA-1211) OpenDocument (ODF) parser produces multiple startDocument() events

2013-12-17 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13850416#comment-13850416 ] Uwe Schindler commented on TIKA-1211: - There are multiple ways to fix this: - Make

[jira] [Commented] (TIKA-1181) RTFParser not keeping HTML font colors and underscore tags.

2013-10-07 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13788171#comment-13788171 ] Uwe Schindler commented on TIKA-1181: - Other parsers like OpenOffice do not preserve

[jira] [Commented] (TIKA-1134) ContentHandler gets ignorable whitespace for br tags when parsing HTML

2013-08-09 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13734769#comment-13734769 ] Uwe Schindler commented on TIKA-1134: - Hoss: I agree to fix this in the documentation.

[jira] [Commented] (TIKA-1134) ContentHandler gets ignorable whitespace for br tags when parsing HTML

2013-08-08 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13733344#comment-13733344 ] Uwe Schindler commented on TIKA-1134: - Hi Hoss, the rule in TIKA is: - TIKA inserts

[jira] [Commented] (TIKA-1134) ContentHandler gets ignorable whitespace for br tags when parsing HTML

2013-08-08 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13733348#comment-13733348 ] Uwe Schindler commented on TIKA-1134: - I think this issue is Won't fix. The issues

[jira] [Commented] (TIKA-1145) classloaders issue loading resources when extending Tika

2013-07-04 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13699793#comment-13699793 ] Uwe Schindler commented on TIKA-1145: - I think the main problem is ServiceLoader's

[jira] [Commented] (TIKA-1145) classloaders issue loading resources when extending Tika

2013-07-04 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13699883#comment-13699883 ] Uwe Schindler commented on TIKA-1145: - OK, I misunderstood the original problem. If you

[jira] [Commented] (TIKA-1145) classloaders issue loading resources when extending Tika

2013-07-04 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13699888#comment-13699888 ] Uwe Schindler commented on TIKA-1145: - It is still strange that you see this behaviour:

  1   2   >