[jira] [Commented] (TIKA-2756) Switch to commons-lang 3

2018-10-17 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2756?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16653720#comment-16653720 ] Hudson commented on TIKA-2756: -- SUCCESS: Integrated in Jenkins build tika-branch-1x #117 (See

[jira] [Commented] (TIKA-2757) Add versions-maven-plugin

2018-10-17 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2757?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16653721#comment-16653721 ] Hudson commented on TIKA-2757: -- SUCCESS: Integrated in Jenkins build tika-branch-1x #117 (See

[jira] [Commented] (TIKA-2543) No content extraction for application/x-webarchive format

2018-10-17 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2543?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16653804#comment-16653804 ] Nick Burch commented on TIKA-2543: -- Great find Tim! Looks like an excellent resource on this. Assuming

[jira] [Commented] (TIKA-2757) Add versions-maven-plugin

2018-10-17 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2757?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16653814#comment-16653814 ] Hudson commented on TIKA-2757: -- ABORTED: Integrated in Jenkins build Tika-trunk #1580 (See

[jira] [Commented] (TIKA-2577) Sonatype Nexus Auditor is reporting that the Bouncy castle version used by Tika 1.17 is vulnerable

2018-10-17 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16654010#comment-16654010 ] Tim Allison commented on TIKA-2577: --- Agreed. Tika 1.19.1 uses BouncyCastle 1.60. I just added the

[jira] [Resolved] (TIKA-2577) Sonatype Nexus Auditor is reporting that the Bouncy castle version used by Tika 1.17 is vulnerable

2018-10-17 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-2577. --- Resolution: Fixed Fix Version/s: 1.19 > Sonatype Nexus Auditor is reporting that the Bouncy

[jira] [Commented] (TIKA-2543) No content extraction for application/x-webarchive format

2018-10-17 Thread Rafael Ferreira (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2543?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16653711#comment-16653711 ] Rafael Ferreira commented on TIKA-2543: --- This seems like a more widespread issue than I imagined,

[jira] [Commented] (TIKA-2543) No content extraction for application/x-webarchive format

2018-10-17 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2543?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16653725#comment-16653725 ] Tim Allison commented on TIKA-2543: --- Still on lookout for Java parser with an Apache friendly license

[jira] [Commented] (TIKA-2577) Sonatype Nexus Auditor is reporting that the Bouncy castle version used by Tika 1.17 is vulnerable

2018-10-17 Thread Andrew Pavlin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16653994#comment-16653994 ] Andrew Pavlin commented on TIKA-2577: - I have to agree with the comment. Next build should include the

[jira] [Commented] (TIKA-2756) Switch to commons-lang 3

2018-10-17 Thread Robert Munteanu (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2756?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16653819#comment-16653819 ] Robert Munteanu commented on TIKA-2756: --- Thanks for looking into this [~talli...@apache.org]! >

[jira] [Commented] (TIKA-2543) No content extraction for application/x-webarchive format

2018-10-17 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2543?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16653718#comment-16653718 ] Tim Allison commented on TIKA-2543: --- TIKA-1358 might be relevant. We don't currently parse modern Apple

[jira] [Commented] (TIKA-2543) No content extraction for application/x-webarchive format

2018-10-17 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2543?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16653764#comment-16653764 ] Tim Allison commented on TIKA-2543: ---

[jira] [Updated] (TIKA-2543) No content extraction for application/x-webarchive format

2018-10-17 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-2543: -- Attachment: tika.plist > No content extraction for application/x-webarchive format >

[jira] [Commented] (TIKA-2543) No content extraction for application/x-webarchive format

2018-10-17 Thread Rafael Ferreira (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2543?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16653715#comment-16653715 ] Rafael Ferreira commented on TIKA-2543: --- If someone can point in the general area of the problem,

[jira] [Commented] (TIKA-2744) rss+xml doesnt accept files with .xml extension

2018-10-17 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2744?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16653808#comment-16653808 ] Nick Burch commented on TIKA-2744: -- I've added a test RSS 2.0 file to Tika's test documents, and it's

tika-2.x-windows - Build # 336 - Still Failing

2018-10-17 Thread Apache Jenkins Server
The Apache Jenkins build system has built tika-2.x-windows (build #336) Status: Still Failing Check console output at https://builds.apache.org/job/tika-2.x-windows/336/ to view the results.

[jira] [Updated] (TIKA-2744) rss+xml doesnt accept files with .xml extension

2018-10-17 Thread Martin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Martin updated TIKA-2744: - Attachment: rsstest.xml > rss+xml doesnt accept files with .xml extension >

[jira] [Commented] (TIKA-2744) rss+xml doesnt accept files with .xml extension

2018-10-17 Thread Martin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2744?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16654681#comment-16654681 ] Martin commented on TIKA-2744: -- Hello Guys, I apologize for late comment. I added attachment to this bug

[jira] [Commented] (TIKA-2734) Tika addes extra characters at the end of text in extracting from excel file

2018-10-17 Thread feng ye (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2734?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16654386#comment-16654386 ] feng ye commented on TIKA-2734: --- Thanks Tim for your detailed tips. I am using Tika to extract all kinds

[jira] [Commented] (TIKA-2734) Tika addes extra characters at the end of text in extracting from excel file

2018-10-17 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2734?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16654394#comment-16654394 ] Tim Allison commented on TIKA-2734: --- It will not. Let us know if you have any surprises. > Tika addes

[jira] [Commented] (TIKA-2756) Switch to commons-lang 3

2018-10-17 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2756?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16653603#comment-16653603 ] Tim Allison commented on TIKA-2756: --- I refactored the parts of our code that rely on {{commons-lang}}.

[jira] [Commented] (TIKA-2756) Switch to commons-lang 3

2018-10-17 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2756?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16653613#comment-16653613 ] Hudson commented on TIKA-2756: -- SUCCESS: Integrated in Jenkins build Tika-trunk #1579 (See

[jira] [Commented] (TIKA-2757) Add versions-maven-plugin

2018-10-17 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2757?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16653614#comment-16653614 ] Hudson commented on TIKA-2757: -- FAILURE: Integrated in Jenkins build tika-2.x-windows #335 (See

tika-2.x-windows - Build # 335 - Still Failing

2018-10-17 Thread Apache Jenkins Server
The Apache Jenkins build system has built tika-2.x-windows (build #335) Status: Still Failing Check console output at https://builds.apache.org/job/tika-2.x-windows/335/ to view the results.

[jira] [Commented] (TIKA-2756) Switch to commons-lang 3

2018-10-17 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2756?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16653529#comment-16653529 ] Hudson commented on TIKA-2756: -- FAILURE: Integrated in Jenkins build tika-2.x-windows #334 (See

tika-2.x-windows - Build # 334 - Failure

2018-10-17 Thread Apache Jenkins Server
The Apache Jenkins build system has built tika-2.x-windows (build #334) Status: Failure Check console output at https://builds.apache.org/job/tika-2.x-windows/334/ to view the results.

[jira] [Created] (TIKA-2757) Add versions-maven-plugin

2018-10-17 Thread Tim Allison (JIRA)
Tim Allison created TIKA-2757: - Summary: Add versions-maven-plugin Key: TIKA-2757 URL: https://issues.apache.org/jira/browse/TIKA-2757 Project: Tika Issue Type: Task Reporter: Tim

[jira] [Commented] (TIKA-2757) Add versions-maven-plugin

2018-10-17 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2757?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16653558#comment-16653558 ] Tim Allison commented on TIKA-2757: --- When I run {{versions:display-plugin-updates}}, I get this error

[jira] [Commented] (TIKA-2734) Tika addes extra characters at the end of text in extracting from excel file

2018-10-17 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2734?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16653483#comment-16653483 ] Tim Allison commented on TIKA-2734: --- The facade method of calling Tika doesn't include a ParseContext,

[jira] [Commented] (TIKA-2755) Allow Tika to skip extraction of tags in HTML

2018-10-17 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2755?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16653517#comment-16653517 ] Tim Allison commented on TIKA-2755: --- Doh. My fault, not yours. tika-server uses the