[GitHub] [tika] Gagravarr commented on pull request #366: Fix build fail caused by can't find test file

2020-09-30 Thread GitBox
Gagravarr commented on pull request #366: URL: https://github.com/apache/tika/pull/366#issuecomment-701622248 Thanks for spotting this, the perils of cherry-picking between 1.x and 2.x branches without paying enough attention! I think this might have already been fixed now, can you p

[jira] [Commented] (TIKA-3044) add -C/--content cli option using WriteOutContentHandler

2020-09-30 Thread Alexander Klimetschek (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17204961#comment-17204961 ] Alexander Klimetschek commented on TIKA-3044: {quote}The current proposal is

[jira] [Resolved] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-09-30 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-3094. Fix Version/s: 1.25 Resolution: Fixed > Apache Tika fails to extract text for pptx extension. >

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-09-30 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17204920#comment-17204920 ] Tim Allison commented on TIKA-3094: I think this is resolved, and the fix will come out

[jira] [Commented] (TIKA-3206) commons-io : 2.6, which is a transitive dependency of tika is vulnerable to "sonatype-2018-0705".

2020-09-30 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17204914#comment-17204914 ] Tim Allison commented on TIKA-3206: Thank you for opening this issue. We've already up

[jira] [Comment Edited] (TIKA-3044) add -C/--content cli option using WriteOutContentHandler

2020-09-30 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17204912#comment-17204912 ] Tim Allison edited comment on TIKA-3044 at 9/30/20, 5:38 PM:

[jira] [Commented] (TIKA-3044) add -C/--content cli option using WriteOutContentHandler

2020-09-30 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17204912#comment-17204912 ] Tim Allison commented on TIKA-3044: It has been a while since I looked at this part of

[jira] [Commented] (TIKA-3205) Mime magic for more certificate related formats

2020-09-30 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17204911#comment-17204911 ] Hudson commented on TIKA-3205: SUCCESS: Integrated in Jenkins build Tika » tika-branch1x-jdk

Re: TIKA-3044 contribution: add -C/--content cli option using WriteOutContentHandler

2020-09-30 Thread Tim Allison
Sorry for my delay. Reviewing now. On Wed, Sep 30, 2020 at 12:23 PM Alexander Klimetschek wrote: > Hi Tika Committers, > > I was wondering if [1] has a chance of getting added. It brings the > command line options on par with the Tika API for text extraction for the > very common use case of ge

TIKA-3044 contribution: add -C/--content cli option using WriteOutContentHandler

2020-09-30 Thread Alexander Klimetschek
Hi Tika Committers, I was wondering if [1] has a chance of getting added. It brings the command line options on par with the Tika API for text extraction for the very common use case of getting „all text“ for indexing. The patch [2] has unit tests and is IMO very straightforward. We rely on it

[jira] [Created] (TIKA-3206) commons-io : 2.6, which is a transitive dependency of tika is vulnerable to "sonatype-2018-0705".

2020-09-30 Thread Ankush Rana (Jira)
Ankush Rana created TIKA-3206. Summary: commons-io : 2.6, which is a transitive dependency of tika is vulnerable to "sonatype-2018-0705". Key: TIKA-3206 URL: https://issues.apache.org/jira/browse/TIKA-3206

[GitHub] [tika] PeterAlfredLee opened a new pull request #366: Fix build fail caused by can't find test file

2020-09-30 Thread GitBox
PeterAlfredLee opened a new pull request #366: URL: https://github.com/apache/tika/pull/366 Tested with the [latest commit 75c2ff5](https://github.com/apache/tika/commit/75c2ff5686a70c0fb15c4b52534c1be09669af1b) in the `main` branch and got some test failures because a test file could not be found. This PR i

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-09-30 Thread Abhijit Rajwade (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17204502#comment-17204502 ] Abhijit Rajwade commented on TIKA-3094: --- [~tallison] [~bob] [~hudson] I don't know