[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-09-30 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17204920#comment-17204920 ] Tim Allison commented on TIKA-3094: --- I think this is resolved, and the fix will come out with 1.25. If

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-09-30 Thread Abhijit Rajwade (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17204502#comment-17204502 ] Abhijit Rajwade commented on TIKA-3094: --- [~tallison] [~bob] [~hudson] I don't know if this issue is

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-06-02 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17123914#comment-17123914 ] Hudson commented on TIKA-3094: -- SUCCESS: Integrated in Jenkins build tika-branch-1x #339 (See

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-05-07 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17102182#comment-17102182 ] Hudson commented on TIKA-3094: -- SUCCESS: Integrated in Jenkins build Tika-trunk #1813 (See

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-05-07 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17102137#comment-17102137 ] Bob Paulin commented on TIKA-3094: -- Looks like the jaxb error is not so much an issue with tika as it is

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-05-05 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17100054#comment-17100054 ] Hudson commented on TIKA-3094: -- SUCCESS: Integrated in Jenkins build Tika-trunk #1812 (See

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-05-05 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17099964#comment-17099964 ] Tim Allison commented on TIKA-3094: --- Thank you, [~bob]! On 3, that was my idiocy in not initializing a

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-05-05 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17099868#comment-17099868 ] Bob Paulin commented on TIKA-3094: -- Thanks [~tallison] .  For #2 JAXB was removed from the JDK

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-05-05 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17099859#comment-17099859 ] Tim Allison commented on TIKA-3094: --- Hi [~bob], I'll take #3. On 2, if you comment out the following in

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-05-05 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17099848#comment-17099848 ] Bob Paulin commented on TIKA-3094: -- Hey [~tallison] I ran a build on Java 8 and Java 11 and I was unable

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-05-04 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17099485#comment-17099485 ] Hudson commented on TIKA-3094: -- SUCCESS: Integrated in Jenkins build Tika-trunk #1811 (See

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-05-04 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17099475#comment-17099475 ] Tim Allison commented on TIKA-3094: --- Thank you [~bob]! For kicks, I ran the osgi'd Tika against all of

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-05-03 Thread Abhishek Chauhan (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17098690#comment-17098690 ] Abhishek Chauhan commented on TIKA-3094: Hello [~tallison]  [~bob] , I have increased the

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-04-30 Thread Abhijit Rajwade (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096324#comment-17096324 ] Abhijit Rajwade commented on TIKA-3094: --- Yes [~bob] thanks a lot for the prompt fix. > Apache Tika

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-04-30 Thread Abhishek Chauhan (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096319#comment-17096319 ] Abhishek Chauhan commented on TIKA-3094: Really thankful to [~bob] for resolving this ! > Apache

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-04-29 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17095964#comment-17095964 ] Hudson commented on TIKA-3094: -- SUCCESS: Integrated in Jenkins build tika-branch-1x #337 (See

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-04-29 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17095917#comment-17095917 ] Bob Paulin commented on TIKA-3094: -- Fixed with

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-04-29 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17095883#comment-17095883 ] Bob Paulin commented on TIKA-3094: -- Embedding SparseBitSet in Embed-Dependency fixes the issue.  Will be

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-04-29 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17095342#comment-17095342 ] Tim Allison commented on TIKA-3094: --- Y, exactly right. > Apache Tika fails to extract text for pptx

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-04-29 Thread Abhijit Rajwade (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17095133#comment-17095133 ] Abhijit Rajwade commented on TIKA-3094: --- I am working with [~abchauha] on this issue. One question.

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-04-28 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17094735#comment-17094735 ] Tim Allison commented on TIKA-3094: --- Thank you, [~bob]! > Apache Tika fails to extract text for pptx

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-04-28 Thread Abhishek Chauhan (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17094638#comment-17094638 ] Abhishek Chauhan commented on TIKA-3094: Glad ! Thanks for sharing this [~bob].  > Apache Tika

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-04-28 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17094614#comment-17094614 ] Bob Paulin commented on TIKA-3094: -- Thanks [~abchauha] .  The build process adds OSGi specific headers so

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-04-28 Thread Abhishek Chauhan (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17094594#comment-17094594 ] Abhishek Chauhan commented on TIKA-3094: [~bob] Please find the .pptx file attached.  Just would

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-04-28 Thread Bob Paulin (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17094517#comment-17094517 ] Bob Paulin commented on TIKA-3094: -- If SparseBitSet is embedded in the tika-bundle that the library

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-04-28 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17094466#comment-17094466 ] Tim Allison commented on TIKA-3094: --- [~bobpaulin], is this something we can fix within Tika or do we

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-04-28 Thread Abhishek Chauhan (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17094407#comment-17094407 ] Abhishek Chauhan commented on TIKA-3094: [~tallison] We are calling using OSGI bundle. Also, the

[jira] [Commented] (TIKA-3094) Apache Tika fails to extract text for pptx extension.

2020-04-28 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17094385#comment-17094385 ] Tim Allison commented on TIKA-3094: --- How are you calling Tika? Are you using the osgi bundle or calling