[jira] [Resolved] (TIKA-3529) Fix sameserverid unit test for Windows and clean up server integration tests

2021-08-18 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-3529. --- Fix Version/s: 2.1.0 Assignee: Tim Allison Resolution: Fixed > Fix sameserverid unit

[jira] [Commented] (TIKA-3510) tika-parser-scientific-module seems to embbed many dependencies

2021-08-18 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17401343#comment-17401343 ] Tim Allison commented on TIKA-3510: --- Much better. Thank you! > tika-parser-scientific-module seems to

[jira] [Resolved] (TIKA-3532) Add clirr plugin back for 2.1.0

2021-08-18 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3532?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-3532. --- Fix Version/s: 2.1.0 Assignee: Tim Allison Resolution: Fixed > Add clirr plugin back

[jira] [Created] (TIKA-3532) Add clirr plugin back for 2.1.0

2021-08-18 Thread Tim Allison (Jira)
Tim Allison created TIKA-3532: - Summary: Add clirr plugin back for 2.1.0 Key: TIKA-3532 URL: https://issues.apache.org/jira/browse/TIKA-3532 Project: Tika Issue Type: Task Reporter:

[jira] [Commented] (TIKA-3530) Simplify dependencies via larger DependencyManagement section in tika-parent

2021-08-18 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17401316#comment-17401316 ] Hudson commented on TIKA-3530: -- SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk8 #318 (See

[jira] [Resolved] (TIKA-3531) Increase wait time for OpenSearch startup in integration test

2021-08-18 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-3531. --- Fix Version/s: 2.1.0 Assignee: Tim Allison Resolution: Fixed > Increase wait time for

[jira] [Commented] (TIKA-3529) Fix sameserverid unit test for Windows and clean up server integration tests

2021-08-18 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17401289#comment-17401289 ] Hudson commented on TIKA-3529: -- SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk8 #317 (See

[jira] [Updated] (TIKA-3531) Increase wait time for OpenSearch startup in integration test

2021-08-18 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-3531: -- Priority: Trivial (was: Major) > Increase wait time for OpenSearch startup in integration test >

[jira] [Created] (TIKA-3531) Increase wait time for OpenSearch startup in integration test

2021-08-18 Thread Tim Allison (Jira)
Tim Allison created TIKA-3531: - Summary: Increase wait time for OpenSearch startup in integration test Key: TIKA-3531 URL: https://issues.apache.org/jira/browse/TIKA-3531 Project: Tika Issue

[jira] [Commented] (TIKA-3530) Simplify dependencies via larger DependencyManagement section in tika-parent

2021-08-18 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17401263#comment-17401263 ] Tim Allison commented on TIKA-3530: --- bq. Showing with 341 additions and 1,267 deletions. > Simplify

[jira] [Updated] (TIKA-3530) Simplify dependencies via larger DependencyManagement section in tika-parent

2021-08-18 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-3530: -- Priority: Minor (was: Major) > Simplify dependencies via larger DependencyManagement section in

[jira] [Resolved] (TIKA-3530) Simplify dependencies via larger DependencyManagement section in tika-parent

2021-08-18 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-3530. --- Fix Version/s: 2.1.0 Assignee: Tim Allison Resolution: Fixed > Simplify dependencies

[jira] [Commented] (TIKA-3518) Tika 1.26 not Working with Tesseract 4.0 and Higher Version

2021-08-18 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17401255#comment-17401255 ] Hudson commented on TIKA-3518: -- UNSTABLE: Integrated in Jenkins build Tika » tika-main-jdk8 #316 (See

[jira] [Created] (TIKA-3530) Simplify dependencies via larger DependencyManagement section in tika-parent

2021-08-18 Thread Tim Allison (Jira)
Tim Allison created TIKA-3530: - Summary: Simplify dependencies via larger DependencyManagement section in tika-parent Key: TIKA-3530 URL: https://issues.apache.org/jira/browse/TIKA-3530 Project: Tika

[jira] [Resolved] (TIKA-3518) Tika 1.26 not Working with Tesseract 4.0 and Higher Version

2021-08-18 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-3518. --- Fix Version/s: 2.1.0 Resolution: Fixed > Tika 1.26 not Working with Tesseract 4.0 and Higher

[jira] [Created] (TIKA-3529) Fix sameserverid unit test for Windows and clean up server integration tests

2021-08-18 Thread Tim Allison (Jira)
Tim Allison created TIKA-3529: - Summary: Fix sameserverid unit test for Windows and clean up server integration tests Key: TIKA-3529 URL: https://issues.apache.org/jira/browse/TIKA-3529 Project: Tika

[jira] [Commented] (TIKA-3518) Tika 1.26 not Working with Tesseract 4.0 and Higher Version

2021-08-18 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17401163#comment-17401163 ] Hudson commented on TIKA-3518: -- SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk8 #315 (See

[jira] [Comment Edited] (TIKA-3518) Tika 1.26 not Working with Tesseract 4.0 and Higher Version

2021-08-18 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17401104#comment-17401104 ] Tim Allison edited comment on TIKA-3518 at 8/18/21, 2:58 PM: - -I'm wondering

[jira] [Commented] (TIKA-3518) Tika 1.26 not Working with Tesseract 4.0 and Higher Version

2021-08-18 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17401104#comment-17401104 ] Tim Allison commented on TIKA-3518: --- I'm wondering if in the earlier versions of tesseract, one pointed

[jira] [Comment Edited] (TIKA-3510) tika-parser-scientific-module seems to embbed many dependencies

2021-08-18 Thread Thomas Mortagne (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17401103#comment-17401103 ] Thomas Mortagne edited comment on TIKA-3510 at 8/18/21, 2:34 PM: - bq. Are

[jira] [Commented] (TIKA-3510) tika-parser-scientific-module seems to embbed many dependencies

2021-08-18 Thread Thomas Mortagne (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17401103#comment-17401103 ] Thomas Mortagne commented on TIKA-3510: --- bq. Are you ok with the changes?

[jira] [Comment Edited] (TIKA-3518) Tika 1.26 not Working with Tesseract 4.0 and Higher Version

2021-08-18 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17401093#comment-17401093 ] Tim Allison edited comment on TIKA-3518 at 8/18/21, 2:23 PM: - I think I

[jira] [Commented] (TIKA-3518) Tika 1.26 not Working with Tesseract 4.0 and Higher Version

2021-08-18 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17401093#comment-17401093 ] Tim Allison commented on TIKA-3518: --- I think I figured out what is going on... There is a bug. At

[jira] [Commented] (TIKA-3510) tika-parser-scientific-module seems to embbed many dependencies

2021-08-18 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17401087#comment-17401087 ] Tim Allison commented on TIKA-3510: --- No problem. After the most recent fix, I'm all set. I had done a

[jira] [Commented] (TIKA-3510) tika-parser-scientific-module seems to embbed many dependencies

2021-08-18 Thread Thomas Mortagne (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17401082#comment-17401082 ] Thomas Mortagne commented on TIKA-3510: --- bq. Thomas Mortagne, please take a look and see if this

[jira] [Commented] (TIKA-3523) A replacement for enableFileUrl or Support for Google Cloud

2021-08-18 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17400994#comment-17400994 ] Hudson commented on TIKA-3523: -- SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk8 #314 (See

[CANCELLED][VOTE] Release Apache Tika 2.1.0 Candidate #1

2021-08-18 Thread Tim Allison
-1 because of a bug found by Tilman: https://issues.apache.org/jira/browse/TIKA-3523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17400733#comment-17400733 I'll respin an rc2 later today. Please let me know if you find anything else. Thank you! Best, Tim On

[jira] [Commented] (TIKA-3528) WMV file detected as WMA (audio/x-ms-wma)

2021-08-18 Thread Nick Burch (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3528?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17400934#comment-17400934 ] Nick Burch commented on TIKA-3528: -- The specification document from Microsoft documents the following

[jira] [Commented] (TIKA-3528) WMV file detected as WMA (audio/x-ms-wma)

2021-08-18 Thread Nick Burch (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3528?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17400920#comment-17400920 ] Nick Burch commented on TIKA-3528: -- Currently we detect the video format based on the overall

[jira] [Updated] (TIKA-3528) WMV file detected as WMA (audio/x-ms-wma)

2021-08-18 Thread Nitish Gupta (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nitish Gupta updated TIKA-3528: --- Description: Attached file is detected as "audio/x-ms-wma" instead of "video/x-ms-asf". Link :

[jira] [Commented] (TIKA-3526) i cant extract content from attachments in the document

2021-08-18 Thread matcha007 (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17400847#comment-17400847 ] matcha007 commented on TIKA-3526: - I suspect this is related to my mixing WPS and office > i cant extract

[jira] [Commented] (TIKA-3526) i cant extract content from attachments in the document

2021-08-18 Thread matcha007 (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17400809#comment-17400809 ] matcha007 commented on TIKA-3526: - By the way, my project is using the AutoDetectParser. > i cant extract