[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16212548#comment-16212548 ] Uwe Schindler commented on SOLR-8981: - [~steve_rowe]: I added a patch to other issue. The reason is that for safety I set the static "disable serialization" on the JMatIO parser in the init() of the plugin. I found out later by reviewing [~talli...@mitre.org]'s fork that the default is already using "false". But safe is safe. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement > Components: contrib - Solr Cell (Tika extraction) >Reporter: Tim Allison >Assignee: Uwe Schindler > Fix For: 5.5.5, 6.2, 7.0 > > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16212542#comment-16212542 ] Steve Rowe commented on SOLR-8981: -- bq. Hi Steve Rowe, did you try to remove the jmatio.jar file. Yes, I did remove the jmatio.jar file, and started the 5.5.5 RC1 vote with it removed. bq. How about trying to just update the JAR file by Tim Allison fork? I added an alternative patch to SOLR-11486! +1, I'll go respin. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement > Components: contrib - Solr Cell (Tika extraction) >Reporter: Tim Allison >Assignee: Uwe Schindler > Fix For: 5.5.5, 6.2, 7.0 > > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16212527#comment-16212527 ] Tim Allison commented on SOLR-8981: --- +1 Thank you, [~thetaphi]! > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement > Components: contrib - Solr Cell (Tika extraction) >Reporter: Tim Allison >Assignee: Uwe Schindler > Fix For: 5.5.5, 6.2, 7.0 > > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16212410#comment-16212410 ] Uwe Schindler commented on SOLR-8981: - I added an alternative patch to SOLR-8981! > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement > Components: contrib - Solr Cell (Tika extraction) >Reporter: Tim Allison >Assignee: Uwe Schindler > Fix For: 5.5.5, 6.2, 7.0 > > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16212317#comment-16212317 ] Uwe Schindler commented on SOLR-8981: - I just figured out: How about trying to just update the JAR file by [~talli...@mitre.org] fork? I looked at the code of MatParser.java, there were no real changes. The imports are the same and API calls! So we would keep old TIKA version for compatibility and just "patch" the CVE vuln with updating the JAR file. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement > Components: contrib - Solr Cell (Tika extraction) >Reporter: Tim Allison >Assignee: Uwe Schindler > Fix For: 5.5.5, 6.2, 7.0 > > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16212282#comment-16212282 ] Uwe Schindler commented on SOLR-8981: - Hi [~steve_rowe], did you try to remove the jmatio.jar file. Maybe we should add a simple test by adding a matlab file to figure out if it does not crush horribly. It should just refuse to parse the file with some exception of a not found class, as there is no parser. We recommended the same thing to people as quick workaround when the Microsoft Word XXE issues happend. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement > Components: contrib - Solr Cell (Tika extraction) >Reporter: Tim Allison >Assignee: Uwe Schindler > Fix For: 5.5.5, 6.2, 7.0 > > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16211864#comment-16211864 ] ASF subversion and git services commented on SOLR-8981: --- Commit 917798d5ad509ec5d13bebea10b5d6071bed6202 in lucene-solr's branch refs/heads/branch_5_5 from [~steve_rowe] [ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=917798d ] Revert "SOLR-8981: Update TIKA to 1.13:" This reverts commit 10fb52a64b4fa5ff999421912217d5c717fce12b. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement > Components: contrib - Solr Cell (Tika extraction) >Reporter: Tim Allison >Assignee: Uwe Schindler > Fix For: 5.5.5, 6.2, 7.0 > > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16211861#comment-16211861 ] ASF subversion and git services commented on SOLR-8981: --- Commit 8a554525a58fad1395a65b241dbdf9b8b5943ddb in lucene-solr's branch refs/heads/branch_5_5 from [~steve_rowe] [ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=8a55452 ] Revert "SOLR-8981: branch_5_5: CHANGES.txt: Tika 1.7-1.13" This reverts commit 2e5f7d14f62aae94dc117b39d78a731344a016dc. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement > Components: contrib - Solr Cell (Tika extraction) >Reporter: Tim Allison >Assignee: Uwe Schindler > Fix For: 5.5.5, 6.2, 7.0 > > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16211862#comment-16211862 ] ASF subversion and git services commented on SOLR-8981: --- Commit 75142f178c2807e1878366143857552a22e6e195 in lucene-solr's branch refs/heads/branch_5_5 from [~steve_rowe] [ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=75142f1 ] Revert "SOLR-8981: branch_5_5: fix bad CHANGES.txt merge" This reverts commit 2e97142c9161962cb1ba3e6ad6fa8a6f4faf85cf. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement > Components: contrib - Solr Cell (Tika extraction) >Reporter: Tim Allison >Assignee: Uwe Schindler > Fix For: 5.5.5, 6.2, 7.0 > > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16211863#comment-16211863 ] ASF subversion and git services commented on SOLR-8981: --- Commit 074f7209c62fe265f66a256c77ccfa8c3a247cf7 in lucene-solr's branch refs/heads/branch_5_5 from [~steve_rowe] [ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=074f720 ] Revert "SOLR-8981: Add notice for jackcess" This reverts commit dc84062e657035d0f8c07fd29fe2a32bc60827e0. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement > Components: contrib - Solr Cell (Tika extraction) >Reporter: Tim Allison >Assignee: Uwe Schindler > Fix For: 5.5.5, 6.2, 7.0 > > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16208641#comment-16208641 ] ASF subversion and git services commented on SOLR-8981: --- Commit 2e5f7d14f62aae94dc117b39d78a731344a016dc in lucene-solr's branch refs/heads/branch_5_5 from [~steve_rowe] [ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=2e5f7d1 ] SOLR-8981: branch_5_5: CHANGES.txt: Tika 1.7-1.13 > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement > Components: contrib - Solr Cell (Tika extraction) >Reporter: Tim Allison >Assignee: Uwe Schindler > Fix For: 5.5.5, 6.2, 7.0 > > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16208638#comment-16208638 ] ASF subversion and git services commented on SOLR-8981: --- Commit 2e97142c9161962cb1ba3e6ad6fa8a6f4faf85cf in lucene-solr's branch refs/heads/branch_5_5 from [~steve_rowe] [ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=2e97142 ] SOLR-8981: branch_5_5: fix bad CHANGES.txt merge > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement > Components: contrib - Solr Cell (Tika extraction) >Reporter: Tim Allison >Assignee: Uwe Schindler > Fix For: 5.5.5, 6.2, 7.0 > > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16208626#comment-16208626 ] ASF subversion and git services commented on SOLR-8981: --- Commit 10fb52a64b4fa5ff999421912217d5c717fce12b in lucene-solr's branch refs/heads/branch_5_5 from [~thetaphi] [ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=10fb52a ] SOLR-8981: Update TIKA to 1.13: - This commit merges branch 'SOLR-8981' of https://github.com/tballison/lucene-solr - Adds some modifications and reverts jackcess-encrypt addition (not yet working) - Fixes order of ivy-versions.properties - This closes #44 > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement > Components: contrib - Solr Cell (Tika extraction) >Reporter: Tim Allison >Assignee: Uwe Schindler > Fix For: 6.2, 7.0 > > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16208628#comment-16208628 ] ASF subversion and git services commented on SOLR-8981: --- Commit dc84062e657035d0f8c07fd29fe2a32bc60827e0 in lucene-solr's branch refs/heads/branch_5_5 from [~thetaphi] [ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=dc84062 ] SOLR-8981: Add notice for jackcess > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement > Components: contrib - Solr Cell (Tika extraction) >Reporter: Tim Allison >Assignee: Uwe Schindler > Fix For: 6.2, 7.0 > > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16208627#comment-16208627 ] ASF subversion and git services commented on SOLR-8981: --- Commit 10fb52a64b4fa5ff999421912217d5c717fce12b in lucene-solr's branch refs/heads/branch_5_5 from [~thetaphi] [ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=10fb52a ] SOLR-8981: Update TIKA to 1.13: - This commit merges branch 'SOLR-8981' of https://github.com/tballison/lucene-solr - Adds some modifications and reverts jackcess-encrypt addition (not yet working) - Fixes order of ivy-versions.properties - This closes #44 > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement > Components: contrib - Solr Cell (Tika extraction) >Reporter: Tim Allison >Assignee: Uwe Schindler > Fix For: 6.2, 7.0 > > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16208616#comment-16208616 ] Steve Rowe commented on SOLR-8981: -- Reopening to backport to branch_5_5. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement > Components: contrib - Solr Cell (Tika extraction) >Reporter: Tim Allison >Assignee: Uwe Schindler > Fix For: 6.2, 7.0 > > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15340484#comment-15340484 ] Uwe Schindler commented on SOLR-8981: - Done! > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement > Components: contrib - Solr Cell (Tika extraction) >Reporter: Tim Allison >Assignee: Uwe Schindler > Fix For: master (7.0), 6.2 > > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15340481#comment-15340481 ] ASF subversion and git services commented on SOLR-8981: --- Commit e50613cb81f7551996a9cdc76ae47bd9cbb84907 in lucene-solr's branch refs/heads/branch_6x from [~thetaphi] [ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=e50613c ] SOLR-8981: Add notice for jackcess > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement > Components: contrib - Solr Cell (Tika extraction) >Reporter: Tim Allison >Assignee: Uwe Schindler > Fix For: master (7.0), 6.2 > > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15340479#comment-15340479 ] ASF subversion and git services commented on SOLR-8981: --- Commit 9c88143bdaa0bbf304be1e8a81941dfe59c89f99 in lucene-solr's branch refs/heads/master from [~thetaphi] [ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=9c88143 ] SOLR-8981: Add notice for jackcess > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement > Components: contrib - Solr Cell (Tika extraction) >Reporter: Tim Allison >Assignee: Uwe Schindler > Fix For: master (7.0), 6.2 > > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15340337#comment-15340337 ] Tim Allison commented on SOLR-8981: --- Thank you, [~thetaphi]! Since my last push, I heard back from the main developer of Jackcess, James Ahlborn. Would it possible to modify the Copyright range in solr/NOTICE.txt to 2008-2016? {noformat} Jackcess: http://jackcess.sourceforge.net/ Copyright (C) 2008-2016 James Ahlborn {noformat} Thank you! > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement > Components: contrib - Solr Cell (Tika extraction) >Reporter: Tim Allison >Assignee: Uwe Schindler > Fix For: master (7.0), 6.2 > > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338244#comment-15338244 ] Tim Allison commented on SOLR-8981: --- +1 on SOLR-9221 > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement > Components: contrib - Solr Cell (Tika extraction) >Reporter: Tim Allison >Assignee: Uwe Schindler > Fix For: master (7.0), 6.2 > > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338233#comment-15338233 ] Uwe Schindler commented on SOLR-8981: - bq. Where is morph lines? Can someone please explain explicitly what the issue is here and I'll have a crack at fixing it. Morphlines is an external library by (I think) Cloudera This issue is: - It depends on an older version of Solr (which is a circular dependency) and also on an older version of TIKA (the one matching the Solr version at that time) - At some time there were 3 contribs donated to us, that made use of this library. The problem are the circular dependencies. - We cannot fix the problem in morphlines, as it is not our code. So we cannot prevent tests from failing. Because of that I proposed to remove it. Steve opened SOLR-9221 for that. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement > Components: contrib - Solr Cell (Tika extraction) >Reporter: Tim Allison >Assignee: Uwe Schindler > Fix For: master (7.0), 6.2 > > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338200#comment-15338200 ] Steve Rowe commented on SOLR-8981: -- bq. I'd like to get rid of the unmaintained morphlines code. See SOLR-9221. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement > Components: contrib - Solr Cell (Tika extraction) >Reporter: Tim Allison >Assignee: Uwe Schindler > Fix For: master (7.0), 6.2 > > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338199#comment-15338199 ] Lewis John McGibbney commented on SOLR-8981: Where is morph lines? Can someone please explain explicitly what the issue is here and I'll have a crack at fixing it. Thanks -- *Lewis* > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement > Components: contrib - Solr Cell (Tika extraction) >Reporter: Tim Allison >Assignee: Uwe Schindler > Fix For: master (7.0), 6.2 > > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338145#comment-15338145 ] Uwe Schindler commented on SOLR-8981: - ...which did not happen since... 1 or 2 years? > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement > Components: contrib - Solr Cell (Tika extraction) >Reporter: Tim Allison >Assignee: Uwe Schindler > Fix For: master (7.0), 6.2 > > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338143#comment-15338143 ] Uwe Schindler commented on SOLR-8981: - Same here. I already disabled some tests two years ago after upgrade to Tika 1.7 (!!!) Nobody fixed morphlines. To me the whole contribute is now dead code, sorry. This is a chicken egg problem: it depends on older Solr and Tika versions so it breaks on every update. But if we don't upgrade they will also not upgrade morphlines to newer Solr and Tika. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement > Components: contrib - Solr Cell (Tika extraction) >Reporter: Tim Allison >Assignee: Uwe Schindler > Fix For: master (7.0), 6.2 > > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338141#comment-15338141 ] Steve Rowe commented on SOLR-8981: -- bq. Can you try to fix this test or disable it for now? I made an issue to fix these tests (SOLR-9220), and I'll disable them for now, but I don't know anything about Kite/Morphlines, so I won't be pursuing a fix. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement > Components: contrib - Solr Cell (Tika extraction) >Reporter: Tim Allison >Assignee: Uwe Schindler > Fix For: master (7.0), 6.2 > > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338133#comment-15338133 ] Steve Rowe commented on SOLR-8981: -- My Jenkins has also found a mapreduce contrib test failure [http://jenkins.sarowe.net/job/Lucene-Solr-tests-6.x/1326/]: {noformat} Checking out Revision 1f7b9555076b4a46cc44cc9d4c8619ebe340f350 (refs/remotes/origin/branch_6x) [...] [junit4] 2> NOTE: reproduce with: ant test -Dtestcase=MorphlineMapperTest -Dtests.method=testMapper -Dtests.seed=6970D27EBC03F20D -Dtests.slow=true -Dtests.linedocsfile=/home/jenkins/lucene-data/enwiki.random.lines.txt -Dtests.locale=de-LU -Dtests.timezone=America/Antigua -Dtests.asserts=true -Dtests.file.encoding=UTF-8 [junit4] ERROR 13.2s J0 | MorphlineMapperTest.testMapper <<< [junit4]> Throwable #1: org.kitesdk.morphline.api.MorphlineCompilationException: Cannot instantiate Tika parser: org.apache.tika.parser.crypto.Pkcs7Parser near: { [junit4]> # /var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-map-reduce/test/J0/temp/solr.hadoop.MorphlineMapperTest_6970D27EBC03F20D-001/tempDir-001/test-morphlines/solrCellDocumentTypes.conf: 199 [junit4]> # rename "content" field to "text" fields [junit4]> "dateFormats" : [ [junit4]> # /var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-map-reduce/test/J0/temp/solr.hadoop.MorphlineMapperTest_6970D27EBC03F20D-001/tempDir-001/test-morphlines/solrCellDocumentTypes.conf: 199 [junit4]> "-MM-dd'T'HH:mm:ss", [junit4]> # /var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-map-reduce/test/J0/temp/solr.hadoop.MorphlineMapperTest_6970D27EBC03F20D-001/tempDir-001/test-morphlines/solrCellDocumentTypes.conf: 199 [junit4]> "-MM-dd" [junit4]> ], [junit4]> # /var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-map-reduce/test/J0/temp/solr.hadoop.MorphlineMapperTest_6970D27EBC03F20D-001/tempDir-001/test-morphlines/solrCellDocumentTypes.conf: 198 [junit4]> "fmap" : { [junit4]> # /var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-map-reduce/test/J0/temp/solr.hadoop.MorphlineMapperTest_6970D27EBC03F20D-001/tempDir-001/test-morphlines/solrCellDocumentTypes.conf: 198 [junit4]> "content-type" : "content_type", [junit4]> # /var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-map-reduce/test/J0/temp/solr.hadoop.MorphlineMapperTest_6970D27EBC03F20D-001/tempDir-001/test-morphlines/solrCellDocumentTypes.conf: 198 [junit4]> "content" : "text" [junit4]> }, [junit4]> # /var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-map-reduce/test/J0/temp/solr.hadoop.MorphlineMapperTest_6970D27EBC03F20D-001/tempDir-001/test-morphlines/solrCellDocumentTypes.conf: 207 [junit4]> # Tika parsers to be registered. If multiple parsers support the same MIME type, [junit4]> # the parser is chosen that is closest to the bottom in this list: [junit4]> "parsers" : [ [junit4]> # /var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-map-reduce/test/J0/temp/solr.hadoop.MorphlineMapperTest_6970D27EBC03F20D-001/tempDir-001/test-morphlines/solrCellDocumentTypes.conf: 208 [junit4]> { [junit4]> # /var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-map-reduce/test/J0/temp/solr.hadoop.MorphlineMapperTest_6970D27EBC03F20D-001/tempDir-001/test-morphlines/solrCellDocumentTypes.conf: 208 [junit4]> "parser" : "org.apache.tika.parser.asm.ClassParser" [junit4]> }, [junit4]> # /var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-map-reduce/test/J0/temp/solr.hadoop.MorphlineMapperTest_6970D27EBC03F20D-001/tempDir-001/test-morphlines/solrCellDocumentTypes.conf: 211 [junit4]> # { parser : org.apache.tika.parser.AutoDetectParser } [junit4]> # { parser : org.gagravarr.tika.OggParser, additionalSupportedMimeTypes : [audio/ogg] } [junit4]> { [junit4]> # /var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-map-reduce/test/J0/temp/solr.hadoop.MorphlineMapperTest_6970D27EBC03F20D-001/tempDir-001/test-morphlines/solrCellDocumentTypes.conf: 211 [junit4]> "parser" : "org.gagravarr.tika.FlacParser" [junit4]> }, [junit4]> #
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338107#comment-15338107 ] Uwe Schindler commented on SOLR-8981: - I was not able to test this on Windows. The problem with morphlines is that it seems to depend on old Tika versions. I'd like to get rid of the unmaintained morphlines code. Can you try to fix this test or disable it for now? Uwe -- Uwe Schindler H.-H.-Meier-Allee 63, 28213 Bremen http://www.thetaphi.de > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement > Components: contrib - Solr Cell (Tika extraction) >Reporter: Tim Allison >Assignee: Uwe Schindler > Fix For: master (7.0), 6.2 > > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338106#comment-15338106 ] Steve Rowe commented on SOLR-8981: -- My Jenkins found a {{SolrCellMorphlineTest.testSolrCellDocumentTypes2()}} failure [http://jenkins.sarowe.net/job/Lucene-Solr-tests-6.x/1325/]: {noformat} Checking out Revision 1f7b9555076b4a46cc44cc9d4c8619ebe340f350 (refs/remotes/origin/branch_6x) [...] [junit4] 2> NOTE: reproduce with: ant test -Dtestcase=SolrCellMorphlineTest -Dtests.method=testSolrCellDocumentTypes2 -Dtests.seed=9B88EA69660A1C83 -Dtests.slow=true -Dtests.linedocsfile=/home/jenkins/lucene-data/enwiki.random.lines.txt -Dtests.locale=en -Dtests.timezone=Europe/Skopje -Dtests.asserts=true -Dtests.file.encoding=UTF-8 [junit4] ERROR 11.2s | SolrCellMorphlineTest.testSolrCellDocumentTypes2 <<< [junit4]> Throwable #1: org.kitesdk.morphline.api.MorphlineCompilationException: Cannot instantiate Tika parser: org.apache.tika.parser.crypto.Pkcs7Parser near: { [junit4]> # /var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-morphlines-cell/test/J0/temp/solr.morphlines.cell.SolrCellMorphlineTest_9B88EA69660A1C83-001/tempDir-003/test-morphlines/solrCellDocumentTypes.conf: 199 [junit4]> # rename "content" field to "text" fields [junit4]> "dateFormats" : [ [junit4]> # /var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-morphlines-cell/test/J0/temp/solr.morphlines.cell.SolrCellMorphlineTest_9B88EA69660A1C83-001/tempDir-003/test-morphlines/solrCellDocumentTypes.conf: 199 [junit4]> "-MM-dd'T'HH:mm:ss", [junit4]> # /var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-morphlines-cell/test/J0/temp/solr.morphlines.cell.SolrCellMorphlineTest_9B88EA69660A1C83-001/tempDir-003/test-morphlines/solrCellDocumentTypes.conf: 199 [junit4]> "-MM-dd" [junit4]> ], [junit4]> # /var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-morphlines-cell/test/J0/temp/solr.morphlines.cell.SolrCellMorphlineTest_9B88EA69660A1C83-001/tempDir-003/test-morphlines/solrCellDocumentTypes.conf: 198 [junit4]> "fmap" : { [junit4]> # /var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-morphlines-cell/test/J0/temp/solr.morphlines.cell.SolrCellMorphlineTest_9B88EA69660A1C83-001/tempDir-003/test-morphlines/solrCellDocumentTypes.conf: 198 [junit4]> "content-type" : "content_type", [junit4]> # /var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-morphlines-cell/test/J0/temp/solr.morphlines.cell.SolrCellMorphlineTest_9B88EA69660A1C83-001/tempDir-003/test-morphlines/solrCellDocumentTypes.conf: 198 [junit4]> "content" : "text" [junit4]> }, [junit4]> # /var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-morphlines-cell/test/J0/temp/solr.morphlines.cell.SolrCellMorphlineTest_9B88EA69660A1C83-001/tempDir-003/test-morphlines/solrCellDocumentTypes.conf: 207 [junit4]> # Tika parsers to be registered. If multiple parsers support the same MIME type, [junit4]> # the parser is chosen that is closest to the bottom in this list: [junit4]> "parsers" : [ [junit4]> # /var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-morphlines-cell/test/J0/temp/solr.morphlines.cell.SolrCellMorphlineTest_9B88EA69660A1C83-001/tempDir-003/test-morphlines/solrCellDocumentTypes.conf: 208 [junit4]> { [junit4]> # /var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-morphlines-cell/test/J0/temp/solr.morphlines.cell.SolrCellMorphlineTest_9B88EA69660A1C83-001/tempDir-003/test-morphlines/solrCellDocumentTypes.conf: 208 [junit4]> "parser" : "org.apache.tika.parser.asm.ClassParser" [junit4]> }, [junit4]> # /var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-morphlines-cell/test/J0/temp/solr.morphlines.cell.SolrCellMorphlineTest_9B88EA69660A1C83-001/tempDir-003/test-morphlines/solrCellDocumentTypes.conf: 211 [junit4]> # { parser : org.apache.tika.parser.AutoDetectParser } [junit4]> # { parser : org.gagravarr.tika.OggParser, additionalSupportedMimeTypes : [audio/ogg] } [junit4]> { [junit4]> # /var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-morphlines-cell/test/J0/temp/solr.morphlines.cell.SolrCellMorphlineTest_9B88EA69660A1C83-001/tempDir-003/test-morphlines/solrCellDocumentTypes.conf: 211 [junit4]> "parser" : "org.gagravarr.tika.FlacParser"
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338062#comment-15338062 ] ASF subversion and git services commented on SOLR-8981: --- Commit 1f7b9555076b4a46cc44cc9d4c8619ebe340f350 in lucene-solr's branch refs/heads/branch_6x from [~thetaphi] [ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=1f7b955 ] SOLR-8981: Add changes entry > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338061#comment-15338061 ] ASF subversion and git services commented on SOLR-8981: --- Commit a7f89cd84314d11443a646842553d89e855bd358 in lucene-solr's branch refs/heads/master from [~thetaphi] [ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=a7f89cd ] SOLR-8981: Add changes entry > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338054#comment-15338054 ] ASF subversion and git services commented on SOLR-8981: --- Commit 7403b46c4daacfa977d7940961418c1b1fde346e in lucene-solr's branch refs/heads/branch_6x from [~thetaphi] [ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=7403b46c ] SOLR-8981: Update TIKA to 1.13: - This commit merges branch 'SOLR-8981' of https://github.com/tballison/lucene-solr - Adds some modifications and reverts jackcess-encrypt addition (not yet working) - Fixes order of ivy-versions.properties - This closes #44 > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338053#comment-15338053 ] ASF subversion and git services commented on SOLR-8981: --- Commit 7403b46c4daacfa977d7940961418c1b1fde346e in lucene-solr's branch refs/heads/branch_6x from [~thetaphi] [ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=7403b46c ] SOLR-8981: Update TIKA to 1.13: - This commit merges branch 'SOLR-8981' of https://github.com/tballison/lucene-solr - Adds some modifications and reverts jackcess-encrypt addition (not yet working) - Fixes order of ivy-versions.properties - This closes #44 > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338040#comment-15338040 ] ASF subversion and git services commented on SOLR-8981: --- Commit 19cb7404f5a592058ab4f675c11eea359ac8fdc3 in lucene-solr's branch refs/heads/master from [~thetaphi] [ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=19cb740 ] SOLR-8981: Update TIKA to 1.13: - This commit merges branch 'SOLR-8981' of https://github.com/tballison/lucene-solr - Adds some modifications and reverts jackcess-encrypt addition (not yet working) - Fixes order of ivy-versions.properties - This closes #44 > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338043#comment-15338043 ] ASF GitHub Bot commented on SOLR-8981: -- Github user asfgit closed the pull request at: https://github.com/apache/lucene-solr/pull/44 > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338041#comment-15338041 ] ASF subversion and git services commented on SOLR-8981: --- Commit 19cb7404f5a592058ab4f675c11eea359ac8fdc3 in lucene-solr's branch refs/heads/master from [~thetaphi] [ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=19cb740 ] SOLR-8981: Update TIKA to 1.13: - This commit merges branch 'SOLR-8981' of https://github.com/tballison/lucene-solr - Adds some modifications and reverts jackcess-encrypt addition (not yet working) - Fixes order of ivy-versions.properties - This closes #44 > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338037#comment-15338037 ] ASF subversion and git services commented on SOLR-8981: --- Commit 31c091b4856081f2d1b302499a436e5953779e5e in lucene-solr's branch refs/heads/master from [~talli...@mitre.org] [ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=31c091b ] SOLR-8981 clean up new lines, upgrade isoparser, add notice in CHANGES.txt > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338038#comment-15338038 ] ASF subversion and git services commented on SOLR-8981: --- Commit 785bebbcbd8f77ccc6d75acf3fb3d42ee29770fc in lucene-solr's branch refs/heads/master from [~talli...@mitre.org] [ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=785bebb ] SOLR-8981 remove "don't test with java-9" commands; fix bug introduced by TIKA-995 -- doubling of body elements in HTML tags; add copyright info for Jackcess. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338036#comment-15338036 ] ASF subversion and git services commented on SOLR-8981: --- Commit 1706b92790011f3ec5a85915adad3834e87d8970 in lucene-solr's branch refs/heads/master from [~talli...@mitre.org] [ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=1706b92 ] SOLR-8981 clean up license and sha1 info > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338039#comment-15338039 ] ASF subversion and git services commented on SOLR-8981: --- Commit dd09f0f42b07415bdf3ef54c5dbc3e2550bed688 in lucene-solr's branch refs/heads/master from [~talli...@mitre.org] [ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=dd09f0f ] SOLR-8981 add jackcess-encrypt > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338035#comment-15338035 ] ASF subversion and git services commented on SOLR-8981: --- Commit ba0e71703464849198b384aa6e92962db8a04b51 in lucene-solr's branch refs/heads/master from [~talli...@mitre.org] [ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=ba0e717 ] SOLR-8981 upgrade to Tika 1.13 > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15337781#comment-15337781 ] Uwe Schindler commented on SOLR-8981: - I am waiting for a statement by [~infrastruct...@apache.org]. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15337775#comment-15337775 ] Tim Allison commented on SOLR-8981: --- Probably lucene-solr's repo protecting itself from code that originated on my fork. :) > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15337749#comment-15337749 ] ASF GitHub Bot commented on SOLR-8981: -- Github user uschindler commented on the issue: https://github.com/apache/lucene-solr/pull/44 Ah OK, so no problem on my side. I'll wait a bit. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15337746#comment-15337746 ] ASF GitHub Bot commented on SOLR-8981: -- Github user lewismc commented on the issue: https://github.com/apache/lucene-solr/pull/44 Yes the server is buggered. Good work folks. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15337745#comment-15337745 ] ASF GitHub Bot commented on SOLR-8981: -- Github user uschindler commented on the issue: https://github.com/apache/lucene-solr/pull/44 Hi I have applied some other fixes and will push soon. Currently ASF have some problems with pushing: git.exe push --progress "origin" master:master Counting objects: 121, done. Delta compression using up to 8 threads. Compressing objects: 100% (66/66), done. Writing objects: 100% (121/121), 8.90 KiB | 0 bytes/s, done. Total 121 (delta 55), reused 17 (delta 2) remote: You are not authorized to edit this repository. remote: To https://git-wip-us.apache.org/repos/asf/lucene-solr.git ! [remote rejected] master -> master (pre-receive hook declined) error: failed to push some refs to 'https://git-wip-us.apache.org/repos/asf/lucene-solr.git' > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15337096#comment-15337096 ] Tim Allison commented on SOLR-8981: --- Yes, please! > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336962#comment-15336962 ] ASF GitHub Bot commented on SOLR-8981: -- Github user uschindler commented on the issue: https://github.com/apache/lucene-solr/pull/44 OK, the tests pass for me successfully. Should I remove the jackcess-encrypt package from your PR after merging (you said you will be away this weekend)? > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336955#comment-15336955 ] ASF GitHub Bot commented on SOLR-8981: -- Github user uschindler commented on a diff in the pull request: https://github.com/apache/lucene-solr/pull/44#discussion_r67575579 --- Diff: solr/contrib/morphlines-cell/src/test/org/apache/solr/morphlines/cell/SolrCellMorphlineTest.java --- @@ -42,8 +42,6 @@ @BeforeClass public static void beforeClass2() { assumeFalse("FIXME: Morphlines currently has issues with Windows paths", Constants.WINDOWS); -assumeFalse("This test fails with Java 9 (https://issues.apache.org/jira/browse/PDFBOX-3155, https://issues.apache.org/jira/browse/SOLR-8876)", --- End diff -- This should stay, because Hadoop related stuff also fails with Java 9. Maybe only remove the PDFBOX issue number. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336876#comment-15336876 ] ASF GitHub Bot commented on SOLR-8981: -- Github user uschindler commented on the issue: https://github.com/apache/lucene-solr/pull/44 Let's pick option 2 for now. Maybe update the rest of Solr after some review. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336797#comment-15336797 ] ASF GitHub Bot commented on SOLR-8981: -- Github user tballison commented on the issue: https://github.com/apache/lucene-solr/pull/44 > I also only have Windows :) How can you live with the failed builds?!? I wanted to help with [morphlines](https://mail-archives.apache.org/mod_mbox/lucene-solr-user/201606.mbox/%3CCY1PR09MB1115F9A08E97879D959D3CDCC7570%40CY1PR09MB1115.namprd09.prod.outlook.com%3E), but I can't easily do much... > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336794#comment-15336794 ] ASF GitHub Bot commented on SOLR-8981: -- Github user tballison commented on the issue: https://github.com/apache/lucene-solr/pull/44 If we leave out updating bouncycastle, I'm fairly confident that users will run problems at run time if they try to decrypt MSAccess and probably PDF and doc. We had a binary incompatibility between 1.52 and 1.54 with Jackcess: https://sourceforge.net/p/jackcessencrypt/feature-requests/2/ IIRC, the exception was thrown on any encrypted MSAccess file, not just those for which the user had a password. I see two options: 1) upgrade bouncycastle and hope we don't break other parts of Solr 2) announce decryption of Jackcess/POI/PDFBox as unsupported > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336790#comment-15336790 ] ASF GitHub Bot commented on SOLR-8981: -- Github user uschindler commented on the issue: https://github.com/apache/lucene-solr/pull/44 I also only have Windows :) I would leave out image format, but MS Access looks fine. Could we leave out updating bouncycastl then? > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336778#comment-15336778 ] ASF GitHub Bot commented on SOLR-8981: -- Github user tballison commented on the issue: https://github.com/apache/lucene-solr/pull/44 There will likely be some conflicts with bouncy castle. Tika 1.13: bcmail-jdk15on 1.54 bcprov-jdk15on 1.54 vs. Solr: org.bouncycastle.version = 1.45 /org.bouncycastle/bcmail-jdk15 = ${org.bouncycastle.version} /org.bouncycastle/bcprov-jdk15 = ${org.bouncycastle.version} > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336759#comment-15336759 ] ASF GitHub Bot commented on SOLR-8981: -- Github user tballison commented on the issue: https://github.com/apache/lucene-solr/pull/44 WebP is an image format. Jackcess encrypt is the library that allows users to decrypt MSAccess files. Please give it a go with Java 9. I can't easily test the morphlines stuff on my main dev box (Windows ... :( ). > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336740#comment-15336740 ] ASF GitHub Bot commented on SOLR-8981: -- Github user uschindler commented on the issue: https://github.com/apache/lucene-solr/pull/44 Did you check with Java 9 or should I do it? I am not sure about the last assume removed, because there is another SOLR issue in the assume message' not just the PDFBOX one. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336732#comment-15336732 ] ASF GitHub Bot commented on SOLR-8981: -- Github user uschindler commented on the issue: https://github.com/apache/lucene-solr/pull/44 What file formats are this? Documents? Otherwise please leave them out. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336653#comment-15336653 ] Tim Allison commented on SOLR-8981: --- In looking at [~lewismc]'s earlier work on 1.12 [here|https://issues.apache.org/jira/browse/SOLR-8716?focusedCommentId=15250294=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-15250294], it looks like I missed the webp parser and jackcess-encrypt. Should I add those? > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336625#comment-15336625 ] ASF GitHub Bot commented on SOLR-8981: -- Github user tballison commented on the issue: https://github.com/apache/lucene-solr/pull/44 Our bug introduced in TIKA-995. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336583#comment-15336583 ] ASF GitHub Bot commented on SOLR-8981: -- Github user tballison commented on the issue: https://github.com/apache/lucene-solr/pull/44 Not willing to point fingers... :) I'd like to track down the change in our history between 1.7 and 1.13 so that I actually understand what happened > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336571#comment-15336571 ] ASF GitHub Bot commented on SOLR-8981: -- Github user uschindler commented on the issue: https://github.com/apache/lucene-solr/pull/44 LOL. So is this a bug in Solr or in TIKA? Because it did not happen previously. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336563#comment-15336563 ] ASF GitHub Bot commented on SOLR-8981: -- Github user tballison commented on the issue: https://github.com/apache/lucene-solr/pull/44 The XHTMLContentHandler adds and . In out-of-the-box Tika with the DefaultHtmlMapper, "body" tags are not in the list of "SAFE_ELEMENTS", which means that the html's "body" tag is never passed through...so we don't see the doubling in Tika. The solution is to suppress the body tag in Solr's MostlyPassthroughHtmlMapper. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336547#comment-15336547 ] ASF GitHub Bot commented on SOLR-8981: -- Github user tballison commented on the issue: https://github.com/apache/lucene-solr/pull/44 Just found it. Confirming that fix doesn't break anything else. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336521#comment-15336521 ] ASF GitHub Bot commented on SOLR-8981: -- Github user uschindler commented on the issue: https://github.com/apache/lucene-solr/pull/44 Were you able to fix the test or should I look into it? > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336286#comment-15336286 ] ASF GitHub Bot commented on SOLR-8981: -- Github user tballison commented on the issue: https://github.com/apache/lucene-solr/pull/44 No, it is a self-contained test with a test file. +1 on local and _only_ local. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336285#comment-15336285 ] ASF GitHub Bot commented on SOLR-8981: -- Github user uschindler commented on the issue: https://github.com/apache/lucene-solr/pull/44 > will take a look. The test passed if you assumed that the html had two bodies, but that's crazy... I hope this test does not download the internet? It should all run local! I have not looked into it. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336281#comment-15336281 ] ASF GitHub Bot commented on SOLR-8981: -- Github user uschindler commented on the issue: https://github.com/apache/lucene-solr/pull/44 Grep for that one and remove them. Tests should pass then with latest Java 9: `assumeFalse("This test fails with Java 9 (https://issues.apache.org/jira/browse/PDFBOX-3155)", Constants.JRE_IS_MINIMUM_JAVA9);` > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336273#comment-15336273 ] ASF GitHub Bot commented on SOLR-8981: -- Github user uschindler commented on the issue: https://github.com/apache/lucene-solr/pull/44 OK, I will merge again later. So I will revert my checkout once you have fixed that. Otherwise all looks fine. BTW: Can you remove the assumeFalse on Java 9, because PDFBox is fixed? This was because on Java 9 PDFBOX failed in clinit (version number parsing failure). > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336268#comment-15336268 ] ASF GitHub Bot commented on SOLR-8981: -- Github user tballison commented on the issue: https://github.com/apache/lucene-solr/pull/44 argh... will take a look. The test passed if you assumed that the html had two bodies, but that's crazy... > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336270#comment-15336270 ] ASF GitHub Bot commented on SOLR-8981: -- GitHub user tballison reopened a pull request: https://github.com/apache/lucene-solr/pull/44 SOLR-8981 SOLR-8981 upgrade to Tika 1.13 You can merge this pull request into a Git repository by running: $ git pull https://github.com/tballison/lucene-solr SOLR-8981 Alternatively you can review and apply these changes as the patch at: https://github.com/apache/lucene-solr/pull/44.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #44 commit ba0e71703464849198b384aa6e92962db8a04b51 Author: tballisonDate: 2016-06-16T16:56:45Z SOLR-8981 upgrade to Tika 1.13 commit 1706b92790011f3ec5a85915adad3834e87d8970 Author: tballison Date: 2016-06-16T19:36:52Z SOLR-8981 clean up license and sha1 info commit 31c091b4856081f2d1b302499a436e5953779e5e Author: tballison Date: 2016-06-17T13:47:53Z SOLR-8981 clean up new lines, upgrade isoparser, add notice in CHANGES.txt > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336269#comment-15336269 ] ASF GitHub Bot commented on SOLR-8981: -- Github user tballison closed the pull request at: https://github.com/apache/lucene-solr/pull/44 > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336266#comment-15336266 ] ASF GitHub Bot commented on SOLR-8981: -- Github user lewismc commented on the issue: https://github.com/apache/lucene-solr/pull/44 @uschindler yep we've seen this before. I have no idea what is going on here. I'll look in to it again today. Can someone point out the exact code which does the XPath magic? > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336265#comment-15336265 ] ASF GitHub Bot commented on SOLR-8981: -- Github user tballison commented on the issue: https://github.com/apache/lucene-solr/pull/44 Y, I did run the extraction tests. That was the error we were getting initially, but which (without explanation) disappeared on my most recent integration attempt. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336267#comment-15336267 ] ASF GitHub Bot commented on SOLR-8981: -- Github user uschindler commented on the issue: https://github.com/apache/lucene-solr/pull/44 for me it still happens. I just merged the PR > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336262#comment-15336262 ] ASF GitHub Bot commented on SOLR-8981: -- Github user uschindler commented on the issue: https://github.com/apache/lucene-solr/pull/44 I merged everything successfully, but I get one test failure in solr/contrib/extraction: [junit4] FAILURE 0.05s J0 | ExtractingRequestHandlerTest.testXPath <<< [junit4]> Throwable #1: org.junit.ComparisonFailure: expected:<[News]> but was:<[]> [junit4]>at __randomizedtesting.SeedInfo.seed([404BA07016F1FB57:3E1A6EE30E469911]:0) I have the feeling I have seen this before. Weren't you running the extraction tests? > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336234#comment-15336234 ] Tim Allison commented on SOLR-8981: --- Great! Among the many improvements (esp. PDFBox 2.x), this version includes [jackcess|http://jackcess.sourceforge.net/] for parsing MSAccess files. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336224#comment-15336224 ] Uwe Schindler commented on SOLR-8981: - Thanks! I will merge the PR later this evening! For 6.1 it is now to late, but 6.2 will have this :-) Uwe > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336217#comment-15336217 ] Lewis John McGibbney commented on SOLR-8981: Nice work -- *Lewis* > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336170#comment-15336170 ] ASF GitHub Bot commented on SOLR-8981: -- Github user tballison commented on the issue: https://github.com/apache/lucene-solr/pull/44 Git (well, it was my fault, don't get me wrong) added the \r\n somehow. I had turned off autocrlf earlier. > C:\...>git config --get core.autocrlf input I realized I forgot to update the isoparser, and I cleaned up the Jackcess notice. Let me know how this looks now. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15334893#comment-15334893 ] ASF GitHub Bot commented on SOLR-8981: -- Github user uschindler commented on the issue: https://github.com/apache/lucene-solr/pull/44 > I think this should work... ant precommit worked in Linux with these modifications. I kept getting hangs with ant jar-checksums in Windows. If you checkout with git on windows using auto-eol it fails. The reason is git that threats sha1 files as text and converts their line endings. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15334520#comment-15334520 ] ASF GitHub Bot commented on SOLR-8981: -- Github user tballison commented on the issue: https://github.com/apache/lucene-solr/pull/44 I think I got it... ant precommit worked in Linux with these modifications. I kept getting hangs with ant jar-checksums in Windows. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15334235#comment-15334235 ] ASF GitHub Bot commented on SOLR-8981: -- Github user uschindler commented on the issue: https://github.com/apache/lucene-solr/pull/44 Hallo, please also update all SHA1 hashes of files. Plesae run "ant precommit" from root folder of Lu/Solr. This will report all missing things. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Assignee: Uwe Schindler >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15334228#comment-15334228 ] Uwe Schindler commented on SOLR-8981: - To test TIKA please only run tests inside contrib/extraction! Solr tests are generally unstable, especially on windows. See our Jenkins logs. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15334159#comment-15334159 ] Tim Allison commented on SOLR-8981: --- I got a build failure here: {noformat} Tests with failures [seed: C22A0B280C50BF8F]: [junit4] - org.apache.solr.handler.component.SpellCheckComponentTest.test {noformat} However, when I tested this alone, all was fine...different seed? Not sure if this is a regular build failure or something caused by the changes. [~lewismc], if you have a chance to review, I'd appreciate a second set of eyes before we bother [~thetaphi] for a review. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15334150#comment-15334150 ] ASF GitHub Bot commented on SOLR-8981: -- GitHub user tballison opened a pull request: https://github.com/apache/lucene-solr/pull/44 SOLR-8981 SOLR-8981 upgrade to Tika 1.13 You can merge this pull request into a Git repository by running: $ git pull https://github.com/tballison/lucene-solr SOLR-8981 Alternatively you can review and apply these changes as the patch at: https://github.com/apache/lucene-solr/pull/44.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #44 commit ba0e71703464849198b384aa6e92962db8a04b51 Author: tballisonDate: 2016-06-16T16:56:45Z SOLR-8981 upgrade to Tika 1.13 > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15334085#comment-15334085 ] Lewis John McGibbney commented on SOLR-8981: Brilliant. The most recent patch I submitted matches Tika 1.13 dependencies less scientific data formats and all of the other non 'document' formats. Thanks for rebuilding Tim it's appreciated. -- *Lewis* > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15334015#comment-15334015 ] Tim Allison commented on SOLR-8981: --- Just tested now, and the upgrade patch is no longer failing on that test (?!). If I get a fully clean build, I'll submit it. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15333929#comment-15333929 ] Tim Allison commented on SOLR-8981: --- Y, I think the only thing stopping us now was the unit test failure noted above. I'll take a look. I don't know if that'll be a blocker. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15333910#comment-15333910 ] Tommaso Teofili commented on SOLR-8981: --- IIRC there's a related Solr issue about upgrading to Tika 1.12 [~lewismc] was working on (progress slowed down by having to hand scraping which, transitive or not, dependencies needed to be updated or not back then). > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15333880#comment-15333880 ] Andriy Binetsky commented on SOLR-8981: --- Hi guys, Is it anything new about this issue? Where can I find mentioned sources? I'm asking because we are recently updating our Solr to 6.x version and would like to have Tika 1.13 with all bugfixes/improvements. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15308355#comment-15308355 ] Tim Allison commented on SOLR-8981: --- I'm getting a failure on that test too. I can't figure out what's going on. I'm getting exactly the same output with the standalone Tika 1.7 and 1.13 apps on the test file...argh... > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15302442#comment-15302442 ] Lewis John McGibbney commented on SOLR-8981: I am working on this again and will try to post a patch ASAP. [~talli...@mitre.org]. I have the following test failing in Solr https://github.com/apache/lucene-solr/blob/master/solr/contrib/extraction/src/test/org/apache/solr/handler/extraction/ExtractingRequestHandlerTest.java#L505 I have been debugging the tests with no luck as of yet. I'll post a new PR later today. The new PR is rebased against lucene-solr master and Tika 1.13 > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15302427#comment-15302427 ] Tim Allison commented on SOLR-8981: --- CVE-2016-4434: Apache Tika XML External Entity vulnerability in versions 0.10-1.12: [announcement|https://mail-archives.apache.org/mod_mbox/tika-dev/201605.mbox/%3C1705136517.1175366.1464278135251.JavaMail.yahoo%40mail.yahoo.com%3E] > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15289030#comment-15289030 ] Chris A. Mattmann commented on SOLR-8981: - correct [~talli...@apache.org] won't affect it for now. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15288803#comment-15288803 ] Tim Allison commented on SOLR-8981: --- Thanks to [~grossws], our traditional language detection API _should_ be unchanged in 1.13. #famouslastwords We've also added Optimaize and Julia under a new package (tika-langdetect) TIKA-1696. This new package allows easier integration for other language detection packages such as [Yalder|https://github.com/kkrugler/yalder] [~chrismattmann] and [~kkrugler], is the above correct? > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15288757#comment-15288757 ] Alexandre Rafalovitch commented on SOLR-8981: - Is this going to affect language detection module in Solr? Or is API unchanged? > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available
[ https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15285816#comment-15285816 ] Tim Allison commented on SOLR-8981: --- Tika 1.13 is now available. > Upgrade to Tika 1.13 when it is available > - > > Key: SOLR-8981 > URL: https://issues.apache.org/jira/browse/SOLR-8981 > Project: Solr > Issue Type: Improvement >Reporter: Tim Allison >Priority: Minor > > Tika 1.13 should be out within a month. This includes PDFBox 2.0.0 and a > number of other upgrades and improvements. > If there are any showstoppers in 1.13 from Solr's side or requests before we > roll 1.13, let us know. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org