[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2017-10-20 Thread Uwe Schindler (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16212548#comment-16212548
 ] 

Uwe Schindler commented on SOLR-8981:
-

[~steve_rowe]: I added a patch to the other issue. The reason is that, for 
safety, I set the static "disable serialization" setting on the JMatIO parser 
in the init() of the plugin. I found out later, by reviewing 
[~talli...@mitre.org]'s fork, that the default is already "false". But safe is 
safe.

> Upgrade to Tika 1.13 when it is available
> -
>
> Key: SOLR-8981
> URL: https://issues.apache.org/jira/browse/SOLR-8981
> Project: Solr
>  Issue Type: Improvement
>  Components: contrib - Solr Cell (Tika extraction)
>Reporter: Tim Allison
>Assignee: Uwe Schindler
> Fix For: 5.5.5, 6.2, 7.0
>
>
> Tika 1.13 should be out within a month.  This includes PDFBox 2.0.0 and a 
> number of other upgrades and improvements.  
> If there are any showstoppers in 1.13 from Solr's side or requests before we 
> roll 1.13, let us know.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2017-10-20 Thread Steve Rowe (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16212542#comment-16212542
 ] 

Steve Rowe commented on SOLR-8981:
--

bq. Hi Steve Rowe, did you try to remove the jmatio.jar file.

Yes, I did remove the jmatio.jar file, and started the 5.5.5 RC1 vote with it 
removed.

bq. How about trying to just update the JAR file by Tim Allison fork?  I added 
an alternative patch to SOLR-11486!

+1, I'll go respin.






[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2017-10-20 Thread Tim Allison (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16212527#comment-16212527
 ] 

Tim Allison commented on SOLR-8981:
---

+1  Thank you, [~thetaphi]!




[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2017-10-20 Thread Uwe Schindler (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16212410#comment-16212410
 ] 

Uwe Schindler commented on SOLR-8981:
-

I added an alternative patch to SOLR-11486!




[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2017-10-20 Thread Uwe Schindler (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16212317#comment-16212317
 ] 

Uwe Schindler commented on SOLR-8981:
-

I just figured out: how about just updating the JAR file to the one from 
[~talli...@mitre.org]'s fork? I looked at the code of MatParser.java; there 
were no real changes. The imports and the API calls are the same! So we would 
keep the old TIKA version for compatibility and just "patch" the CVE 
vulnerability by updating the JAR file.




[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2017-10-20 Thread Uwe Schindler (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16212282#comment-16212282
 ] 

Uwe Schindler commented on SOLR-8981:
-

Hi [~steve_rowe],
did you try to remove the jmatio.jar file? Maybe we should add a simple test 
with a Matlab file to check that it does not crash horribly. It should just 
refuse to parse the file with some class-not-found exception, as there is no 
parser. We recommended the same thing to people as a quick workaround when the 
Microsoft Word XXE issues happened.
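
For anyone wanting to try the jar-removal workaround on an existing install, here is a minimal sketch in Python (not part of any patch on this issue). It assumes the JMatIO jar sits in a single lib directory and that its file name starts with "jmatio"; the function names and the directory are hypothetical and need to be adapted to the actual layout.

```python
from pathlib import Path


def find_parser_jars(lib_dir, name_prefix="jmatio"):
    """Return the sorted jar files in lib_dir whose name starts with name_prefix."""
    prefix = name_prefix.lower()
    return sorted(
        p for p in Path(lib_dir).glob("*.jar")
        if p.name.lower().startswith(prefix)
    )


def remove_parser_jars(lib_dir, name_prefix="jmatio", dry_run=True):
    """List the matching jars and delete them unless dry_run is True."""
    jars = find_parser_jars(lib_dir, name_prefix)
    if not dry_run:
        for jar in jars:
            jar.unlink()
    return [jar.name for jar in jars]
```

Running it once with dry_run=True shows which files would be removed before anything is deleted. It only removes the jar; whether the extraction then fails cleanly on a Matlab file still has to be checked separately.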




[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2017-10-19 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16211864#comment-16211864
 ] 

ASF subversion and git services commented on SOLR-8981:
---

Commit 917798d5ad509ec5d13bebea10b5d6071bed6202 in lucene-solr's branch 
refs/heads/branch_5_5 from [~steve_rowe]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=917798d ]

Revert "SOLR-8981: Update TIKA to 1.13:"

This reverts commit 10fb52a64b4fa5ff999421912217d5c717fce12b.





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2017-10-19 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16211861#comment-16211861
 ] 

ASF subversion and git services commented on SOLR-8981:
---

Commit 8a554525a58fad1395a65b241dbdf9b8b5943ddb in lucene-solr's branch 
refs/heads/branch_5_5 from [~steve_rowe]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=8a55452 ]

Revert "SOLR-8981: branch_5_5: CHANGES.txt: Tika 1.7-1.13"

This reverts commit 2e5f7d14f62aae94dc117b39d78a731344a016dc.





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2017-10-19 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16211862#comment-16211862
 ] 

ASF subversion and git services commented on SOLR-8981:
---

Commit 75142f178c2807e1878366143857552a22e6e195 in lucene-solr's branch 
refs/heads/branch_5_5 from [~steve_rowe]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=75142f1 ]

Revert "SOLR-8981: branch_5_5: fix bad CHANGES.txt merge"

This reverts commit 2e97142c9161962cb1ba3e6ad6fa8a6f4faf85cf.





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2017-10-19 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16211863#comment-16211863
 ] 

ASF subversion and git services commented on SOLR-8981:
---

Commit 074f7209c62fe265f66a256c77ccfa8c3a247cf7 in lucene-solr's branch 
refs/heads/branch_5_5 from [~steve_rowe]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=074f720 ]

Revert "SOLR-8981: Add notice for jackcess"

This reverts commit dc84062e657035d0f8c07fd29fe2a32bc60827e0.





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2017-10-17 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16208641#comment-16208641
 ] 

ASF subversion and git services commented on SOLR-8981:
---

Commit 2e5f7d14f62aae94dc117b39d78a731344a016dc in lucene-solr's branch 
refs/heads/branch_5_5 from [~steve_rowe]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=2e5f7d1 ]

SOLR-8981: branch_5_5: CHANGES.txt: Tika 1.7-1.13





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2017-10-17 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16208638#comment-16208638
 ] 

ASF subversion and git services commented on SOLR-8981:
---

Commit 2e97142c9161962cb1ba3e6ad6fa8a6f4faf85cf in lucene-solr's branch 
refs/heads/branch_5_5 from [~steve_rowe]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=2e97142 ]

SOLR-8981: branch_5_5: fix bad CHANGES.txt merge





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2017-10-17 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16208626#comment-16208626
 ] 

ASF subversion and git services commented on SOLR-8981:
---

Commit 10fb52a64b4fa5ff999421912217d5c717fce12b in lucene-solr's branch 
refs/heads/branch_5_5 from [~thetaphi]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=10fb52a ]

SOLR-8981: Update TIKA to 1.13:
- This commit merges branch 'SOLR-8981' of 
https://github.com/tballison/lucene-solr
- Adds some modifications and reverts jackcess-encrypt addition (not yet 
working)
- Fixes order of ivy-versions.properties
- This closes #44


> Upgrade to Tika 1.13 when it is available
> -
>
> Key: SOLR-8981
> URL: https://issues.apache.org/jira/browse/SOLR-8981
> Project: Solr
>  Issue Type: Improvement
>  Components: contrib - Solr Cell (Tika extraction)
>Reporter: Tim Allison
>Assignee: Uwe Schindler
> Fix For: 6.2, 7.0
>
>
> Tika 1.13 should be out within a month.  This includes PDFBox 2.0.0 and a 
> number of other upgrades and improvements.  
> If there are any showstoppers in 1.13 from Solr's side or requests before we 
> roll 1.13, let us know.






[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2017-10-17 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16208628#comment-16208628
 ] 

ASF subversion and git services commented on SOLR-8981:
---

Commit dc84062e657035d0f8c07fd29fe2a32bc60827e0 in lucene-solr's branch 
refs/heads/branch_5_5 from [~thetaphi]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=dc84062 ]

SOLR-8981: Add notice for jackcess





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2017-10-17 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16208627#comment-16208627
 ] 

ASF subversion and git services commented on SOLR-8981:
---

Commit 10fb52a64b4fa5ff999421912217d5c717fce12b in lucene-solr's branch 
refs/heads/branch_5_5 from [~thetaphi]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=10fb52a ]

SOLR-8981: Update TIKA to 1.13:
- This commit merges branch 'SOLR-8981' of 
https://github.com/tballison/lucene-solr
- Adds some modifications and reverts jackcess-encrypt addition (not yet 
working)
- Fixes order of ivy-versions.properties
- This closes #44





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2017-10-17 Thread Steve Rowe (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16208616#comment-16208616
 ] 

Steve Rowe commented on SOLR-8981:
--

Reopening to backport to branch_5_5.




[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-20 Thread Uwe Schindler (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15340484#comment-15340484
 ] 

Uwe Schindler commented on SOLR-8981:
-

Done!

> Upgrade to Tika 1.13 when it is available
> -
>
> Key: SOLR-8981
> URL: https://issues.apache.org/jira/browse/SOLR-8981
> Project: Solr
>  Issue Type: Improvement
>  Components: contrib - Solr Cell (Tika extraction)
>Reporter: Tim Allison
>Assignee: Uwe Schindler
> Fix For: master (7.0), 6.2
>
>
> Tika 1.13 should be out within a month.  This includes PDFBox 2.0.0 and a 
> number of other upgrades and improvements.  
> If there are any showstoppers in 1.13 from Solr's side or requests before we 
> roll 1.13, let us know.






[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-20 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15340481#comment-15340481
 ] 

ASF subversion and git services commented on SOLR-8981:
---

Commit e50613cb81f7551996a9cdc76ae47bd9cbb84907 in lucene-solr's branch 
refs/heads/branch_6x from [~thetaphi]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=e50613c ]

SOLR-8981: Add notice for jackcess





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-20 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15340479#comment-15340479
 ] 

ASF subversion and git services commented on SOLR-8981:
---

Commit 9c88143bdaa0bbf304be1e8a81941dfe59c89f99 in lucene-solr's branch 
refs/heads/master from [~thetaphi]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=9c88143 ]

SOLR-8981: Add notice for jackcess





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-20 Thread Tim Allison (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15340337#comment-15340337
 ] 

Tim Allison commented on SOLR-8981:
---

Thank you, [~thetaphi]!

Since my last push, I heard back from the main developer of Jackcess, James 
Ahlborn.

Would it be possible to modify the Copyright range in solr/NOTICE.txt to 2008-2016?

{noformat}
  Jackcess: http://jackcess.sourceforge.net/
  Copyright (C) 2008-2016 James Ahlborn
{noformat}

Thank you!




[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-18 Thread Tim Allison (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338244#comment-15338244
 ] 

Tim Allison commented on SOLR-8981:
---

+1 on SOLR-9221




[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-18 Thread Uwe Schindler (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338233#comment-15338233
 ] 

Uwe Schindler commented on SOLR-8981:
-

bq. Where is morph lines? Can someone please explain explicitly what the issue 
is here and I'll have a crack at fixing it.

Morphlines is an external library by (I think) Cloudera.

The issue is:
- It depends on an older version of Solr (which is a circular dependency) and 
also on an older version of TIKA (the one matching the Solr version at that 
time).
- At some point, 3 contribs that make use of this library were donated to us. 
The problem is the circular dependencies.
- We cannot fix the problem in morphlines, as it is not our code. So we cannot 
prevent tests from failing.

Because of that I proposed to remove it. Steve opened SOLR-9221 for that.
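
To make the circular dependency concrete, here is a small illustrative sketch in Python. The project-level graph is a simplification assumed for this example (real builds resolve versioned artifacts), and the find_cycle helper is hypothetical.

```python
def find_cycle(deps, start):
    """Return a dependency path leading from start back to start, or None."""
    stack = [(start, [start])]
    seen = set()
    while stack:
        node, path = stack.pop()
        for nxt in deps.get(node, ()):
            if nxt == start:
                # Reached the starting project again: this path is a cycle.
                return path + [nxt]
            if nxt not in seen:
                seen.add(nxt)
                stack.append((nxt, path + [nxt]))
    return None


# Assumed, simplified graph: Solr's contribs use morphlines, and morphlines
# in turn depends on (older) Solr and Tika releases.
deps = {
    "solr": ["tika", "morphlines"],
    "morphlines": ["solr", "tika"],
    "tika": [],
}

cycle = find_cycle(deps, "solr")
```

Here cycle comes out as solr, morphlines, solr: each Tika upgrade on the Solr side has to wait for a morphlines release, which itself waits for a Solr release.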




[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-18 Thread Steve Rowe (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338200#comment-15338200
 ] 

Steve Rowe commented on SOLR-8981:
--

bq. I'd like to get rid of the unmaintained morphlines code.

See SOLR-9221.




[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-18 Thread Lewis John McGibbney (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338199#comment-15338199
 ] 

Lewis John McGibbney commented on SOLR-8981:


Where is morph lines? Can someone please explain explicitly what the issue
is here and I'll have a crack at fixing it.
Thanks




-- 
*Lewis*





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-18 Thread Uwe Schindler (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338145#comment-15338145
 ] 

Uwe Schindler commented on SOLR-8981:
-

...which did not happen since... 1 or 2 years?




[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-18 Thread Uwe Schindler (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338143#comment-15338143
 ] 

Uwe Schindler commented on SOLR-8981:
-

Same here. I already disabled some tests two years ago after the upgrade to 
Tika 1.7 (!!!). Nobody fixed morphlines. To me the whole contrib is now dead 
code, sorry.

This is a chicken-and-egg problem: it depends on older Solr and Tika versions, 
so it breaks on every update. But if we don't upgrade, they will also not 
upgrade morphlines to newer Solr and Tika.
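
One way to see which parser class names listed in a config stop working after a Tika upgrade is a small reflection check. The following is only a sketch under assumptions, not how morphlines does it: the names {{ParserClassCheck}}, {{canInstantiate}} and {{findBroken}} are hypothetical, and it assumes a parser is created through an accessible no-arg constructor, which this thread does not confirm. It uses only the JDK, so running it against a real Solr/Tika classpath is up to the reader.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not part of Solr, Tika or Kite/Morphlines.
class ParserClassCheck {

    /** True if the class loads and an accessible no-arg constructor succeeds. */
    static boolean canInstantiate(String className) {
        try {
            Class<?> clazz = Class.forName(className);
            clazz.getDeclaredConstructor().newInstance();
            return true;
        } catch (ReflectiveOperationException | LinkageError e) {
            // Covers a missing class, a missing or inaccessible constructor,
            // a constructor that throws, and missing transitive dependencies.
            return false;
        }
    }

    /** Returns the subset of class names that cannot be instantiated, in input order. */
    static List<String> findBroken(List<String> classNames) {
        List<String> broken = new ArrayList<>();
        for (String name : classNames) {
            if (!canInstantiate(name)) {
                broken.add(name);
            }
        }
        return broken;
    }

    public static void main(String[] args) {
        // Class names taken from the failing test output in this thread.
        List<String> configured = Arrays.asList(
            "org.apache.tika.parser.crypto.Pkcs7Parser",
            "org.apache.tika.parser.asm.ClassParser",
            "org.gagravarr.tika.FlacParser");
        for (String name : findBroken(configured)) {
            System.out.println("Cannot instantiate: " + name);
        }
    }
}
```

Run with the same classpath the failing test uses; without the Tika jars on the classpath every name is reported, which says nothing about the upgrade itself.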




[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-18 Thread Steve Rowe (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338141#comment-15338141
 ] 

Steve Rowe commented on SOLR-8981:
--

bq. Can you try to fix this test or disable it for now?

I made an issue to fix these tests (SOLR-9220), and I'll disable them for now, 
but I don't know anything about Kite/Morphlines, so I won't be pursuing a fix.




[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-18 Thread Steve Rowe (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338133#comment-15338133
 ] 

Steve Rowe commented on SOLR-8981:
--

My Jenkins has also found a mapreduce contrib test failure 
[http://jenkins.sarowe.net/job/Lucene-Solr-tests-6.x/1326/]:

{noformat}
Checking out Revision 1f7b9555076b4a46cc44cc9d4c8619ebe340f350 
(refs/remotes/origin/branch_6x)
[...]
   [junit4]   2> NOTE: reproduce with: ant test  -Dtestcase=MorphlineMapperTest 
-Dtests.method=testMapper -Dtests.seed=6970D27EBC03F20D -Dtests.slow=true 
-Dtests.linedocsfile=/home/jenkins/lucene-data/enwiki.random.lines.txt 
-Dtests.locale=de-LU -Dtests.timezone=America/Antigua -Dtests.asserts=true 
-Dtests.file.encoding=UTF-8
   [junit4] ERROR   13.2s J0 | MorphlineMapperTest.testMapper <<<
   [junit4]> Throwable #1: 
org.kitesdk.morphline.api.MorphlineCompilationException: Cannot instantiate 
Tika parser: org.apache.tika.parser.crypto.Pkcs7Parser near: {
   [junit4]> # 
/var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-map-reduce/test/J0/temp/solr.hadoop.MorphlineMapperTest_6970D27EBC03F20D-001/tempDir-001/test-morphlines/solrCellDocumentTypes.conf:
 199
   [junit4]> #  rename "content" field to "text" fields
   [junit4]> "dateFormats" : [
   [junit4]> # 
/var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-map-reduce/test/J0/temp/solr.hadoop.MorphlineMapperTest_6970D27EBC03F20D-001/tempDir-001/test-morphlines/solrCellDocumentTypes.conf:
 199
   [junit4]> "-MM-dd'T'HH:mm:ss",
   [junit4]> # 
/var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-map-reduce/test/J0/temp/solr.hadoop.MorphlineMapperTest_6970D27EBC03F20D-001/tempDir-001/test-morphlines/solrCellDocumentTypes.conf:
 199
   [junit4]> "-MM-dd"
   [junit4]> ],
   [junit4]> # 
/var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-map-reduce/test/J0/temp/solr.hadoop.MorphlineMapperTest_6970D27EBC03F20D-001/tempDir-001/test-morphlines/solrCellDocumentTypes.conf:
 198
   [junit4]> "fmap" : {
   [junit4]> # 
/var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-map-reduce/test/J0/temp/solr.hadoop.MorphlineMapperTest_6970D27EBC03F20D-001/tempDir-001/test-morphlines/solrCellDocumentTypes.conf:
 198
   [junit4]> "content-type" : "content_type",
   [junit4]> # 
/var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-map-reduce/test/J0/temp/solr.hadoop.MorphlineMapperTest_6970D27EBC03F20D-001/tempDir-001/test-morphlines/solrCellDocumentTypes.conf:
 198
   [junit4]> "content" : "text"
   [junit4]> },
   [junit4]> # 
/var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-map-reduce/test/J0/temp/solr.hadoop.MorphlineMapperTest_6970D27EBC03F20D-001/tempDir-001/test-morphlines/solrCellDocumentTypes.conf:
 207
   [junit4]> #  Tika parsers to be registered. If multiple parsers 
support the same MIME type, 
   [junit4]> #  the parser is chosen that is closest to the bottom in 
this list:
   [junit4]> "parsers" : [
   [junit4]> # 
/var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-map-reduce/test/J0/temp/solr.hadoop.MorphlineMapperTest_6970D27EBC03F20D-001/tempDir-001/test-morphlines/solrCellDocumentTypes.conf:
 208
   [junit4]> {
   [junit4]> # 
/var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-map-reduce/test/J0/temp/solr.hadoop.MorphlineMapperTest_6970D27EBC03F20D-001/tempDir-001/test-morphlines/solrCellDocumentTypes.conf:
 208
   [junit4]> "parser" : "org.apache.tika.parser.asm.ClassParser"
   [junit4]> },
   [junit4]> # 
/var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-map-reduce/test/J0/temp/solr.hadoop.MorphlineMapperTest_6970D27EBC03F20D-001/tempDir-001/test-morphlines/solrCellDocumentTypes.conf:
 211
   [junit4]> #  { parser : org.apache.tika.parser.AutoDetectParser 
}  
   [junit4]> #  { parser : org.gagravarr.tika.OggParser, 
additionalSupportedMimeTypes : [audio/ogg] }
   [junit4]> {
   [junit4]> # 
/var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-map-reduce/test/J0/temp/solr.hadoop.MorphlineMapperTest_6970D27EBC03F20D-001/tempDir-001/test-morphlines/solrCellDocumentTypes.conf:
 211
   [junit4]> "parser" : "org.gagravarr.tika.FlacParser"
   [junit4]> },
   [junit4]> # 

[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-18 Thread Uwe Schindler (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338107#comment-15338107
 ] 

Uwe Schindler commented on SOLR-8981:
-

I was not able to test this on Windows. The problem with morphlines is that it 
seems to depend on old Tika versions.

I'd like to get rid of the unmaintained morphlines code.

Can you try to fix this test or disable it for now?

Uwe


--
Uwe Schindler
H.-H.-Meier-Allee 63, 28213 Bremen
http://www.thetaphi.de





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-18 Thread Steve Rowe (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338106#comment-15338106
 ] 

Steve Rowe commented on SOLR-8981:
--

My Jenkins found a {{SolrCellMorphlineTest.testSolrCellDocumentTypes2()}} 
failure [http://jenkins.sarowe.net/job/Lucene-Solr-tests-6.x/1325/]:

{noformat}
Checking out Revision 1f7b9555076b4a46cc44cc9d4c8619ebe340f350 
(refs/remotes/origin/branch_6x)
[...]
   [junit4]   2> NOTE: reproduce with: ant test  
-Dtestcase=SolrCellMorphlineTest -Dtests.method=testSolrCellDocumentTypes2 
-Dtests.seed=9B88EA69660A1C83 -Dtests.slow=true 
-Dtests.linedocsfile=/home/jenkins/lucene-data/enwiki.random.lines.txt 
-Dtests.locale=en -Dtests.timezone=Europe/Skopje -Dtests.asserts=true 
-Dtests.file.encoding=UTF-8
   [junit4] ERROR   11.2s | SolrCellMorphlineTest.testSolrCellDocumentTypes2 <<<
   [junit4]> Throwable #1: 
org.kitesdk.morphline.api.MorphlineCompilationException: Cannot instantiate 
Tika parser: org.apache.tika.parser.crypto.Pkcs7Parser near: {
   [junit4]> # 
/var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-morphlines-cell/test/J0/temp/solr.morphlines.cell.SolrCellMorphlineTest_9B88EA69660A1C83-001/tempDir-003/test-morphlines/solrCellDocumentTypes.conf:
 199
   [junit4]> #  rename "content" field to "text" fields
   [junit4]> "dateFormats" : [
   [junit4]> # 
/var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-morphlines-cell/test/J0/temp/solr.morphlines.cell.SolrCellMorphlineTest_9B88EA69660A1C83-001/tempDir-003/test-morphlines/solrCellDocumentTypes.conf:
 199
   [junit4]> "-MM-dd'T'HH:mm:ss",
   [junit4]> # 
/var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-morphlines-cell/test/J0/temp/solr.morphlines.cell.SolrCellMorphlineTest_9B88EA69660A1C83-001/tempDir-003/test-morphlines/solrCellDocumentTypes.conf:
 199
   [junit4]> "-MM-dd"
   [junit4]> ],
   [junit4]> # 
/var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-morphlines-cell/test/J0/temp/solr.morphlines.cell.SolrCellMorphlineTest_9B88EA69660A1C83-001/tempDir-003/test-morphlines/solrCellDocumentTypes.conf:
 198
   [junit4]> "fmap" : {
   [junit4]> # 
/var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-morphlines-cell/test/J0/temp/solr.morphlines.cell.SolrCellMorphlineTest_9B88EA69660A1C83-001/tempDir-003/test-morphlines/solrCellDocumentTypes.conf:
 198
   [junit4]> "content-type" : "content_type",
   [junit4]> # 
/var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-morphlines-cell/test/J0/temp/solr.morphlines.cell.SolrCellMorphlineTest_9B88EA69660A1C83-001/tempDir-003/test-morphlines/solrCellDocumentTypes.conf:
 198
   [junit4]> "content" : "text"
   [junit4]> },
   [junit4]> # 
/var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-morphlines-cell/test/J0/temp/solr.morphlines.cell.SolrCellMorphlineTest_9B88EA69660A1C83-001/tempDir-003/test-morphlines/solrCellDocumentTypes.conf:
 207
   [junit4]> #  Tika parsers to be registered. If multiple parsers 
support the same MIME type, 
   [junit4]> #  the parser is chosen that is closest to the bottom in 
this list:
   [junit4]> "parsers" : [
   [junit4]> # 
/var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-morphlines-cell/test/J0/temp/solr.morphlines.cell.SolrCellMorphlineTest_9B88EA69660A1C83-001/tempDir-003/test-morphlines/solrCellDocumentTypes.conf:
 208
   [junit4]> {
   [junit4]> # 
/var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-morphlines-cell/test/J0/temp/solr.morphlines.cell.SolrCellMorphlineTest_9B88EA69660A1C83-001/tempDir-003/test-morphlines/solrCellDocumentTypes.conf:
 208
   [junit4]> "parser" : "org.apache.tika.parser.asm.ClassParser"
   [junit4]> },
   [junit4]> # 
/var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-morphlines-cell/test/J0/temp/solr.morphlines.cell.SolrCellMorphlineTest_9B88EA69660A1C83-001/tempDir-003/test-morphlines/solrCellDocumentTypes.conf:
 211
   [junit4]> #  { parser : org.apache.tika.parser.AutoDetectParser 
}  
   [junit4]> #  { parser : org.gagravarr.tika.OggParser, 
additionalSupportedMimeTypes : [audio/ogg] }
   [junit4]> {
   [junit4]> # 
/var/lib/jenkins/jobs/Lucene-Solr-tests-6.x/workspace/solr/build/contrib/solr-morphlines-cell/test/J0/temp/solr.morphlines.cell.SolrCellMorphlineTest_9B88EA69660A1C83-001/tempDir-003/test-morphlines/solrCellDocumentTypes.conf:
 211
   [junit4]> "parser" : "org.gagravarr.tika.FlacParser"
 

[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-18 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338062#comment-15338062
 ] 

ASF subversion and git services commented on SOLR-8981:
---

Commit 1f7b9555076b4a46cc44cc9d4c8619ebe340f350 in lucene-solr's branch 
refs/heads/branch_6x from [~thetaphi]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=1f7b955 ]

SOLR-8981: Add changes entry


> Upgrade to Tika 1.13 when it is available
> -
>
> Key: SOLR-8981
> URL: https://issues.apache.org/jira/browse/SOLR-8981
> Project: Solr
>  Issue Type: Improvement
>Reporter: Tim Allison
>Assignee: Uwe Schindler
>Priority: Minor
>
> Tika 1.13 should be out within a month.  This includes PDFBox 2.0.0 and a 
> number of other upgrades and improvements.  
> If there are any showstoppers in 1.13 from Solr's side or requests before we 
> roll 1.13, let us know.






[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-18 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338061#comment-15338061
 ] 

ASF subversion and git services commented on SOLR-8981:
---

Commit a7f89cd84314d11443a646842553d89e855bd358 in lucene-solr's branch 
refs/heads/master from [~thetaphi]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=a7f89cd ]

SOLR-8981: Add changes entry





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-18 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338054#comment-15338054
 ] 

ASF subversion and git services commented on SOLR-8981:
---

Commit 7403b46c4daacfa977d7940961418c1b1fde346e in lucene-solr's branch 
refs/heads/branch_6x from [~thetaphi]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=7403b46c ]

SOLR-8981: Update TIKA to 1.13:
- This commit merges branch 'SOLR-8981' of 
https://github.com/tballison/lucene-solr
- Adds some modifications and reverts jackcess-encrypt addition (not yet 
working)
- Fixes order of ivy-versions.properties
- This closes #44





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-18 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338053#comment-15338053
 ] 

ASF subversion and git services commented on SOLR-8981:
---

Commit 7403b46c4daacfa977d7940961418c1b1fde346e in lucene-solr's branch 
refs/heads/branch_6x from [~thetaphi]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=7403b46c ]

SOLR-8981: Update TIKA to 1.13:
- This commit merges branch 'SOLR-8981' of 
https://github.com/tballison/lucene-solr
- Adds some modifications and reverts jackcess-encrypt addition (not yet 
working)
- Fixes order of ivy-versions.properties
- This closes #44





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-18 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338040#comment-15338040
 ] 

ASF subversion and git services commented on SOLR-8981:
---

Commit 19cb7404f5a592058ab4f675c11eea359ac8fdc3 in lucene-solr's branch 
refs/heads/master from [~thetaphi]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=19cb740 ]

SOLR-8981: Update TIKA to 1.13:
- This commit merges branch 'SOLR-8981' of 
https://github.com/tballison/lucene-solr
- Adds some modifications and reverts jackcess-encrypt addition (not yet 
working)
- Fixes order of ivy-versions.properties
- This closes #44





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-18 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338043#comment-15338043
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

Github user asfgit closed the pull request at:

https://github.com/apache/lucene-solr/pull/44





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-18 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338041#comment-15338041
 ] 

ASF subversion and git services commented on SOLR-8981:
---

Commit 19cb7404f5a592058ab4f675c11eea359ac8fdc3 in lucene-solr's branch 
refs/heads/master from [~thetaphi]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=19cb740 ]

SOLR-8981: Update TIKA to 1.13:
- This commit merges branch 'SOLR-8981' of 
https://github.com/tballison/lucene-solr
- Adds some modifications and reverts jackcess-encrypt addition (not yet 
working)
- Fixes order of ivy-versions.properties
- This closes #44





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-18 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338037#comment-15338037
 ] 

ASF subversion and git services commented on SOLR-8981:
---

Commit 31c091b4856081f2d1b302499a436e5953779e5e in lucene-solr's branch 
refs/heads/master from [~talli...@mitre.org]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=31c091b ]

SOLR-8981 clean up new lines, upgrade isoparser, add notice in CHANGES.txt





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-18 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338038#comment-15338038
 ] 

ASF subversion and git services commented on SOLR-8981:
---

Commit 785bebbcbd8f77ccc6d75acf3fb3d42ee29770fc in lucene-solr's branch 
refs/heads/master from [~talli...@mitre.org]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=785bebb ]

SOLR-8981 remove "don't test with java-9" commands; fix bug introduced by 
TIKA-995 -- doubling of body elements in HTML tags; add copyright info for 
Jackcess.





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-18 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338036#comment-15338036
 ] 

ASF subversion and git services commented on SOLR-8981:
---

Commit 1706b92790011f3ec5a85915adad3834e87d8970 in lucene-solr's branch 
refs/heads/master from [~talli...@mitre.org]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=1706b92 ]

SOLR-8981 clean up license and sha1 info





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-18 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338039#comment-15338039
 ] 

ASF subversion and git services commented on SOLR-8981:
---

Commit dd09f0f42b07415bdf3ef54c5dbc3e2550bed688 in lucene-solr's branch 
refs/heads/master from [~talli...@mitre.org]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=dd09f0f ]

SOLR-8981 add jackcess-encrypt





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-18 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15338035#comment-15338035
 ] 

ASF subversion and git services commented on SOLR-8981:
---

Commit ba0e71703464849198b384aa6e92962db8a04b51 in lucene-solr's branch 
refs/heads/master from [~talli...@mitre.org]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=ba0e717 ]

SOLR-8981 upgrade to Tika 1.13





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-18 Thread Uwe Schindler (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15337781#comment-15337781
 ] 

Uwe Schindler commented on SOLR-8981:
-

I am waiting for a statement by [~infrastruct...@apache.org].




[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-18 Thread Tim Allison (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15337775#comment-15337775
 ] 

Tim Allison commented on SOLR-8981:
---

Probably lucene-solr's repo protecting itself from code that originated on my 
fork. :)




[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-18 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15337749#comment-15337749
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

Github user uschindler commented on the issue:

https://github.com/apache/lucene-solr/pull/44
  
Ah OK, so no problem on my side. I'll wait a bit.





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-18 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15337746#comment-15337746
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

Github user lewismc commented on the issue:

https://github.com/apache/lucene-solr/pull/44
  
Yes the server is buggered. Good work folks.





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-18 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15337745#comment-15337745
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

Github user uschindler commented on the issue:

https://github.com/apache/lucene-solr/pull/44
  
Hi, I have applied some other fixes and will push soon. The ASF Git server 
currently has some problems with pushing:

git.exe push --progress "origin" master:master

Counting objects: 121, done.
Delta compression using up to 8 threads.
Compressing objects: 100% (66/66), done.
Writing objects: 100% (121/121), 8.90 KiB | 0 bytes/s, done.
Total 121 (delta 55), reused 17 (delta 2)
remote: You are not authorized to edit this repository.
remote:
To https://git-wip-us.apache.org/repos/asf/lucene-solr.git
! [remote rejected] master -> master (pre-receive hook declined)
error: failed to push some refs to 
'https://git-wip-us.apache.org/repos/asf/lucene-solr.git'






[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-17 Thread Tim Allison (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15337096#comment-15337096
 ] 

Tim Allison commented on SOLR-8981:
---

Yes, please!




[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-17 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336962#comment-15336962
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

Github user uschindler commented on the issue:

https://github.com/apache/lucene-solr/pull/44
  
OK, the tests pass for me successfully. Should I remove the 
jackcess-encrypt package from your PR after merging (you said you will be away 
this weekend)?





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-17 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336955#comment-15336955
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

Github user uschindler commented on a diff in the pull request:

https://github.com/apache/lucene-solr/pull/44#discussion_r67575579
  
--- Diff: solr/contrib/morphlines-cell/src/test/org/apache/solr/morphlines/cell/SolrCellMorphlineTest.java ---
@@ -42,8 +42,6 @@
   @BeforeClass
   public static void beforeClass2() {
     assumeFalse("FIXME: Morphlines currently has issues with Windows paths", Constants.WINDOWS);
-    assumeFalse("This test fails with Java 9 (https://issues.apache.org/jira/browse/PDFBOX-3155, https://issues.apache.org/jira/browse/SOLR-8876)",
--- End diff --

This should stay, because Hadoop related stuff also fails with Java 9. 
Maybe only remove the PDFBOX issue number.





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-17 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336876#comment-15336876
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

Github user uschindler commented on the issue:

https://github.com/apache/lucene-solr/pull/44
  
Let's pick option 2 for now. Maybe update the rest of Solr after some 
review.





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-17 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336797#comment-15336797
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

Github user tballison commented on the issue:

https://github.com/apache/lucene-solr/pull/44
  
> I also only have Windows :)

How can you live with the failed builds?!?  I wanted to help with 
[morphlines](https://mail-archives.apache.org/mod_mbox/lucene-solr-user/201606.mbox/%3CCY1PR09MB1115F9A08E97879D959D3CDCC7570%40CY1PR09MB1115.namprd09.prod.outlook.com%3E),
 but I can't easily do much...





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-17 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336794#comment-15336794
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

Github user tballison commented on the issue:

https://github.com/apache/lucene-solr/pull/44
  
If we leave out updating bouncycastle, I'm fairly confident that users will 
run into problems at runtime if they try to decrypt MSAccess and probably PDF 
and doc.

We had a binary incompatibility between 1.52 and 1.54 with Jackcess: 
https://sourceforge.net/p/jackcessencrypt/feature-requests/2/

IIRC, the exception was thrown on any encrypted MSAccess file, not just 
those for which the user had a password.

I see two options: 

1) upgrade bouncycastle and hope we don't break other parts of Solr
2) announce decryption of Jackcess/POI/PDFBox as unsupported







[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-17 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336790#comment-15336790
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

Github user uschindler commented on the issue:

https://github.com/apache/lucene-solr/pull/44
  
I also only have Windows :)

I would leave out the image format, but MS Access looks fine. Could we leave 
out updating bouncycastle then?





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-17 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336778#comment-15336778
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

Github user tballison commented on the issue:

https://github.com/apache/lucene-solr/pull/44
  
There will likely be some conflicts with bouncy castle.  

Tika 1.13:
bcmail-jdk15on  1.54
bcprov-jdk15on  1.54

vs. Solr:
org.bouncycastle.version = 1.45
/org.bouncycastle/bcmail-jdk15 = ${org.bouncycastle.version}
/org.bouncycastle/bcprov-jdk15 = ${org.bouncycastle.version}
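
When two versions of a library such as Bouncy Castle may both end up on the classpath, it can help to check which jar a class was actually loaded from. The following is a minimal sketch using only JDK calls; the class name `JarLocator` and its method are hypothetical and not part of Solr or Tika.

```java
import java.security.CodeSource;

/** Hypothetical helper: reports where a class was loaded from. */
final class JarLocator {
    private JarLocator() {}

    static String locationOf(Class<?> clazz) {
        CodeSource source = clazz.getProtectionDomain().getCodeSource();
        // Classes from the bootstrap loader typically have no code source.
        if (source == null || source.getLocation() == null) {
            return "(bootstrap or unknown)";
        }
        return source.getLocation().toString();
    }
}
```

Assuming a Bouncy Castle provider jar is present, calling `JarLocator.locationOf(Class.forName("org.bouncycastle.jce.provider.BouncyCastleProvider"))` should print the jar that won, which shows whether the 1.45 or the 1.54 artifact is in use.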






[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-17 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336759#comment-15336759
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

Github user tballison commented on the issue:

https://github.com/apache/lucene-solr/pull/44
  
WebP is an image format.
Jackcess encrypt is the library that allows users to decrypt MSAccess files.

Please give it a go with Java 9.  I can't easily test the morphlines stuff 
on my main dev box (Windows ... :( ).





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-17 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336740#comment-15336740
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

Github user uschindler commented on the issue:

https://github.com/apache/lucene-solr/pull/44
  
Did you check with Java 9 or should I do it? I am not sure about removing the 
last assume, because there is another SOLR issue in the assume message, not 
just the PDFBOX one.





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-17 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336732#comment-15336732
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

Github user uschindler commented on the issue:

https://github.com/apache/lucene-solr/pull/44
  
What file formats are these? Documents? Otherwise please leave them out.





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-17 Thread Tim Allison (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336653#comment-15336653
 ] 

Tim Allison commented on SOLR-8981:
---

In looking at [~lewismc]'s earlier work on 1.12 
[here|https://issues.apache.org/jira/browse/SOLR-8716?focusedCommentId=15250294=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-15250294],
 it looks like I missed the webp parser and jackcess-encrypt.  Should I add 
those?





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-17 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336625#comment-15336625
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

Github user tballison commented on the issue:

https://github.com/apache/lucene-solr/pull/44
  
Our bug, introduced in TIKA-995.





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-17 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336583#comment-15336583
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

Github user tballison commented on the issue:

https://github.com/apache/lucene-solr/pull/44
  
Not willing to point fingers... :)

I'd like to track down the change in our history between 1.7 and 1.13 so 
that I actually understand what happened.





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-17 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336571#comment-15336571
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

Github user uschindler commented on the issue:

https://github.com/apache/lucene-solr/pull/44
  
LOL. So is this a bug in Solr or in TIKA? Because it did not happen 
previously.





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-17 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336563#comment-15336563
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

Github user tballison commented on the issue:

https://github.com/apache/lucene-solr/pull/44
  
The XHTMLContentHandler adds <body> and </body>.  In out-of-the-box Tika 
with the DefaultHtmlMapper, "body" tags are not in the list of "SAFE_ELEMENTS", 
which means that the HTML's "body" tag is never passed through, so we don't 
see the doubling in Tika.

The solution is to suppress the body tag in Solr's 
MostlyPassthroughHtmlMapper.
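
The idea can be sketched as follows. This is only an illustration and not the committed fix: `SketchHtmlMapper` is a hypothetical local stand-in that mirrors the three mapping hooks of Tika's `HtmlMapper` interface so that the example compiles without Tika on the classpath, and `BodySuppressingMapper` is an invented name. It relies on the convention that returning null from `mapSafeElement` marks an element as not safe, so its tag is not passed through.

```java
import java.util.Locale;

/** Hypothetical stand-in for the mapping hooks of Tika's HtmlMapper. */
interface SketchHtmlMapper {
    String mapSafeElement(String name);
    boolean isDiscardElement(String name);
    String mapSafeAttribute(String elementName, String attributeName);
}

/** Passes nearly every element through, but suppresses "body". */
class BodySuppressingMapper implements SketchHtmlMapper {
    @Override
    public String mapSafeElement(String name) {
        String lower = name.toLowerCase(Locale.ROOT);
        // null means "not a safe element": the tag itself is dropped,
        // which avoids a second body next to the one the handler adds.
        return "body".equals(lower) ? null : lower;
    }

    @Override
    public boolean isDiscardElement(String name) {
        // Nothing is discarded together with its content.
        return false;
    }

    @Override
    public String mapSafeAttribute(String elementName, String attributeName) {
        return attributeName.toLowerCase(Locale.ROOT);
    }
}
```

Suppressing only the tag, rather than discarding the element, keeps the text inside the body in the extracted output.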





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-17 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336547#comment-15336547
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

Github user tballison commented on the issue:

https://github.com/apache/lucene-solr/pull/44
  
Just found it.  Confirming that fix doesn't break anything else.







[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-17 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336521#comment-15336521
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

Github user uschindler commented on the issue:

https://github.com/apache/lucene-solr/pull/44
  
Were you able to fix the test or should I look into it?





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-17 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336286#comment-15336286
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

Github user tballison commented on the issue:

https://github.com/apache/lucene-solr/pull/44
  
No, it is a self-contained test with a test file. +1 on local and _only_ 
local.





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-17 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336285#comment-15336285
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

Github user uschindler commented on the issue:

https://github.com/apache/lucene-solr/pull/44
  
> will take a look. The test passed if you assumed that the html had two 
bodies, but that's crazy...

I hope this test does not download the internet? It should all run locally! I 
have not looked into it.





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-17 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336281#comment-15336281
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

Github user uschindler commented on the issue:

https://github.com/apache/lucene-solr/pull/44
  
Grep for this one and remove every occurrence. Tests should then pass with 
the latest Java 9:
`assumeFalse("This test fails with Java 9 
(https://issues.apache.org/jira/browse/PDFBOX-3155)", 
Constants.JRE_IS_MINIMUM_JAVA9);`






[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-17 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336273#comment-15336273
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

Github user uschindler commented on the issue:

https://github.com/apache/lucene-solr/pull/44
  
OK, I will merge again later. So I will revert my checkout once you have 
fixed that. Otherwise all looks fine.

BTW: Can you remove the assumeFalse on Java 9, because PDFBox is fixed? 
The assume was there because on Java 9 PDFBox failed in its static 
initializer (clinit) with a version number parsing failure.
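
For context, Java 9 changed the `java.version` string from the old `1.x` scheme (for example `1.8.0_91`) to one that starts with the major number (for example `9` or `9-ea`), and code that assumes the old scheme can fail on it. The sketch below shows one tolerant way to extract the major version. It is an illustration only: the thread does not show PDFBox's actual parsing code, and `JavaVersionSketch` is a hypothetical name.

```java
/** Illustrative only: extracts the major Java version from a version string. */
final class JavaVersionSketch {
    private JavaVersionSketch() {}

    static int majorVersion(String version) {
        // Keep the leading run of digits and dots, dropping suffixes
        // such as "_91", "-ea" or "+181".
        String[] parts = version.split("[^0-9.]")[0].split("\\.");
        int first = Integer.parseInt(parts[0]);
        // Old scheme: "1.8.0_91" means Java 8. New scheme: "9.0.1" means Java 9.
        if (first == 1 && parts.length > 1) {
            return Integer.parseInt(parts[1]);
        }
        return first;
    }
}
```

A call such as `JavaVersionSketch.majorVersion(System.getProperty("java.version"))` then works the same way on Java 8 and Java 9 style version strings.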





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-17 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336268#comment-15336268
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

Github user tballison commented on the issue:

https://github.com/apache/lucene-solr/pull/44
  
argh...

will take a look.  The test passed if you assumed that the html had two 
bodies, but that's crazy...





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-17 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336270#comment-15336270
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

GitHub user tballison reopened a pull request:

https://github.com/apache/lucene-solr/pull/44

SOLR-8981

SOLR-8981 upgrade to Tika 1.13

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/tballison/lucene-solr SOLR-8981

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/lucene-solr/pull/44.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #44


commit ba0e71703464849198b384aa6e92962db8a04b51
Author: tballison 
Date:   2016-06-16T16:56:45Z

SOLR-8981 upgrade to Tika 1.13

commit 1706b92790011f3ec5a85915adad3834e87d8970
Author: tballison 
Date:   2016-06-16T19:36:52Z

SOLR-8981 clean up license and sha1 info

commit 31c091b4856081f2d1b302499a436e5953779e5e
Author: tballison 
Date:   2016-06-17T13:47:53Z

SOLR-8981 clean up new lines, upgrade isoparser, add notice in CHANGES.txt







[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-17 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336269#comment-15336269
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

Github user tballison closed the pull request at:

https://github.com/apache/lucene-solr/pull/44





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-17 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336266#comment-15336266
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

Github user lewismc commented on the issue:

https://github.com/apache/lucene-solr/pull/44
  
@uschindler yep, we've seen this before. I have no idea what is going on 
here. I'll look into it again today. Can someone point out the exact code 
that does the XPath magic?





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-17 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336265#comment-15336265
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

Github user tballison commented on the issue:

https://github.com/apache/lucene-solr/pull/44
  
Yes, I did run the extraction tests.  That was the error we were getting 
initially, but it disappeared (without explanation) on my most recent 
integration attempt.





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-17 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336267#comment-15336267
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

Github user uschindler commented on the issue:

https://github.com/apache/lucene-solr/pull/44
  
For me it still happens. I just merged the PR.





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-17 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336262#comment-15336262
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

Github user uschindler commented on the issue:

https://github.com/apache/lucene-solr/pull/44
  
I merged everything successfully, but I get one test failure in 
solr/contrib/extraction:

[junit4] FAILURE 0.05s J0 | ExtractingRequestHandlerTest.testXPath <<<
[junit4]> Throwable #1: org.junit.ComparisonFailure: expected:<[News]> but was:<[]>
[junit4]> at __randomizedtesting.SeedInfo.seed([404BA07016F1FB57:3E1A6EE30E469911]:0)

I have the feeling I have seen this before. Weren't you running the 
extraction tests?





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-17 Thread Tim Allison (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336234#comment-15336234
 ] 

Tim Allison commented on SOLR-8981:
---

Great!  Among the many improvements (esp. PDFBox 2.x), this version includes 
[jackcess|http://jackcess.sourceforge.net/] for parsing MSAccess files.




[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-17 Thread Uwe Schindler (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336224#comment-15336224
 ] 

Uwe Schindler commented on SOLR-8981:
-

Thanks! I will merge the PR later this evening! For 6.1 it is now too late, but 
6.2 will have this :-)

Uwe




[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-17 Thread Lewis John McGibbney (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336217#comment-15336217
 ] 

Lewis John McGibbney commented on SOLR-8981:


Nice work




-- 
*Lewis*





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-17 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336170#comment-15336170
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

Github user tballison commented on the issue:

https://github.com/apache/lucene-solr/pull/44
  
Git (well, it was my fault, don't get me wrong) added the \r\n somehow.  I 
had turned off autocrlf earlier.

> C:\...>git config --get core.autocrlf
input

I realized I forgot to update the isoparser, and I cleaned up the Jackcess 
notice.

Let me know how this looks now.





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-16 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15334893#comment-15334893
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

Github user uschindler commented on the issue:

https://github.com/apache/lucene-solr/pull/44
  
> I think this should work... ant precommit worked in Linux with these 
modifications. I kept getting hangs with ant jar-checksums in Windows.

If you check out with git on Windows using auto-eol, it fails. The reason is 
that git treats sha1 files as text and converts their line endings.
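
To make the line-ending problem concrete, here is a minimal Java sketch. It
assumes a checksum file that holds a hex SHA-1 digest followed by a newline;
the class and method names are hypothetical, and this is not how the
Lucene/Solr build actually verifies checksums. It shows that an LF-to-CRLF
conversion changes the file's bytes, so a byte-exact comparison fails even
though the digest itself is unchanged.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.Arrays;

// Hypothetical demo class; not part of the Lucene/Solr build.
final class ChecksumLineEndingDemo {

    // Returns the SHA-1 digest of data as lowercase hex.
    static String sha1Hex(byte[] data) {
        try {
            StringBuilder hex = new StringBuilder();
            for (byte b : MessageDigest.getInstance("SHA-1").digest(data)) {
                hex.append(String.format("%02x", b));
            }
            return hex.toString();
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException(e);
        }
    }

    // Byte-exact comparison of two checksum-file contents.
    static boolean sameBytes(String a, String b) {
        return Arrays.equals(a.getBytes(StandardCharsets.US_ASCII),
                             b.getBytes(StandardCharsets.US_ASCII));
    }

    // Comparison that ignores leading/trailing whitespace, including \r and \n.
    static boolean sameDigest(String a, String b) {
        return a.trim().equals(b.trim());
    }

    public static void main(String[] args) {
        String digest = sha1Hex("pretend jar bytes".getBytes(StandardCharsets.US_ASCII));
        String committed = digest + "\n";    // as stored in the repository
        String checkedOut = digest + "\r\n"; // after an automatic EOL conversion

        System.out.println(sameBytes(committed, checkedOut));  // prints "false"
        System.out.println(sameDigest(committed, checkedOut)); // prints "true"
    }
}
```

On the git side, marking such files with the `-text` attribute in
.gitattributes is one general way to keep git from converting them.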





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-16 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15334520#comment-15334520
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

Github user tballison commented on the issue:

https://github.com/apache/lucene-solr/pull/44
  
I think I got it...  ant precommit worked in Linux with these 
modifications.  I kept getting hangs with ant jar-checksums in Windows.





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-16 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15334235#comment-15334235
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

Github user uschindler commented on the issue:

https://github.com/apache/lucene-solr/pull/44
  
Hello,
please also update the SHA1 hashes of all files. Please run "ant precommit" 
from the root folder of Lucene/Solr. This will report all missing things.





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-16 Thread Uwe Schindler (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15334228#comment-15334228
 ] 

Uwe Schindler commented on SOLR-8981:
-

To test Tika, please only run the tests inside contrib/extraction!

Solr tests are generally unstable, especially on Windows. See our Jenkins logs.




[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-16 Thread Tim Allison (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15334159#comment-15334159
 ] 

Tim Allison commented on SOLR-8981:
---

I got a build failure here:
{noformat}
 Tests with failures [seed: C22A0B280C50BF8F]:
   [junit4]   - org.apache.solr.handler.component.SpellCheckComponentTest.test
{noformat}

However, when I tested this alone, all was fine...different seed?
Not sure if this is a regular build failure or something caused by the changes.

[~lewismc], if you have a chance to review, I'd appreciate a second set of eyes 
before we bother [~thetaphi] for a review.
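
The "different seed?" guess can be illustrated with a small sketch. It uses
plain java.util.Random as a stand-in for the randomized test runner (an
assumption; the runner's internals are not shown in this thread), and the
class and method names are hypothetical. The point is only that the same seed
replays the same pseudo-random choices, so a failure that depends on them may
not recur under a fresh seed.

```java
import java.util.Arrays;
import java.util.Random;

// Hypothetical demo class; not part of the Lucene/Solr test framework.
final class SeedReplayDemo {

    // Draws n pseudo-random ints in [0, 100) from a generator started at seed.
    static int[] draw(long seed, int n) {
        Random random = new Random(seed);
        int[] out = new int[n];
        for (int i = 0; i < n; i++) {
            out[i] = random.nextInt(100);
        }
        return out;
    }

    public static void main(String[] args) {
        // The seed reported in the failure above, parsed as an unsigned hex long.
        long seed = Long.parseUnsignedLong("C22A0B280C50BF8F", 16);

        int[] firstRun = draw(seed, 5);
        int[] secondRun = draw(seed, 5);

        // Same seed, same sequence: the run is reproducible.
        System.out.println(Arrays.equals(firstRun, secondRun)); // prints "true"
    }
}
```

Rerunning the failing test with the reported seed, rather than a new random
one, is therefore the usual first step in telling a flaky test from a real
regression.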






[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-16 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15334150#comment-15334150
 ] 

ASF GitHub Bot commented on SOLR-8981:
--

GitHub user tballison opened a pull request:

https://github.com/apache/lucene-solr/pull/44

SOLR-8981

SOLR-8981 upgrade to Tika 1.13

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/tballison/lucene-solr SOLR-8981

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/lucene-solr/pull/44.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #44


commit ba0e71703464849198b384aa6e92962db8a04b51
Author: tballison 
Date:   2016-06-16T16:56:45Z

SOLR-8981 upgrade to Tika 1.13







[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-16 Thread Lewis John McGibbney (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15334085#comment-15334085
 ] 

Lewis John McGibbney commented on SOLR-8981:


Brilliant. The most recent patch I submitted matches the Tika 1.13 dependencies,
minus the scientific data formats and all of the other non-'document' formats.
Thanks for rebuilding, Tim; it's appreciated.




-- 
*Lewis*





[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-16 Thread Tim Allison (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15334015#comment-15334015
 ] 

Tim Allison commented on SOLR-8981:
---

Just tested now, and the upgrade patch is no longer failing on that test (?!).  
If I get a fully clean build, I'll submit it.




[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-16 Thread Tim Allison (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15333929#comment-15333929
 ] 

Tim Allison commented on SOLR-8981:
---

Yes, I think the only thing stopping us now is the unit test failure noted 
above.  I'll take a look.  I don't know if that'll be a blocker.




[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-16 Thread Tommaso Teofili (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15333910#comment-15333910
 ] 

Tommaso Teofili commented on SOLR-8981:
---

IIRC there's a related Solr issue about upgrading to Tika 1.12 that [~lewismc] 
was working on (progress was slowed by having to work out by hand which 
dependencies, transitive or not, needed to be updated back then).




[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-06-16 Thread Andriy Binetsky (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15333880#comment-15333880
 ] 

Andriy Binetsky commented on SOLR-8981:
---

Hi guys,
Is there anything new on this issue? Where can I find the sources mentioned?  I'm 
asking because we are currently updating our Solr to a 6.x version and would like 
to have Tika 1.13 with all its bugfixes/improvements. 




[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-05-31 Thread Tim Allison (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15308355#comment-15308355
 ] 

Tim Allison commented on SOLR-8981:
---

I'm getting a failure on that test too.  I can't figure out what's going on.  
I'm getting exactly the same output with the standalone Tika 1.7 and 1.13 apps 
on the test file...argh...




[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-05-26 Thread Lewis John McGibbney (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15302442#comment-15302442
 ] 

Lewis John McGibbney commented on SOLR-8981:


I am working on this again and will try to post a patch ASAP. 
[~talli...@mitre.org], I have the following test failing in Solr:
https://github.com/apache/lucene-solr/blob/master/solr/contrib/extraction/src/test/org/apache/solr/handler/extraction/ExtractingRequestHandlerTest.java#L505
I have been debugging the tests with no luck so far. I'll post a new PR 
later today. The new PR is rebased against lucene-solr master and Tika 1.13.




[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-05-26 Thread Tim Allison (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15302427#comment-15302427
 ] 

Tim Allison commented on SOLR-8981:
---

CVE-2016-4434: Apache Tika XML External Entity vulnerability in versions 
0.10-1.12: 
[announcement|https://mail-archives.apache.org/mod_mbox/tika-dev/201605.mbox/%3C1705136517.1175366.1464278135251.JavaMail.yahoo%40mail.yahoo.com%3E]




[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-05-18 Thread Chris A. Mattmann (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15289030#comment-15289030
 ] 

Chris A. Mattmann commented on SOLR-8981:
-

Correct, [~talli...@apache.org]: this won't affect it for now.




[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-05-18 Thread Tim Allison (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15288803#comment-15288803
 ] 

Tim Allison commented on SOLR-8981:
---

Thanks to [~grossws], our traditional language detection API _should_ be 
unchanged in 1.13.  #famouslastwords

We've also added Optimaize and Julia under a new package (tika-langdetect); see 
TIKA-1696.  This new package allows easier integration of other language 
detection packages such as [Yalder|https://github.com/kkrugler/yalder].

[~chrismattmann] and [~kkrugler], is the above correct?




[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-05-18 Thread Alexandre Rafalovitch (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15288757#comment-15288757
 ] 

Alexandre Rafalovitch commented on SOLR-8981:
-

Is this going to affect the language detection module in Solr? Or is the API unchanged?




[jira] [Commented] (SOLR-8981) Upgrade to Tika 1.13 when it is available

2016-05-16 Thread Tim Allison (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8981?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15285816#comment-15285816
 ] 

Tim Allison commented on SOLR-8981:
---

Tika 1.13 is now available.
