[jira] [Commented] (SOLR-8716) Upgrade to Apache Tika 1.12

2016-04-26 Thread Tim Allison (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15258285#comment-15258285
 ] 

Tim Allison commented on SOLR-8716:
---

IIRC, you won't want to include the SQLite parser because it includes native 
libs.  Right?

> Upgrade to Apache Tika 1.12
> ---
>
> Key: SOLR-8716
> URL: https://issues.apache.org/jira/browse/SOLR-8716
> Project: Solr
>  Issue Type: Improvement
>  Components: contrib - Solr Cell (Tika extraction)
>Reporter: Lewis John McGibbney
>Assignee: Uwe Schindler
> Fix For: master
>
> Attachments: LUCENE-7041.patch
>
>
> We recently released Apache Tika 1.12. In order to use the fixes provided 
> within the Tika.translate API I propose to upgrade Tika from 1.7 --> 1.12 in 
> lucene/ivy-versions.properties.
> Patch coming up.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-8716) Upgrade to Apache Tika 1.12

2016-04-26 Thread Lewis John McGibbney (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15258091#comment-15258091
 ] 

Lewis John McGibbney commented on SOLR-8716:


Hi [~thetaphi], argh, yes you are right. Let's wait for Tika 1.13. There is [a 
conversation|http://www.mail-archive.com/dev%40tika.apache.org/msg17480.html] 
about this right now.





[jira] [Commented] (SOLR-8716) Upgrade to Apache Tika 1.12

2016-04-25 Thread Uwe Schindler (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15257249#comment-15257249
 ] 

Uwe Schindler commented on SOLR-8716:
-

To get the checkout ready to commit:

- run {{ant jar-checksums precommit}} after updating version numbers
- add missing license and notice files





[jira] [Commented] (SOLR-8716) Upgrade to Apache Tika 1.12

2016-04-25 Thread Uwe Schindler (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15257241#comment-15257241
 ] 

Uwe Schindler commented on SOLR-8716:
-

Maybe we should wait until Tika 1.13? Alternatively, fix the version numbers to 
reflect Tika 1.12.

We should also fix the tests. After fixing the PDFBox version numbers, some 
tests still failed because they produced different output due to improved 
metadata.

You should run tests inside the correct modules only:
- solr/contrib/extraction
- solr/contrib/dataimporthandler-extras
- solr/contrib/morphlines-cell
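
A minimal sketch of restricting a test run to these three modules. It assumes the checkout root contains the `solr/contrib/...` directories and that `ant test` is the test target available inside each module directory. The helper name `module_test_commands` is hypothetical, and the sketch only builds the commands without running them.

```python
from pathlib import Path

# Modules named in the list above.
TIKA_MODULES = [
    "solr/contrib/extraction",
    "solr/contrib/dataimporthandler-extras",
    "solr/contrib/morphlines-cell",
]


def module_test_commands(checkout_root):
    """Return (working_directory, command) pairs, one per module.

    Hypothetical helper: it only builds the commands and does not run them.
    Assumes `ant test` is the test target inside each module directory.
    """
    root = Path(checkout_root)
    return [(root / module, ["ant", "test"]) for module in TIKA_MODULES]


if __name__ == "__main__":
    # Print one shell line per module so the commands can be reviewed first.
    for cwd, command in module_test_commands("."):
        print(f"(cd {cwd} && {' '.join(command)})")
```

Building the list first, rather than running it directly, keeps it easy to check which directories would be touched before anything is executed.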

Uwe




[jira] [Commented] (SOLR-8716) Upgrade to Apache Tika 1.12

2016-04-25 Thread Uwe Schindler (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15257147#comment-15257147
 ] 

Uwe Schindler commented on SOLR-8716:
-

Hi,

Some versions seem to be wrong. Tika 1.12 uses: 
1.8.10, but you added 2.0. This leads to 
several NoClassDefFoundError failures. I have the feeling your updates are for 
the upcoming version 1.13.

I have already fixed all license and SHA1 files and added NOTICE files as 
required by "ant precommit". I committed this locally, so I am not sure how to 
proceed. I can supply a patch of my changes, or alternatively you could tell me 
the right versions (a simple corrected patch for ivy-versions.properties may be 
fine) or update the PR and I will merge again.
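
A mismatch of this kind can be caught before the tests run by comparing the versions declared in a patch against the versions the target release is known to use. The sketch below is illustrative only: the library names are placeholders, `find_version_mismatches` is a hypothetical helper, and the two mappings would have to be filled in by hand from the patch and from the release's own build file.

```python
def find_version_mismatches(declared, expected):
    """Return {name: (declared_version, expected_version)} for differing entries.

    Only names present in both mappings are compared.
    """
    return {
        name: (declared[name], expected[name])
        for name in sorted(declared.keys() & expected.keys())
        if declared[name] != expected[name]
    }


# Placeholder names; the version strings echo the mismatch described above.
declared_in_patch = {"hypothetical-lib-a": "2.0", "hypothetical-lib-b": "1.4"}
used_by_release = {"hypothetical-lib-a": "1.8.10", "hypothetical-lib-b": "1.4"}

if __name__ == "__main__":
    mismatches = find_version_mismatches(declared_in_patch, used_by_release)
    for name, (got, want) in mismatches.items():
        print(f"{name}: patch declares {got}, release uses {want}")
```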




[jira] [Commented] (SOLR-8716) Upgrade to Apache Tika 1.12

2016-04-25 Thread Uwe Schindler (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15257037#comment-15257037
 ] 

Uwe Schindler commented on SOLR-8716:
-

Hi,
I am currently testing the PR. I just had to do the required stuff to update 
JAR file checksums to get precommit tests working. I will also add license 
files for the new JARs. After that I will commit, if tests are working. No need 
to modify the PR again, I will take over.
Thanks for taking care!

Uwe




[jira] [Commented] (SOLR-8716) Upgrade to Apache Tika 1.12

2016-04-22 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254692#comment-15254692
 ] 

ASF GitHub Bot commented on SOLR-8716:
--

Github user lewismc commented on a diff in the pull request:

https://github.com/apache/lucene-solr/pull/31#discussion_r60803373
  
--- Diff: solr/NOTICE.txt ---
@@ -396,6 +394,33 @@ https://github.com/rjohnsondev/java-libpst
 JMatIO is a JAVA library to read/write/manipulate with Matlab binary 
MAT-files.
 http://www.sourceforge.net/projects/jmatio
 
+metadata-extractor is a straightforward Java library for reading metadata 
+from image files.
+https://github.com/drewnoakes/metadata-extractor
+
+Java MP4 Parser; A Java API to read, write and create MP4 container
+https://github.com/sannies/mp4parser
+
+Jackcess; is a pure Java library for reading from and writing to MS Access 
+databases
+http://jackcess.sourceforge.net/
+
+Jackcess Encrypt; an extension library for the Jackcess project which 
+implements support for some forms of Microsoft Access and Microsoft 
+Money encryption
+http://jackcessencrypt.sourceforge.net/
+
+ROME; is a Java framework for RSS and Atom feeds
+(https://github.com/rometools/rome)
+
+VorbisJava; Ogg and Vorbis Tools for Java
+Copyright 2012 Nick Burch
+https://github.com/Gagravarr/VorbisJava
+
+SQLite JSDC Driver; is a library for accessing and creating SQLite 
--- End diff --

Updated... thanks





[jira] [Commented] (SOLR-8716) Upgrade to Apache Tika 1.12

2016-04-22 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254601#comment-15254601
 ] 

ASF GitHub Bot commented on SOLR-8716:
--

Github user uschindler commented on a diff in the pull request:

https://github.com/apache/lucene-solr/pull/31#discussion_r60798688
  

JSDC -> JDBC





[jira] [Commented] (SOLR-8716) Upgrade to Apache Tika 1.12

2016-04-22 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254567#comment-15254567
 ] 

ASF GitHub Bot commented on SOLR-8716:
--

Github user lewismc commented on a diff in the pull request:

https://github.com/apache/lucene-solr/pull/31#discussion_r60795874
  

The last entry? SQLite

On Friday, April 22, 2016, Uwe Schindler  wrote:

> In solr/NOTICE.txt
> :
>
> > +databases
> > +http://jackcess.sourceforge.net/
> > +
> > +Jackcess Encrypt; an extension library for the Jackcess project which
> > +implements support for some forms of Microsoft Access and Microsoft
> > +Money encryption
> > +http://jackcessencrypt.sourceforge.net/
> > +
> > +ROME; is a Java framework for RSS and Atom feeds
> > +(https://github.com/rometools/rome)
> > +
> > +VorbisJava; Ogg and Vorbis Tools for Java
> > +Copyright 2012 Nick Burch
> > +https://github.com/Gagravarr/VorbisJava
> > +
> > +SQLite JSDC Driver; is a library for accessing and creating SQLite
>
> This is a typo, I think.


-- 
*Lewis*






[jira] [Commented] (SOLR-8716) Upgrade to Apache Tika 1.12

2016-04-22 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254548#comment-15254548
 ] 

ASF GitHub Bot commented on SOLR-8716:
--

Github user uschindler commented on a diff in the pull request:

https://github.com/apache/lucene-solr/pull/31#discussion_r60794497
  

This is a typo, I think.





[jira] [Commented] (SOLR-8716) Upgrade to Apache Tika 1.12

2016-04-22 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15254322#comment-15254322
 ] 

ASF GitHub Bot commented on SOLR-8716:
--

Github user lewismc commented on the pull request:

https://github.com/apache/lucene-solr/pull/31#issuecomment-213529331
  
Hi Uwe, as Jira is temporarily closed, I will respond here and hopefully 
the message will be queued and posted to the issue on Jira.
I agree with your comments and have updated the PR accordingly. Thanks for 
the continued review. 





[jira] [Commented] (SOLR-8716) Upgrade to Apache Tika 1.12

2016-04-22 Thread Uwe Schindler (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15253638#comment-15253638
 ] 

Uwe Schindler commented on SOLR-8716:
-

Hi,

I quickly reviewed the new dependencies: Some are fine (Access Databases), but 
some may not be relevant for Apache Solr, e.g. the Geo-Stuff. We also excluded 
Netcdf in the past.

In general we are mostly interested in libraries that extract text from 
"documents", not stuff that just extracts a bit of metadata or other 
non-document stuff. So I have the feeling Apache SIS is not relevant for the 
extraction module. Users that want to index geospatial stuff have to use other 
features of Solr.

For the other new dependencies, we have to add the NOTICE.txt entries (inside 
Solr).




[jira] [Commented] (SOLR-8716) Upgrade to Apache Tika 1.12

2016-04-22 Thread Uwe Schindler (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15253626#comment-15253626
 ] 

Uwe Schindler commented on SOLR-8716:
-

Hi Lewis,
we have no automated system through Jenkins. I would apply the patch locally 
and do some quick tests and then push them to Git. ASF Jenkins and Policeman 
Jenkins will take care of finding issues.
Uwe




[jira] [Commented] (SOLR-8716) Upgrade to Apache Tika 1.12

2016-04-20 Thread Lewis John McGibbney (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15251249#comment-15251249
 ] 

Lewis John McGibbney commented on SOLR-8716:


[~thetaphi] [~janhoy] out of curiosity how do patches in lucene-solr typically 
get reviewed? Do you have a pre-commit build or something set up? Thanks




[jira] [Commented] (SOLR-8716) Upgrade to Apache Tika 1.12

2016-04-20 Thread Lewis John McGibbney (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15250819#comment-15250819
 ] 

Lewis John McGibbney commented on SOLR-8716:


The PR is updated to ensure that the new dependencies required by the new 
parsers are
1) lexicographically ordered in lucene/ivy-versions.properties, and
2) included within solr/contrib/extraction/ivy.xml
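
The ordering requirement can be checked mechanically. The following is a rough sketch, assuming entries are simple `key = value` lines with `#` comments; it is not a full .properties parser, and Python's code-point string comparison may differ from whatever comparison the project's own build check applies. The function names and the sample entries are made up for illustration.

```python
def parse_keys(properties_text):
    """Return the keys of simple `key = value` lines, in file order.

    Blank lines and lines starting with `#` are skipped. This is a
    simplified reading of a .properties file, not a full parser.
    """
    keys = []
    for line in properties_text.splitlines():
        stripped = line.strip()
        if not stripped or stripped.startswith("#"):
            continue
        key, _, _ = stripped.partition("=")
        keys.append(key.strip())
    return keys


def first_out_of_order(keys):
    """Return the first (previous, current) pair that breaks ascending order, or None."""
    for previous, current in zip(keys, keys[1:]):
        if current < previous:
            return previous, current
    return None


# Made-up sample entries, for illustration only.
SAMPLE = """\
# example entries
/com.example/alpha = 1.0
/com.example/gamma = 2.0
/com.example/beta = 3.0
"""

if __name__ == "__main__":
    print(first_out_of_order(parse_keys(SAMPLE)))
```

Reporting only the first offending pair keeps the output short; fixing that pair and re-running finds the next one.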




[jira] [Commented] (SOLR-8716) Upgrade to Apache Tika 1.12

2016-04-20 Thread Lewis John McGibbney (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15250294#comment-15250294
 ] 

Lewis John McGibbney commented on SOLR-8716:


New parsers are
{code}
org.apache.tika.parser.image.WebPParser
org.apache.tika.parser.microsoft.JackcessParser
org.apache.tika.parser.pkg.RarParser
org.apache.tika.parser.dif.DIFParser
org.apache.tika.parser.gdal.GDALParser
org.apache.tika.parser.pot.PooledTimeSeriesParser
org.apache.tika.parser.grib.GribParser
org.apache.tika.parser.jdbc.SQLite3Parser
org.apache.tika.parser.isatab.ISArchiveParser
org.apache.tika.parser.geoinfo.GeographicInformationParser
org.apache.tika.parser.geo.topic.GeoParser
org.apache.tika.parser.external.CompositeExternalParser
org.apache.tika.parser.journal.JournalParser
{code}




[jira] [Commented] (SOLR-8716) Upgrade to Apache Tika 1.12

2016-04-20 Thread Lewis John McGibbney (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15249398#comment-15249398
 ] 

Lewis John McGibbney commented on SOLR-8716:


bq. Are there any new parsers in 1.12 that we could use - which do not have the 
required dependency added?

There are a few... I'll get a comprehensive list and update in due course. 




[jira] [Commented] (SOLR-8716) Upgrade to Apache Tika 1.12

2016-04-20 Thread Uwe Schindler (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15249396#comment-15249396
 ] 

Uwe Schindler commented on SOLR-8716:
-

Cool, will look into it later.

Are there any new parsers in 1.12 that we could use - which do not have the 
required dependency added?

Uwe




[jira] [Commented] (SOLR-8716) Upgrade to Apache Tika 1.12

2016-04-19 Thread Lewis John McGibbney (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15249013#comment-15249013
 ] 

Lewis John McGibbney commented on SOLR-8716:


[~thetaphi] I managed to sort out the dependency stuff... I hope. Would 
appreciate peer review here again. Thanks. 




[jira] [Commented] (SOLR-8716) Upgrade to Apache Tika 1.12

2016-04-19 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15249008#comment-15249008
 ] 

ASF GitHub Bot commented on SOLR-8716:
--

Github user lewismc commented on the pull request:

https://github.com/apache/lucene-solr/pull/10#issuecomment-212181537
  
This PR is superseded by #31 





[jira] [Commented] (SOLR-8716) Upgrade to Apache Tika 1.12

2016-04-19 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15249009#comment-15249009
 ] 

ASF GitHub Bot commented on SOLR-8716:
--

Github user lewismc closed the pull request at:

https://github.com/apache/lucene-solr/pull/10





[jira] [Commented] (SOLR-8716) Upgrade to Apache Tika 1.12

2016-04-19 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15249002#comment-15249002
 ] 

ASF GitHub Bot commented on SOLR-8716:
--

GitHub user lewismc opened a pull request:

https://github.com/apache/lucene-solr/pull/31

SOLR-8716 Upgrade to Apache Tika 1.12

This PR is an attempt to address 
https://issues.apache.org/jira/browse/SOLR-8716. I ran the test suite with no 
issues. Please let me know if there are additional issues I need to deal with 
here. 

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/lewismc/lucene-solr SOLR-8716

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/lucene-solr/pull/31.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #31


commit 4557a3cc6c56b98420b2389a44b2b4fc3c133a5d
Author: Lewis John McGibbney 
Date:   2016-04-20T00:17:50Z

SOLR-8716 Upgrade to Apache Tika 1.12







[jira] [Commented] (SOLR-8716) Upgrade to Apache Tika 1.12

2016-02-23 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15158611#comment-15158611
 ] 

ASF GitHub Bot commented on SOLR-8716:
--

GitHub user janhoy commented on the pull request:

https://github.com/apache/lucene-solr/pull/10#issuecomment-187617872
  
Hi, you should also manually upgrade the relevant Tika dependencies; see 
lucene-solr/solr/contrib/extraction/ivy.xml.

When you compare the changes between 1.7 and 1.12, you may find that Tika also 
added new dependencies, such as parsers. Please make a list of these and 
suggest which of them you feel MUST be included with Solr. Note that we do not 
include every single Tika dependency today, in order to keep the distro slimmer.
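
One way to produce the list requested above is to diff the dependency 
coordinates of the two Tika releases. The following is only a minimal sketch: 
the `group:artifact:version` input format, the function names, and the 
`org.example` coordinates are all hypothetical placeholders, not the real 
dependency lists of Tika 1.7 or 1.12.

```python
def parse_coordinates(lines):
    """Map 'group:artifact' to its version for each 'group:artifact:version' line."""
    coords = {}
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        group, artifact, version = line.split(":")
        coords[f"{group}:{artifact}"] = version
    return coords


def diff_dependencies(old_lines, new_lines):
    """Return (added, removed, changed) dependency keys, each sorted."""
    old = parse_coordinates(old_lines)
    new = parse_coordinates(new_lines)
    added = sorted(set(new) - set(old))
    removed = sorted(set(old) - set(new))
    changed = sorted(k for k in set(old) & set(new) if old[k] != new[k])
    return added, removed, changed


# Hypothetical placeholder data; substitute the real lists for each release.
OLD = [
    "org.example:parser-a:1.0",
    "org.example:parser-b:2.1",
]
NEW = [
    "org.example:parser-a:1.1",
    "org.example:parser-b:2.1",
    "org.example:parser-c:0.9",
]

if __name__ == "__main__":
    added, removed, changed = diff_dependencies(OLD, NEW)
    print("added:  ", added)
    print("removed:", removed)
    print("changed:", changed)
```

The "added" entries are the candidates to review for inclusion in 
solr/contrib/extraction/ivy.xml; the "changed" entries are the existing 
dependencies whose versions would need a manual bump.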






[jira] [Commented] (SOLR-8716) Upgrade to Apache Tika 1.12

2016-02-22 Thread Uwe Schindler (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15157859#comment-15157859
 ] 

Uwe Schindler commented on SOLR-8716:
-

I moved the issue to Solr. Please update the PR to use the correct issue number.
