RE: Integrating grobid with Tika in solr

2016-05-04 Thread Allison, Timothy B.
, 2016 2:38 PM To: solr-user@lucene.apache.org Subject: Re: Integrating grobid with Tika in solr On 5/4/2016 9:21 AM, Betsey Benagh wrote: > I’m feeling particularly dense, because I don’t see any Tika jars in > WEB-INF/lib: Oops. Sorry about that, I forgot that it's all contrib. That's my m

Re: Integrating grobid with Tika in solr

2016-05-04 Thread Shawn Heisey
On 5/4/2016 9:21 AM, Betsey Benagh wrote: > I’m feeling particularly dense, because I don’t see any Tika jars in > WEB-INF/lib: Oops. Sorry about that, I forgot that it's all contrib. That's my mistake, not yours. The Tika jars are in contrib/extraction/lib, along with a very large number of

Re: Integrating grobid with Tika in solr

2016-05-04 Thread Betsey Benagh
As a workaround, I’m trying to run Grobid on my files, and then import the corresponding XML into Solr. I don’t see any errors on the post: bba0124$ bin/post -c lrdtest ~/software/grobid/out/021002_1.tei.xml /Library/Java/JavaVirtualMachines/jdk1.8.0_71.jdk/Contents/Home/bin/java -classpath

Re: Integrating grobid with Tika in solr

2016-05-04 Thread Betsey Benagh
I’m feeling particularly dense, because I don’t see any Tika jars in WEB-INF/lib: antlr4-runtime-4.5.1-1.jar asm-5.0.4.jar asm-commons-5.0.4.jar commons-cli-1.2.jar commons-codec-1.10.jar commons-collections-3.2.2.jar commons-configuration-1.6.jar commons-exec-1.3.jar

Re: Integrating grobid with Tika in solr

2016-05-04 Thread Shawn Heisey
On 5/4/2016 8:38 AM, Betsey Benagh wrote: > Thanks, I’m currently using 5.5, and will try upgrading to 6.0. > > > On 5/4/16, 10:37 AM, "Allison, Timothy B." wrote: >> Y. Solr 6.0.0 is shipping with Tika 1.7. Grobid came in with Tika 1.11. Just upgrading to 6.0.0 isn't

Re: Integrating grobid with Tika in solr

2016-05-04 Thread Betsey Benagh
. [mailto:talli...@mitre.org] >Sent: Wednesday, May 4, 2016 10:29 AM >To: solr-user@lucene.apache.org >Subject: RE: Integrating grobid with Tika in solr > >I think Solr is using a version of Tika that predates that addition of >the Grobid parser. You'll have to add that manually s

RE: Integrating grobid with Tika in solr

2016-05-04 Thread Allison, Timothy B.
Y. Solr 6.0.0 is shipping with Tika 1.7. Grobid came in with Tika 1.11. -Original Message- From: Allison, Timothy B. [mailto:talli...@mitre.org] Sent: Wednesday, May 4, 2016 10:29 AM To: solr-user@lucene.apache.org Subject: RE: Integrating grobid with Tika in solr I think Solr is using

RE: Integrating grobid with Tika in solr

2016-05-04 Thread Allison, Timothy B.
] Sent: Wednesday, May 4, 2016 10:07 AM To: solr-user@lucene.apache.org Subject: Re: Integrating grobid with Tika in solr Grobid runs as a service, and I'm (theoretically) configuring Tika to call it. >From the Grobid wiki, here are instructions for integrating with Tika >application: Fi

Re: Integrating grobid with Tika in solr

2016-05-04 Thread Betsey Benagh
Grobid runs as a service, and I’m (theoretically) configuring Tika to call it. >From the Grobid wiki, here are instructions for integrating with Tika >application: First we need to create the GrobidExtractor.properties file that points to the Grobid REST Service. My file looks like the

Re: Integrating grobid with Tika in solr

2016-05-04 Thread Shawn Heisey
On 5/4/2016 7:15 AM, Betsey Benagh wrote: > (X-posted from Stack Overflow) > > This feels like a basic, dumb question, but my reading of the documentation > has not led me to an answer. > > > I'm using Solr to index journal articles. Using the out-of-the-box > configuration, it indexed the