Re: Using Tika that comes with Solr 5.2

Nick Burch Wed, 03 Feb 2016 05:30:08 -0800

On Tue, 2 Feb 2016, Steven White wrote:

What I'm finding is that Tika will not extract the raw text off PDF,
Powerpoint, ets. files but it will off raw text files.


I'd suggest you try some of the steps in the troubleshooting page:
  http://wiki.apache.org/tika/Troubleshooting%20Tika

Probably start at the "No Content Extracted" section, and follow the linksto the possible problems + ways to check

Solr 5.2 comes with the following Tika JARs which I have included all of
them: tika-core-1.7.jar, tika-java7-1.7.jar, tika-parsers-1.7.jar,
tika-xmp-1.7.jar, vorbis-java-tika-0.6.jar,
kite-morphlines-tika-core-0.12.1.jar and
kite-morphlines-tika-decompress-0.12.1.jar

You seem to be missing quite a few of the Tika dependencies, which maywell be it, follow the troubleshooting guide to check!


Nick

Re: Using Tika that comes with Solr 5.2

Reply via email to