Hello Tika People,

I am trying to add a custom content type to Tika and am finding it
difficult. I am not sure whether the tutorial I am following is out of
date, but that could be the case.

I am using Tika 1.11, which I downloaded from here:
https://www.apache.org/dist/tika/tika-server-1.11.jar

Once I have this file I can successfully run it on my PC using:
java -jar tika-server-1.11.jar -h 0.0.0.0

I created a custom content-type like so:
<?xml version="1.0" encoding="UTF-8"?>
<properties>
  <parsers>
    <parser class="org.apache.tika.parser.DefaultParser">
      <mime-exclude>application/hello</mime-exclude>
    </parser>
    <parser class="org.apache.tika.parser.hello.HelloWorldParser">
      <mime>application/hello</mime>
    </parser>
  </parsers>
</properties>

I saved this into a file called parsers.xml.

Then I followed the example in
https://tika.apache.org/1.5/parser_guide.html#Create_your_Parser_class and
added the parser class.

My question is: what do I need to add to the "java -jar
tika-server-1.11.jar -h 0.0.0.0" command for it to load my custom parser?

Thanks in advance,
James Brooking
