On Tue, Oct 14, 2014 at 12:32:52PM +0100, Nick Burch wrote:
> On Tue, 14 Oct 2014, Kamil Żyta wrote:
> > I want to use external parser but on web there isn't complex 
> > howto/tutorial. I only found parser/external/tika-external-parsers.xml 
> > sample configuration but I don't know how to register/enable this parser 
> > in tika parsers.
> 
> All you should need to do is provide a tika-external-parsers.xml file on 
> your classpath (in the appropriate directory), which defines how to talk 
> to your command line tool. Tika will find that and wire it up to the 
> external parser for you

where is the appropriate directory?
# find . -name tika-external-parsers.xml
./tika-parsers/src/main/resources/org/apache/tika/parser/external/tika-external-parsers.xml
./tika-parsers/target/classes/org/apache/tika/parser/external/tika-external-parsers.xml

> 
> Tika has tests for the external parser included in it, you can try looking 
> at those for inspiration

I can not find it

> 
> You can also look at the ffmpeg plugin for another example, that's based 
> on the external parser - https://github.com/AlfrescoLabs/tika-ffmpeg
> 

sample tika-external-parsers.xml include ffmpeg for 'video/x-matroska' but when 
I do:

# java -jar tika-app/target/tika-app-*.jar 
~/xvid_480p_as_l5_1mbps_he-aac_foreign_subs_matrix.mkv

I got only resourceName, Content-Length and Content-Type

K

Reply via email to