On Tue, Oct 14, 2014 at 12:32:52PM +0100, Nick Burch wrote:
> On Tue, 14 Oct 2014, Kamil Żyta wrote:
> > I want to use external parser but on web there isn't complex
> > howto/tutorial. I only found parser/external/tika-external-parsers.xml
> > sample configuration but I don't know how to register/enable this parser
> > in tika parsers.
>
> All you should need to do is provide a tika-external-parsers.xml file on
> your classpath (in the appropriate directory), which defines how to talk
> to your command line tool. Tika will find that and wire it up to the
> external parser for you

Where is the appropriate directory? The only copies I can find are the ones
in the source tree and the build output:

# find . -name tika-external-parsers.xml
./tika-parsers/src/main/resources/org/apache/tika/parser/external/tika-external-parsers.xml
./tika-parsers/target/classes/org/apache/tika/parser/external/tika-external-parsers.xml

>
> Tika has tests for the external parser included in it, you can try looking
> at those for inspiration

I cannot find them.

>
> You can also look at the ffmpeg plugin for another example, that's based
> on the external parser - https://github.com/AlfrescoLabs/tika-ffmpeg
>

The sample tika-external-parsers.xml includes ffmpeg for 'video/x-matroska',
but when I run:

# java -jar tika-app/target/tika-app-*.jar ~/xvid_480p_as_l5_1mbps_he-aac_foreign_subs_matrix.mkv

I get only resourceName, Content-Length and Content-Type.

K
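
One way to check which copies of the file are actually visible on a given classpath is a small standalone Java check like the sketch below. It uses only the JDK, not Tika itself. The resource path is an assumption taken from the find output above (the package directory under tika-parsers); this thread does not confirm that it is the location Tika reads, and the class name ExternalParserConfigLocator is made up for illustration.

```java
import java.io.IOException;
import java.net.URL;
import java.util.Collections;
import java.util.List;

public class ExternalParserConfigLocator {
    // Assumption: package path copied from the find output in this message.
    static final String RESOURCE =
            "org/apache/tika/parser/external/tika-external-parsers.xml";

    // Returns every classpath location where the resource is visible.
    static List<URL> locate(ClassLoader loader) throws IOException {
        return Collections.list(loader.getResources(RESOURCE));
    }

    public static void main(String[] args) throws IOException {
        List<URL> found =
                locate(ExternalParserConfigLocator.class.getClassLoader());
        if (found.isEmpty()) {
            System.out.println("Not on the classpath: " + RESOURCE);
        }
        // One line per jar or directory that provides the file.
        for (URL url : found) {
            System.out.println(url);
        }
    }
}
```

Running it with the same classpath that is given to Tika should show whether a custom copy of the file is picked up at all, and from which jar or directory.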
