Hello Tika community,

Our team is migrating away from usage of tika-app.jar (2.6 currently) to 
something with more minimal third party dependencies which we can control.


Is there any good documentation or pathway to describe how a team could map the 
tika-app functionality we use to the same behavior using just tika-core and 
tika-parsers-standard-package
(I assume)?

The tika-app functions we use today are:

Mime-type detection
java -jar tika-app.jar -d <file>

and
Text extraction attempts
java -jar tika-app.jar -t <file>

Is there a subset of tika parser jars we would need to include to have 
equivalent functionality if we wrote our own wrapper main class?

Thank you,
Brian Laskey

Reply via email to