We don't currently allow multiple tika-config files.

In TIKA-3297, I added a "skipOcr" parameter to TesseractOCRConfig.  So
you can turn off OCR at runtime via the TesseractOCRConfig sent in the
ParseContext _per parse_.

On Wed, Feb 10, 2021 at 10:15 AM Peter Kronenberg
<[email protected]> wrote:
>
> Is it possible to use multiple tika-config files?  Perhaps a separate one for 
> each parser, and then combine them?
>
>
>
> I’m also wondering how to implement the recommendation here, 
> https://cwiki.apache.org/confluence/display/TIKA/TikaOCR, to disable OCR.  
> Currently, I load this tika-config when I want to totally disable.  But if 
> I’m already using tika-config with all my other options, how do I ‘add on’?

Reply via email to