We don't currently allow multiple tika-config files. In TIKA-3297, I added a "skipOcr" parameter to TesseractOCRConfig. So you can turn off OCR at runtime via the TesseractOCRConfig sent in the ParseContext _per parse_.
On Wed, Feb 10, 2021 at 10:15 AM Peter Kronenberg <[email protected]> wrote: > > Is it possible to use multiple tika-config files? Perhaps a separate one for > each parser, and then combine them? > > > > I’m also wondering how to implement the recommendation here, > https://cwiki.apache.org/confluence/display/TIKA/TikaOCR, to disable OCR. > Currently, I load this tika-config when I want to totally disable. But if > I’m already using tika-config with all my other options, how do I ‘add on’?
