Hello everyone, we are using TikaOCR to access tesseract OCR via Tika Server's web API, which is working perfectly satisfying. However, as we process documents in different languages, I was wondering if it is possible to get a list of available languages from the server? Furthermore, does anybody know how I can tell TikaOCR not to return a response in plain text but hOCR-XML?
Thanks in advance, Mirko
smime.p7s
Description: S/MIME Cryptographic Signature
