On 29.10.2023 14:16, Tyler Salwierz wrote:
I’m using fscrawler, which uses Tika, and it’s not generating OCR on HEIC images. The actual image metadata is indexed but the content is empty. Is there any fix for this if it is a Tika bug? https://pastebin.com/raw/Jp5kBi5M
HEIC isn't supported by Tesseract, which Tika relies on for OCR, so this isn't a Tika bug.

https://github.com/tesseract-ocr/tesseract/issues/2930
https://tesseract-ocr.github.io/tessdoc/InputFormats.html

Tilman
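
One possible workaround, not proposed in the thread itself, is to convert HEIC files to a format Tesseract can read before they are crawled. The sketch below is only an illustration under assumptions: the extension list is taken from the Tesseract input-formats page linked above, the function names are hypothetical, and it assumes the `heif-convert` tool from libheif is installed and on the PATH.

```python
import subprocess
from pathlib import Path

# Extensions for the formats listed on the Tesseract input-formats page
# (PNG, JPEG, TIFF, JPEG 2000, GIF, WebP, BMP, PNM). HEIC is not among them.
TESSERACT_EXTENSIONS = {
    ".png", ".jpg", ".jpeg", ".tif", ".tiff",
    ".jp2", ".gif", ".webp", ".bmp", ".pnm",
}


def needs_conversion(path):
    """Return True if the file extension is not one Tesseract is expected to read."""
    return Path(path).suffix.lower() not in TESSERACT_EXTENSIONS


def conversion_command(path, out_dir):
    """Build a heif-convert command that writes a PNG with the same base name.

    Assumption: heif-convert (from libheif) is available on the PATH.
    """
    src = Path(path)
    dst = Path(out_dir) / (src.stem + ".png")
    return ["heif-convert", str(src), str(dst)]


def convert_if_needed(path, out_dir):
    """Convert a HEIC file to PNG and return the path that should be indexed."""
    src = Path(path)
    if src.suffix.lower() not in {".heic", ".heif"}:
        return src  # leave every other file untouched
    Path(out_dir).mkdir(parents=True, exist_ok=True)
    command = conversion_command(src, out_dir)
    subprocess.run(command, check=True)  # raises CalledProcessError on failure
    return Path(command[-1])
```

Running `convert_if_needed` over a directory before pointing fscrawler at the converted copies should give Tesseract images it can read. Whether the converted files keep the original metadata depends on the conversion tool.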
