On 29.10.2023 18:00, Tyler Salwierz wrote:
Is Apple using their own custom ocr scanner then because spotlight does ocr locally on Heic.

if they can display HEIC, then they can also convert it.

They have an OCR:

https://developer.apple.com/documentation/vision/recognizing_text_in_images

Tilman



On Oct 29, 2023, at 9:12 AM, Tilman Hausherr <[email protected]> wrote:


On 29.10.2023 14:16, Tyler Salwierz wrote:
I’m using fscrawler which uses Tika and it’s not generating OCR on heic images. The actual image metadata is indexed but the content is empty.

Is there any fix for this if it is a Tika bug?

https://pastebin.com/raw/Jp5kBi5M

Heic isn't supported by tesseract, thus it isn't a bug.

https://github.com/tesseract-ocr/tesseract/issues/2930

https://tesseract-ocr.github.io/tessdoc/InputFormats.html

Tilman


Reply via email to