Hello, I use Nutch v1.10, i just want to know if Nutch with Tika parser v1.8 can natively OCR images from PDF files? I can OCR JPEG or PNG files but Tika do not convert images from PDF. I use Elastic to index.
Thank you
Hello, I use Nutch v1.10, i just want to know if Nutch with Tika parser v1.8 can natively OCR images from PDF files? I can OCR JPEG or PNG files but Tika do not convert images from PDF. I use Elastic to index.
Thank you