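I am not sure what your problem is: the file is not empty, and tesseract gave you exactly the output you asked for [1], [2]. With textonly_pdf=1 the PDF contains only the invisible text layer and no image, so the page looks blank in a viewer even though the recognized text is there. It is not a tesseract issue or bug that you are not familiar with the command you used.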
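One way to check this yourself is to extract the text layer from the generated PDF. Below is a minimal sketch, not an official recipe: it rebuilds the command from the original message and, assuming both tesseract and Poppler's pdftotext are installed and on the PATH, reports whether the PDF carries any text. The helper names build_tesseract_cmd and extract_text are made up for illustration.

```python
import shutil
import subprocess


def build_tesseract_cmd(image, out_base, lang="eng", text_only=True):
    """Return the argv list for the tesseract call quoted in this thread."""
    cmd = ["tesseract", image, out_base, "-l", lang]
    if text_only:
        # textonly_pdf=1 requests a PDF with only the invisible text layer
        cmd += ["-c", "textonly_pdf=1"]
    cmd.append("pdf")  # output format; tesseract appends ".pdf" to out_base
    return cmd


def extract_text(pdf_path):
    """Dump the PDF text layer to a string (assumes pdftotext is installed)."""
    result = subprocess.run(
        ["pdftotext", pdf_path, "-"],  # "-" sends the text to stdout
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


if __name__ == "__main__":
    # Only run the external tools when both are actually available.
    if shutil.which("tesseract") and shutil.which("pdftotext"):
        subprocess.run(
            build_tesseract_cmd("Invoice.tiff", "TextOnlyPDF_NoData"),
            check=True,
        )
        text = extract_text("TextOnlyPDF_NoData.pdf")
        print("text layer present" if text.strip() else "no text layer found")
    else:
        print("tesseract and/or pdftotext not found on PATH")
```

If the script reports a text layer, the PDF is not empty; it simply has no visible image behind the text. Dropping the textonly_pdf option (text_only=False here) gives the usual PDF with the page image plus the invisible text.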

[1] https://github.com/tesseract-ocr/tesseract/blob/723eb135c5815e6f22c50a91cfb1c030329213d8/src/ccmain/tesseractclass.cpp#L307
[2] https://github.com/tesseract-ocr/tessdoc/blob/f8dd4d1a259bbaa024e01127e9c1d6208d8bd236/FAQ.md#what-output-formats-can-tesseract-produce

Zdenko

On Thu, 29 Apr 2021 at 20:57, Sharp Subbu <sharpsu...@gmail.com> wrote:

> Dear Friends,
>
> Kindly find the attached pdf file "TextOnlyPDF_NoData.pdf".
> This pdf file is created using the Tesseract OCR v5.0.0. using the below
> command:
> Command: tesseract Invoice.tiff TextOnlyPDF_NoData -l eng -c
> textonly_pdf=1 pdf
>
> But, this pdf does not contain any data. It is empty.
>
> Kindly let us know is there any bug/issue present in Tesseract OCR
> v5.0.0.0 latest source which generates above output pdf file with
> textonly_pdf=1.
>
> NOTE:
> For your reference, we are attaching a text only pdf file
> "Invoice--Adobe-PDF-(ABBYY-OCR).pdf" generated by ABBYY OCR.
> We are trying to generate similar text only pdf file using Tesseract OCR
> v5.0.0.
>
> Kindly help us to fix the above textonly pdf issue from Tesseract OCR
> v5.0.0. side.
>
> Thank you very much in advance.
>
> Regards,
> Subramanyam
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tesseract-ocr+unsubscr...@googlegroups.com.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/2740afaf-47ff-4518-b829-5c69f9e94457n%40googlegroups.com