Sure. I'll need to find a test file that doesn't contain private 
information.

Before seeing your response now, I ran my script on a file that I had 
converted to a searchable PDF last year and the output file was very poor. 
Out of curiosity, I changed the converted image from .tiff to .png and the 
result was very good. I'm wondering if it's something with the convert 
package.

Rich

On Wednesday, January 19, 2022 at 11:18:20 PM UTC-7 zdenop wrote:

> Please provide details for reproducing problem: input image, output pdf, 
> tesseract details (tesseract -v)
>
> Zdenko
>
>
> št 20. 1. 2022 o 5:03 Rich M <[email protected]> napísal(a):
>
>> Hi,
>>
>> I'm fairly new to tesseract and had a written a bash script in Debian 
>> Buster(previous release) using tesseract 3 which worked very well. I've 
>> since upgraded my OS to the next stable release, Bullseye which also 
>> upgraded tesseract to V4. After the upgrade, tesseract isn't "working" any 
>> longer. I'm needing help in troubleshooting the issue.
>>
>> Basically the important line of the script is
>> tesseract PDFIn001.tiff PDFOut001 -l eng pdf
>>
>> Then in the terminal,
>> Tesseract Open Source OCR Engine v4.1.1 with Leptonica
>>
>> The resulting PDF file is 2.4kB and appears to be empty or corrupted. 
>>
>> With the previous Debian release, I didn't need to install any 
>> "training". Is that what I'm missing?
>>
>> Thanks, 
>> Rich
>>
>> I don't recall seeing the response in the terminal about Leptonica.
>>
>> -- 
>> You received this message because you are subscribed to the Google Groups 
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an 
>> email to [email protected].
>> To view this discussion on the web visit 
>> https://groups.google.com/d/msgid/tesseract-ocr/3a998a4a-6a6c-4062-84ca-8719adfb05ffn%40googlegroups.com
>>  
>> <https://groups.google.com/d/msgid/tesseract-ocr/3a998a4a-6a6c-4062-84ca-8719adfb05ffn%40googlegroups.com?utm_medium=email&utm_source=footer>
>> .
>>
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/8e60bbd9-7d15-4f92-8156-99b5dfd338d4n%40googlegroups.com.

Reply via email to