[tesseract-ocr] New user's questions

Dr Rainer Woitok Fri, 21 Feb 2020 11:08:14 -0800

Greetings,

after playing a while  with "tesseract"  and after having read plenty of
manual pages  and documentation on the web  I still have some questions.
I want to create a PDF file with an OCR layer, but:


1. Some of my TIFF files created by "ScanTailor" have light text on dark
   background,  and documentation says to manually invert such files be-
   fore feeding them  to current "tesseract" versions.   But of course I
   want the PDF file to contain the original document with light text on
   dark background.

2. According to the documentation  the TIFF file for OCR-ing should have
   at least 300 dpi.   But for the background image within the final PDF
   document I'd like to use a JP2 file with only 150 dpi and a high com-
   pression rate.

So is it possible  to pass "tesseract"  a high quality image for OCR-ing
and a lesser quality image for building the PDF file with?

Sincerely,
  Rainer

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/24144.4317.65160.558394%40tux.local.

[tesseract-ocr] New user's questions

Reply via email to