Greetings,

after playing a while with "tesseract" and reading plenty of manual pages and documentation on the web, I still have some questions. I want to create a PDF file with an OCR layer, but:
1. Some of my TIFF files created by "ScanTailor" have light text on dark background, and documentation says to manually invert such files before feeding them to current "tesseract" versions. But of course I want the PDF file to contain the original document with light text on dark background.

2. According to the documentation the TIFF file for OCR-ing should have at least 300 dpi. But for the background image within the final PDF document I'd like to use a JP2 file with only 150 dpi and a high compression rate.

So is it possible to pass "tesseract" a high quality image for OCR-ing and a lesser quality image for building the PDF file with?

Sincerely,
Rainer

