On Wed, Jul 09, 2014 at 09:48:20AM -0700, Rani Yaroshinski wrote: > From the point of view of the performance measures of the OCR ?
I don't think anybody has figures on this. You could do some tests yourself, and let us know the results. I would guess that file size would be a bigger slowdown than the sorts of compression that are available, so whatever is smallest would be fastest, but that's just a hunch. > If I can choose the format, which should I prefer ? I always prefer PNG, only because it's simple and well-defined, whereas there are lots and lots of different ways of constructing a tiff image. That said I'm pretty sure you can get way smaller (but still losslessly compressed) files with tiff using the correct options, for grayscale / binarised scans. -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at http://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/20140709170638.GD9792%40manta.lan. For more options, visit https://groups.google.com/d/optout.

