I wrote:
JPG's never lossless.
That's what I thought, but ABBYY and IrfanView both have what they
call "lossless" JPG setting. . .
Ah ha. That's not right. The choices are:
JPEG, Color (for photos),
JPEG, Gray (for photos),
LZW, Color (lossless),
LZW, Gray (lossless),
ZIP, Color (lossless),
ZIP, Gray (lossless),
CCITT4, Black and white.
I made a hybrid version, with most pages CCITT4 and some pages (with
figures) JPG gray. You can't mix LZW and CCITT4.
Still working on the de-speckle problem. It turns out it de-speckled
away most of the dots on "i" and "j"s and several periods. It needs
to be tweaked.
Terry Blanton wrote:
Have you tried any of the web based services?
eg http://www.onlineocr.net/
Max file size 20 MB.
<http://www.onlineocr.net/support/KeyFeatures.aspx>http://www.onlineocr.net/support/KeyFeatures.aspx
My input files are 3.4 GB, TIF.
- Jed