Stephane Racle wrote:

Funny you mention that. That's what I tried to use yesterday when I put up the excerpt from CGF #1. It did a reasonable job (actually, an excellent job), for anything with a light background. The problem was with text areas with darker backgrounds (for example, the Ultima tidbit and the Data East/Epyx lawsuit). Acrobat tended to identify those as image zones and didn't recognize the text. And unfortunately the Acrobat plug-in doesn't give you the option of identifying which zones are text and which are not. So that's why I didn't bother with OCR for now.

Dedicated OCR programs properly handle areas like that. TextBridge Pro 11, in my experience, does a great job (better than omnipage). It also outputs PDF.


I'm looking into other option - one possibility is Adobe Acrobat Capture, which is a much more robust solution. However, I think it's a fairly expensive program (in an case, definitely not free). In the mean time, if anyone has other suggestions... please share!

Acrobat Capture is servicable but not as good as dedictated OCR. I used to use it for software manuals, but to get the best results I had to scan to TIFs first, then clean them up as much as possible (no staple marks, no smudges, etc.) before running through Capture. It created more work than it saved.
--
Jim Leonard ([EMAIL PROTECTED]) http://www.oldskool.org/
Want to help an ambitious games project? http://www.mobygames.com/
Or check out some trippy MindCandy at http://www.mindcandydvd.com/


----------------------------------------------------------------------
This message was sent to you because you are currently subscribed to
the swcollect mailing list. To unsubscribe, send mail to [EMAIL PROTECTED] with a subject of 'unsubscribe swcollect'
Archives are available at: http://www.mail-archive.com/[EMAIL PROTECTED]/


Reply via email to