Thanks for all of the detailed information -- this is very helpful. I've been working from the 3.00 release (which I see isn't even the latest published version now -- I'm further behind the times than I realized!) and will try updating to the latest trunk next week.
Regarding the configuration files, I did try some of the samples included with 3.00, but I got error messages about invalid parameters. Perhaps this has all been fixed by 3.02; I'll follow up if I'm still having trouble after the upgrade. Thanks again for your help! - Demian ________________________________________ From: [email protected] [[email protected]] On Behalf Of TP [[email protected]] Sent: Friday, March 23, 2012 5:21 AM To: [email protected] Subject: Re: Tesseract 3 and paragraph separation On Thu, Mar 22, 2012 at 12:59 PM, Demian Katz <[email protected]> wrote: > I'm using Tesseract 3 as a simple command-line tool to generate OCR. > It's doing a fairly good job, but I have one unmet need -- I need to > be able to separate paragraphs with blank lines. Hmmm, I just tried this on a sample image (something I should have done first), and the latest trunk version (3.02) of tesseract already puts blank lines between paragraphs. -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en

