Re: New line recogniztion

2013-04-06 Thread zdenko podobny
On Fri, Apr 5, 2013 at 11:20 PM, Ruud van Houtum ruudvhou...@gmail.comwrote: Hello, I am using Tesseract to output text files from scanned documents. All text images contain typed text and are fairly clear/clean. So far Tesseract has a pretty good accuracy and I am quite content. However

Re: Can I pass a langue file direct to init? Have any way?

2013-04-06 Thread zdenko podobny
AFAIK this should be fixed in 3.02.02 version (I am using svn version;-) ). Zdenko On Sat, Apr 6, 2013 at 1:29 AM, Patrick Questembert patrick.questemb...@gmail.com wrote: Remember to include /. at the end of TESSDATA_PREFIX, e.g. TESSDATA_PREFIX=/home/ubuntu/mytess/. (where a tessdata

Re: lector (was: please help an absolut beginner)

2013-04-06 Thread Janusz S. Bień
Dnia 6 Kwietnia 2013, 2:28 pm, So, zdenko podobny napisał(a): I am a contributor (e.g. I was not aware that issue) to the project but I have a lack of time (or a lot of interests ;-) ). Anyway I tried to fix it (at least it works for me on openSUSE 12.3) Works now for me on Debian sid (I

Re: hOCR output and ocr_carea

2013-04-06 Thread zdenko podobny
Thanks for idea. I will try to have a look on it. If anybody has patch ready I will welcome it warmly Zdenko On Tue, Apr 2, 2013 at 7:16 AM, Janusz S. Bień jsb...@mimuw.edu.plwrote: The hOCR specification states that ocr_carea is content area which used to be called ocr_column. I've