Make sure the command and parameters/options are in proper order.
 
Usage:tesseract.exe imagename outputbase|stdout [-l lang] [-psm 
pagesegmode] [configfile...]

On Friday, January 3, 2014 5:56:50 PM UTC-6, Benjamin Sølberg wrote:

> Have given it a try.
>
> The output is now in one block as needed, thats good.
>
> But the problem now seems to be that it does not take my training data 
> into much account.
> Special chars are no longer reconized.
>  I guess the "-psm 6" option makes it stop earlier in the process.
> It it possible to just make it skip the segmentation process and have the 
> rest as usual ?
> I am just taking a pure guess here on how it works.
>
> Benjamin
>
> Den fredag den 3. januar 2014 20.02.24 UTC+1 skrev Benjamin Sølberg:
>>
>> Thank you, i'll try that.
>>
>> Is it possible to achieve the same functionality by using a config 
>> parameter as I also need to run this on an iPhone ?
>>
>> Regards
>> Benjamin
>>
>> Den fredag den 3. januar 2014 17.48.06 UTC+1 skrev Quan Nguyen:
>>>
>>> Try with PSM 4, 5, or 6.
>>>
>>> On Thursday, January 2, 2014 6:12:53 PM UTC-6, Benjamin Sølberg wrote:
>>>
>>>> Hi all
>>>>
>>>> I am training tesseract to work with a custom font.
>>>> Things are moving forward but there are clouds in the sky.
>>>>
>>>> When using tesseract it insists to cut the texts into sections.
>>>> I understand why as the textual layout may seems to be column based.
>>>> The text is very much like an excel sheet with columns.
>>>> But I would very much like tesseract to give me all the text in one 
>>>> giant line based blob instead of many sections.
>>>>
>>>> Is it possible to make tesseract not chop up the image into (columns in 
>>>> my case) ?
>>>>
>>>> Regards
>>>> Benjamin
>>>>
>>>>

-- 
-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

--- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/groups/opt_out.

Reply via email to