Correct command

tesseract 3tmp3.png.tif s3.txt -l deu --oem 1 --psm 6 -c "
*textord_min_linesize=3.25 "*

On Tue, Nov 13, 2018 at 6:29 PM Martin Jenniges <[email protected]>
wrote:

> Hello,
>
> my command-line is tesseract 3tmp3.png.tif s3.txt -l deu --oem 1 --psm 6
> -c *textord_min_linesize 3.25 *
>
> and receive
>
> read_params_file: Can't open 3.25
> Missing in configvar assigment
>
> Martin
>
>
> Am 12.11.2018 um 13:01 schrieb Zdenko Podobny:
>
> What kind of error message you get?
> Please share your image for testing too.
>
> Zdenko
>
>
> ne 11. 11. 2018 o 15:39 Martin Jenniges <[email protected]>
> napĂ­sal(a):
>
>> Hello,
>>
>>
>> I have found the follow Tip for tesseract; but when I give this parameter
>> with -c  *textord_min_linesize 3.25 in tesseract 4, I receive a error
>> message. What is wrong ?*
>>
>>
>>
>> Example 3: Line Size Command
>>
>> *tesseract image.jpg outputfilename config*
>> Command Line Arguments
>>
>> None
>> Config Settings
>>
>> *textord_min_linesize 3.25*
>> Notes
>>
>>    - textord_min_linesize seems to have an affect on the line heights
>>    detected by Tesseract when it performs the layout analysis on the image.
>>    The default value for this setting is 1.25.
>>    - When set to 3.25, the "broken" line problem in the original
>>    baseline output is corrected.  Lower settings (for example, 3.0) do not
>>    correct the "broken" lines.
>>    - This settings causes other character recognition errors.
>>    - The text in the output that is highlighted in red is again
>>    correctly contained on a single line.
>>    - The words highlighted in blue include extra characters that are a
>>    results of "noise" (specks and imperfections in the image).  None of these
>>    have corrected, but no new ones have appeared.
>>    - Lines between "paragraphs" now appear in somewhat odd locations.
>>    Again, there are NO lines between paragraphs on the source image.
>>    - The garbage words at the end of the page do not appear.
>>    - A small number of errors in individual words that appear in the
>>    original output were corrected, a few other incorrect words changed (but
>>    were still incorrect), a small number of correct words now are  incorrect.
>>    These have been highlighted in purple.
>>
>> --
>> You received this message because you are subscribed to the Google Groups
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an
>> email to [email protected].
>> To post to this group, send email to [email protected].
>> Visit this group at https://groups.google.com/group/tesseract-ocr.
>> To view this discussion on the web visit
>> https://groups.google.com/d/msgid/tesseract-ocr/0611f640-034f-3251-932f-e29e6fea4773%40skynet.be
>> <https://groups.google.com/d/msgid/tesseract-ocr/0611f640-034f-3251-932f-e29e6fea4773%40skynet.be?utm_medium=email&utm_source=footer>
>> .
>> For more options, visit https://groups.google.com/d/optout.
>>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> To post to this group, send email to [email protected].
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/CAJbzG8yaEio-Rjr7e%2BuptUjf2Ux7bewHRxFpcKjdNC1uQvD2TQ%40mail.gmail.com
> <https://groups.google.com/d/msgid/tesseract-ocr/CAJbzG8yaEio-Rjr7e%2BuptUjf2Ux7bewHRxFpcKjdNC1uQvD2TQ%40mail.gmail.com?utm_medium=email&utm_source=footer>
> .
> For more options, visit https://groups.google.com/d/optout.
>
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> To post to this group, send email to [email protected].
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/f234e6b8-8537-10c4-dc76-40406c2a5a6d%40skynet.be
> <https://groups.google.com/d/msgid/tesseract-ocr/f234e6b8-8537-10c4-dc76-40406c2a5a6d%40skynet.be?utm_medium=email&utm_source=footer>
> .
> For more options, visit https://groups.google.com/d/optout.
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAN557ayMo6-Pkzw37E%3DWkLHV%2B9u_xOZEFYJNZni0-dtVDFE6gg%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to