Hello,

my command-line is tesseract 3tmp3.png.tif s3.txt -l deu --oem 1 --psm 6 -c /textord_min_linesize 3.25 /

and receive

read_params_file: Can't open 3.25
Missing in configvar assigment

Martin


Am 12.11.2018 um 13:01 schrieb Zdenko Podobny:
What kind of error message you get?
Please share your image for testing too.

Zdenko


ne 11. 11. 2018 o 15:39 Martin Jenniges <[email protected] <mailto:[email protected]>> napísal(a):

    Hello,


    I have found the follow Tip for tesseract; but when I give this
    parameter with -c /textord_min_linesize 3.25 in tesseract 4, I
    receive a error message. What is wrong ?/

    /
    /


            Example 3: Line Size


              Command

    /tesseract image.jpg outputfilename config/


              Command Line Arguments

    None


              Config Settings

    /textord_min_linesize 3.25/


              Notes

      * textord_min_linesize seems to have an affect on the line
        heights detected by Tesseract when it performs the layout
        analysis on the image.  The default value for this setting is
        1.25.
      * When set to 3.25, the "broken" line problem in the original
        baseline output is corrected.  Lower settings (for example,
        3.0) do not correct the "broken" lines.
      * This settings causes other character recognition errors.
      * The text in the output that is highlighted in red is again
        correctly contained on a single line.
      * The words highlighted in blue include extra characters that
        are a results of "noise" (specks and imperfections in the
        image).  None of these have corrected, but no new ones have
        appeared.
      * Lines between "paragraphs" now appear in somewhat odd
        locations.  Again, there are NO lines between paragraphs on
        the source image.
      * The garbage words at the end of the page do not appear.
      * A small number of errors in individual words that appear in
        the original output were corrected, a few other incorrect
        words changed (but were still incorrect), a small number of
        correct words now are incorrect.  These have been highlighted
        in purple.

-- You received this message because you are subscribed to the Google
    Groups "tesseract-ocr" group.
    To unsubscribe from this group and stop receiving emails from it,
    send an email to [email protected]
    <mailto:[email protected]>.
    To post to this group, send email to
    [email protected]
    <mailto:[email protected]>.
    Visit this group at https://groups.google.com/group/tesseract-ocr.
    To view this discussion on the web visit
    
https://groups.google.com/d/msgid/tesseract-ocr/0611f640-034f-3251-932f-e29e6fea4773%40skynet.be
    
<https://groups.google.com/d/msgid/tesseract-ocr/0611f640-034f-3251-932f-e29e6fea4773%40skynet.be?utm_medium=email&utm_source=footer>.
    For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected] <mailto:[email protected]>. To post to this group, send email to [email protected] <mailto:[email protected]>.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAJbzG8yaEio-Rjr7e%2BuptUjf2Ux7bewHRxFpcKjdNC1uQvD2TQ%40mail.gmail.com <https://groups.google.com/d/msgid/tesseract-ocr/CAJbzG8yaEio-Rjr7e%2BuptUjf2Ux7bewHRxFpcKjdNC1uQvD2TQ%40mail.gmail.com?utm_medium=email&utm_source=footer>.
For more options, visit https://groups.google.com/d/optout.


--
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/f234e6b8-8537-10c4-dc76-40406c2a5a6d%40skynet.be.
For more options, visit https://groups.google.com/d/optout.

Reply via email to