Thank you very much!

On 13.11.2018 at 14:01, Vinod Gattani wrote:
Correct command:

tesseract 3tmp3.png.tif s3.txt -l deu --oem 1 --psm 6 -c "textord_min_linesize=3.25"
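
The difference between the two syntaxes can be sketched as follows. A config file holds whitespace-separated "name value" pairs, while -c expects name=value. Without the "=", the trailing 3.25 is apparently treated as the name of a config file, which would explain the "Can't open 3.25" message. The file name linesize.cfg and the output bases below are hypothetical, chosen only for illustration; the image name and options are taken from this thread.

```shell
#!/bin/sh
# Hypothetical names for illustration; substitute your own files.
image="3tmp3.png.tif"
outbase="s3"
config="linesize.cfg"

# Config-file form: one "name value" pair per line, separated by whitespace.
printf 'textord_min_linesize 3.25\n' > "$config"

# Only run OCR when tesseract and the image are both available.
if command -v tesseract >/dev/null 2>&1 && [ -f "$image" ]; then
    # Form 1: -c takes a single name=value argument.
    tesseract "$image" "$outbase" -l deu --oem 1 --psm 6 \
        -c textord_min_linesize=3.25

    # Form 2: pass the config file as the trailing argument.
    tesseract "$image" "${outbase}_cfg" -l deu --oem 1 --psm 6 "$config"
fi
```

Both forms should set the same variable; the config file is mainly convenient when several settings are changed together.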

On Tue, Nov 13, 2018 at 6:29 PM Martin Jenniges <[email protected]> wrote:

    Hello,

    My command line is

    tesseract 3tmp3.png.tif s3.txt -l deu --oem 1 --psm 6 -c textord_min_linesize 3.25

    and I receive

    read_params_file: Can't open 3.25
    Missing in configvar assigment

    Martin


    On 12.11.2018 at 13:01, Zdenko Podobny wrote:
    What kind of error message do you get?
    Please share your image for testing too.

    Zdenko


    On Sun, 11 Nov 2018 at 15:39, Martin Jenniges <[email protected]> wrote:

        Hello,


        I have found the following tip for Tesseract, but when I pass
        this parameter with -c textord_min_linesize 3.25 in
        Tesseract 4, I receive an error message. What is wrong?


                Example 3: Line Size


                  Command

        tesseract image.jpg outputfilename config


                  Command Line Arguments

        None


                  Config Settings

        textord_min_linesize 3.25


                  Notes

          * textord_min_linesize seems to have an effect on the line
            heights detected by Tesseract when it performs the layout
            analysis on the image. The default value for this setting
            is 1.25.
          * When set to 3.25, the "broken" line problem in the
            original baseline output is corrected. Lower settings
            (for example, 3.0) do not correct the "broken" lines.
          * This setting causes other character recognition errors.
          * The text in the output that is highlighted in red is
            again correctly contained on a single line.
          * The words highlighted in blue include extra characters
            that are a result of "noise" (specks and imperfections
            in the image).  None of these have been corrected, but no
            new ones have appeared.
          * Lines between "paragraphs" now appear in somewhat odd
            locations.  Again, there are NO lines between paragraphs
            on the source image.
          * The garbage words at the end of the page do not appear.
          * A small number of errors in individual words that appear
            in the original output were corrected, a few other
            incorrect words changed (but were still incorrect), and a
            small number of correct words are now incorrect.  These
            have been highlighted in purple.

-- You received this message because you are subscribed to the
        Google Groups "tesseract-ocr" group.
        To unsubscribe from this group and stop receiving emails from
        it, send an email to
        [email protected]
        <mailto:[email protected]>.
        To post to this group, send email to
        [email protected]
        <mailto:[email protected]>.
        Visit this group at
        https://groups.google.com/group/tesseract-ocr.
        To view this discussion on the web visit
        
https://groups.google.com/d/msgid/tesseract-ocr/0611f640-034f-3251-932f-e29e6fea4773%40skynet.be
        
<https://groups.google.com/d/msgid/tesseract-ocr/0611f640-034f-3251-932f-e29e6fea4773%40skynet.be?utm_medium=email&utm_source=footer>.
        For more options, visit https://groups.google.com/d/optout.

