Correct command tesseract 3tmp3.png.tif s3.txt -l deu --oem 1 --psm 6 -c " *textord_min_linesize=3.25 "*
On Tue, Nov 13, 2018 at 6:29 PM Martin Jenniges <[email protected]> wrote: > Hello, > > my command-line is tesseract 3tmp3.png.tif s3.txt -l deu --oem 1 --psm 6 > -c *textord_min_linesize 3.25 * > > and receive > > read_params_file: Can't open 3.25 > Missing in configvar assigment > > Martin > > > Am 12.11.2018 um 13:01 schrieb Zdenko Podobny: > > What kind of error message you get? > Please share your image for testing too. > > Zdenko > > > ne 11. 11. 2018 o 15:39 Martin Jenniges <[email protected]> > napĂsal(a): > >> Hello, >> >> >> I have found the follow Tip for tesseract; but when I give this parameter >> with -c *textord_min_linesize 3.25 in tesseract 4, I receive a error >> message. What is wrong ?* >> >> >> >> Example 3: Line Size Command >> >> *tesseract image.jpg outputfilename config* >> Command Line Arguments >> >> None >> Config Settings >> >> *textord_min_linesize 3.25* >> Notes >> >> - textord_min_linesize seems to have an affect on the line heights >> detected by Tesseract when it performs the layout analysis on the image. >> The default value for this setting is 1.25. >> - When set to 3.25, the "broken" line problem in the original >> baseline output is corrected. Lower settings (for example, 3.0) do not >> correct the "broken" lines. >> - This settings causes other character recognition errors. >> - The text in the output that is highlighted in red is again >> correctly contained on a single line. >> - The words highlighted in blue include extra characters that are a >> results of "noise" (specks and imperfections in the image). None of these >> have corrected, but no new ones have appeared. >> - Lines between "paragraphs" now appear in somewhat odd locations. >> Again, there are NO lines between paragraphs on the source image. >> - The garbage words at the end of the page do not appear. >> - A small number of errors in individual words that appear in the >> original output were corrected, a few other incorrect words changed (but >> were still incorrect), a small number of correct words now are incorrect. >> These have been highlighted in purple. >> >> -- >> You received this message because you are subscribed to the Google Groups >> "tesseract-ocr" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to [email protected]. >> To post to this group, send email to [email protected]. >> Visit this group at https://groups.google.com/group/tesseract-ocr. >> To view this discussion on the web visit >> https://groups.google.com/d/msgid/tesseract-ocr/0611f640-034f-3251-932f-e29e6fea4773%40skynet.be >> <https://groups.google.com/d/msgid/tesseract-ocr/0611f640-034f-3251-932f-e29e6fea4773%40skynet.be?utm_medium=email&utm_source=footer> >> . >> For more options, visit https://groups.google.com/d/optout. >> > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To post to this group, send email to [email protected]. > Visit this group at https://groups.google.com/group/tesseract-ocr. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/CAJbzG8yaEio-Rjr7e%2BuptUjf2Ux7bewHRxFpcKjdNC1uQvD2TQ%40mail.gmail.com > <https://groups.google.com/d/msgid/tesseract-ocr/CAJbzG8yaEio-Rjr7e%2BuptUjf2Ux7bewHRxFpcKjdNC1uQvD2TQ%40mail.gmail.com?utm_medium=email&utm_source=footer> > . > For more options, visit https://groups.google.com/d/optout. > > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To post to this group, send email to [email protected]. > Visit this group at https://groups.google.com/group/tesseract-ocr. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/f234e6b8-8537-10c4-dc76-40406c2a5a6d%40skynet.be > <https://groups.google.com/d/msgid/tesseract-ocr/f234e6b8-8537-10c4-dc76-40406c2a5a6d%40skynet.be?utm_medium=email&utm_source=footer> > . > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAN557ayMo6-Pkzw37E%3DWkLHV%2B9u_xOZEFYJNZni0-dtVDFE6gg%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.

