Thank you very much!
On 13.11.2018 at 14:01, Vinod Gattani wrote:
Correct command:
tesseract 3tmp3.png.tif s3.txt -l deu --oem 1 --psm 6 -c "textord_min_linesize=3.25"
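For later readers, a minimal sketch of why the earlier call failed: -c expects one NAME=VALUE argument. With a space instead of "=", the value 3.25 becomes a separate argument, and Tesseract appears to treat it as a config file name, which would explain "Can't open 3.25". The file names below are the ones from this thread; the script only prints the command unless tesseract is installed and the image exists.

```shell
#!/bin/sh
# Sketch: pass a Tesseract parameter with -c as one NAME=VALUE argument.
# File names are taken from the thread; adjust them to your own files.
image="3tmp3.png.tif"
outputbase="s3.txt"
param="textord_min_linesize=3.25"   # '=' with no spaces, so this is a single argument

# Build the argument list once so it can be printed and, optionally, executed.
set -- tesseract "$image" "$outputbase" -l deu --oem 1 --psm 6 -c "$param"
echo "$@"

# Run only when tesseract is installed and the image is present.
if command -v tesseract >/dev/null 2>&1 && [ -f "$image" ]; then
    "$@"
fi
```

Quoting "$param" is optional here because the value contains no spaces; it only matters if the value itself has whitespace or shell metacharacters.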
On Tue, Nov 13, 2018 at 6:29 PM Martin Jenniges <[email protected]> wrote:
Hello,
My command line is
tesseract 3tmp3.png.tif s3.txt -l deu --oem 1 --psm 6 -c textord_min_linesize 3.25
and I receive
read_params_file: Can't open 3.25
Missing in configvar assigment
Martin
On 12.11.2018 at 13:01, Zdenko Podobny wrote:
What kind of error message do you get?
Please share your image for testing too.
Zdenko
On Sun, 11 Nov 2018 at 15:39, Martin Jenniges <[email protected]> wrote:
Hello,
I have found the following tip for Tesseract, but when I pass
this parameter with -c textord_min_linesize 3.25 in
Tesseract 4, I receive an error message. What is wrong?
Example 3: Line Size
Command
tesseract image.jpg outputfilename config
Command Line Arguments
None
Config Settings
textord_min_linesize 3.25
Notes
* textord_min_linesize seems to have an effect on the line
heights detected by Tesseract when it performs the layout
analysis on the image. The default value for this setting
is 1.25.
* When set to 3.25, the "broken" line problem in the
original baseline output is corrected. Lower settings
(for example, 3.0) do not correct the "broken" lines.
* This setting causes other character recognition errors.
* The text in the output that is highlighted in red is
again correctly contained on a single line.
* The words highlighted in blue include extra characters
that are a result of "noise" (specks and imperfections
in the image). None of these have been corrected, but no new
ones have appeared.
* Lines between "paragraphs" now appear in somewhat odd
locations. Again, there are NO lines between paragraphs
on the source image.
* The garbage words at the end of the page do not appear.
* A small number of errors in individual words that appear
in the original output were corrected, a few other
incorrect words changed (but were still incorrect), a
small number of correct words now are incorrect. These
have been highlighted in purple.
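The tip quoted above passes the setting through a config file rather than through -c. A minimal sketch of that route, assuming a config file is plain text with one whitespace-separated "name value" pair per line (no "="). The file name linesize.cfg is hypothetical, and image.jpg and outputfilename are the placeholders from the tip. As far as I know, Tesseract first looks for the name under tessdata/configs and otherwise falls back to the path as given, but treat that as an assumption to verify on your installation.

```shell
#!/bin/sh
# Sketch: put the parameter into a config file and pass the file as the last argument.
config="linesize.cfg"   # hypothetical file name

# Config files use "name value" separated by whitespace, not "name=value".
printf '%s\n' 'textord_min_linesize 3.25' > "$config"

# Run only when tesseract is installed and the placeholder image exists;
# otherwise just show the command that would be executed.
if command -v tesseract >/dev/null 2>&1 && [ -f image.jpg ]; then
    tesseract image.jpg outputfilename "$config"
else
    echo "would run: tesseract image.jpg outputfilename $config"
fi
```

This is the form in which "textord_min_linesize 3.25" with a space is valid; on the command line with -c the same setting has to be written with "=".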
--
You received this message because you are subscribed to the
Google Groups "tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from
it, send an email to
[email protected]
<mailto:[email protected]>.
To post to this group, send email to
[email protected]
<mailto:[email protected]>.
Visit this group at
https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit
https://groups.google.com/d/msgid/tesseract-ocr/0611f640-034f-3251-932f-e29e6fea4773%40skynet.be
<https://groups.google.com/d/msgid/tesseract-ocr/0611f640-034f-3251-932f-e29e6fea4773%40skynet.be?utm_medium=email&utm_source=footer>.
For more options, visit https://groups.google.com/d/optout.