tesseract problem with OCR of tables is known problem - search archive and issue tracker.
Zdenko pi 1. 3. 2019 o 5:13 sachin chavan <[email protected]> napĂsal(a): > I'm also facing the same issue > > On Sat, Feb 23, 2019 at 2:09 AM Russia Aiyappa <[email protected]> > wrote: > >> Tesseract misses the extraction of some words like "Monthly" and "Total" >> (under section V) in the attached form. Upon using the PRImA tools I found >> that "Monthly" was omitted as it wasn't segmented correctly while "Total" >> even though fell under the segmentation region wasn't extracted. >> >> Any idea what could have caused such a behavior and how to fix this? I >> used PSM 3. >> >> Thank you. >> >> -- >> You received this message because you are subscribed to the Google Groups >> "tesseract-ocr" group. >> To unsubscribe from this group and stop receiving emails from it, send an >> email to [email protected]. >> To post to this group, send email to [email protected]. >> Visit this group at https://groups.google.com/group/tesseract-ocr. >> To view this discussion on the web visit >> https://groups.google.com/d/msgid/tesseract-ocr/2df7c54f-bc66-482d-9f77-0fd65a6c2ae0%40googlegroups.com >> <https://groups.google.com/d/msgid/tesseract-ocr/2df7c54f-bc66-482d-9f77-0fd65a6c2ae0%40googlegroups.com?utm_medium=email&utm_source=footer> >> . >> For more options, visit https://groups.google.com/d/optout. >> > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To post to this group, send email to [email protected]. > Visit this group at https://groups.google.com/group/tesseract-ocr. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/CAF_bWh1S4CjndnRmEN12HCs8LOh%2BMSLL81DmqdD3Te95A-rKfA%40mail.gmail.com > <https://groups.google.com/d/msgid/tesseract-ocr/CAF_bWh1S4CjndnRmEN12HCs8LOh%2BMSLL81DmqdD3Te95A-rKfA%40mail.gmail.com?utm_medium=email&utm_source=footer> > . > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAJbzG8yV9NW4P822VA4MxiLU5EjbWg2ECZLg1rG%3DYbktxsxgww%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.

