Please try with a different psm and see if you get better results. If you share a sample image we can test and respond.
On Fri, Jun 22, 2018 at 5:29 PM <[email protected]> wrote: > Could someone please try to give me an answer for my language. > > On Friday, June 15, 2018 at 2:42:00 PM UTC+2, [email protected] wrote: >> >> Dear All, >> >> In the project that I am currently working in, I have a pure text line >> cropped from an document image. >> >> As a next step, I need to recognize the text using and at the same time, >> I need to get the words coordinates. >> >> To get that coordinates I am passing the hocr parameters to the command >> line and assign the page segmentation mode to 7 (line). >> >> tesseract file.png out.txt --psm 7 hocr. >> >> However, the output is really bad because by passing these parameters, >> the line will be conisders as a page and some words will not be detected at >> the output. >> >> Is there another way to get the word coordinate of that line? >> > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to [email protected]. > To post to this group, send email to [email protected]. > Visit this group at https://groups.google.com/group/tesseract-ocr. > To view this discussion on the web visit > https://groups.google.com/d/msgid/tesseract-ocr/d24b268f-5cfa-4d20-89c0-9dfd2360f0dc%40googlegroups.com > <https://groups.google.com/d/msgid/tesseract-ocr/d24b268f-5cfa-4d20-89c0-9dfd2360f0dc%40googlegroups.com?utm_medium=email&utm_source=footer> > . > For more options, visit https://groups.google.com/d/optout. > -- ____________________________________________________________ भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at https://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduXL-VCLpqzi3adCuBDwRfBhQ_ksCaqyQ%3DYgiGOwG1bEHg%40mail.gmail.com. For more options, visit https://groups.google.com/d/optout.

