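Use --psm 6 (page segmentation mode 6, which assumes a single uniform block of text) instead of the default. The default mode, 3, runs fully automatic page segmentation, which is what splits your table into columns and reads each column top to bottom.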
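In case it helps, here is a minimal sketch of the call from Python. It only wraps the standard command line (tesseract IMAGE OUTPUTBASE --psm 6 hocr); the file names table.png and table_out and the helper names build_hocr_command and run_hocr are placeholders I made up for illustration, not part of Tesseract.

```python
import shutil
import subprocess


def build_hocr_command(image_path, output_base, psm=6):
    """Return the tesseract argv that writes <output_base>.hocr.

    psm=6 asks Tesseract to treat the page as a single uniform block of
    text instead of running its automatic column detection.
    """
    return ["tesseract", image_path, output_base, "--psm", str(psm), "hocr"]


def run_hocr(image_path, output_base, psm=6):
    """Run tesseract and return the path of the hOCR file it writes."""
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract executable not found on PATH")
    subprocess.run(build_hocr_command(image_path, output_base, psm), check=True)
    return output_base + ".hocr"


if __name__ == "__main__":
    # Hypothetical file names; print the command rather than running it.
    print(" ".join(build_hocr_command("table.png", "table_out")))
```

Whether the lines then come out strictly row by row still depends on the image, so check the line order in the resulting .hocr file.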

On Mon, Jul 13, 2020, 22:05 Deepak Sen <[email protected]> wrote:

> Hi,
>
> I am using the latest Tesseract version to get the hOCR output of a table.
> The line number of (column 2, row 1) is not line 1: Tesseract first goes
> through all the rows in column 1 and then moves on to column 2. I want it
> to read row by row instead: row 1 (all columns), then row 2 (all columns).
>
> Thanks, I hope my question is clear.
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/1acfffc9-2d2b-4ded-9fbb-4d8fe647880an%40googlegroups.com

