What psm are you using?

On Tue, Feb 11, 2020, 20:46 KOLLOL CHOWDHURY <[email protected]> wrote:
> Hi,
>
> There are certain pages with multiple columns, and when I try to OCR them,
> it doesn't recognise the columns and instead takes all the words across a
> given line.
>
> I am using Tesseract 4.01 and trying to output an hOCR/PDF file.
>
> Any help will be appreciated.
>
> TIA
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> To view this discussion on the web visit
> https://groups.google.com/d/msgid/tesseract-ocr/fce508c8-d2aa-4d90-b4c5-b7546dea6aee%40googlegroups.com.
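
For anyone following the thread: "psm" is Tesseract's page segmentation mode, set with `--psm N` on the command line. Modes 4 and 6 assume a single column or a single uniform block of text, which can produce the "reads straight across the columns" symptom described above, while modes 1 and 3 attempt automatic layout analysis. Below is a minimal sketch, assuming the `tesseract` executable is on PATH, of how one might pass the mode explicitly while requesting hOCR and PDF output. The helper names and the file names are hypothetical, chosen only for illustration, and this is not necessarily the fix for the original poster's pages.

```python
import shutil
import subprocess

# Page segmentation modes (see `tesseract --help-psm`) relevant to column layout.
PSM_AUTO_OSD = 1       # automatic page segmentation with orientation/script detection
PSM_AUTO = 3           # fully automatic page segmentation, no OSD (the default)
PSM_SINGLE_COLUMN = 4  # assume a single column of text of variable sizes
PSM_SINGLE_BLOCK = 6   # assume a single uniform block of text


def build_tesseract_cmd(image_path, output_base, psm=PSM_AUTO, formats=("hocr", "pdf")):
    """Build the argument list: tesseract IMAGE OUTPUTBASE --psm N CONFIG..."""
    # Tesseract 4.x accepts psm values 0 through 13.
    if not 0 <= psm <= 13:
        raise ValueError(f"psm must be between 0 and 13, got {psm}")
    # 'hocr' and 'pdf' are config file names that select the output renderers.
    return ["tesseract", image_path, output_base, "--psm", str(psm), *formats]


def run_ocr(image_path, output_base, psm=PSM_AUTO):
    """Run Tesseract; writes OUTPUTBASE.hocr and OUTPUTBASE.pdf on success."""
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract executable not found on PATH")
    cmd = build_tesseract_cmd(image_path, output_base, psm)
    subprocess.run(cmd, check=True)  # raises CalledProcessError on failure
    return cmd


# Hypothetical usage: compare automatic layout analysis against single-block mode.
# run_ocr("page.png", "page_auto", psm=PSM_AUTO_OSD)
# run_ocr("page.png", "page_block", psm=PSM_SINGLE_BLOCK)
```

Comparing the hOCR output of the two runs on the same page is one quick way to check whether the segmentation mode is what is merging the columns.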

