Which page segmentation mode (psm) did you try?
On 12 Oct 2016 11:21 p.m., "fuzzy7k" wrote:
> I have scanned some index pages that I would like to ocr for rapid
> searching. I am using tesseract from the command line. The problem is that
> tesseract ignores the whitespace
I have scanned some index pages that I would like to ocr for rapid
searching. I am using tesseract from the command line. The problem is that
tesseract ignores the whitespace between columns and merges everything
together, essentially fragmenting the contents. Using some debug output I
see
2 matches
Mail list logo