date:20161012

Re: [tesseract-ocr] Failure to recognize columns

2016-10-12 Thread ShreeDevi Kumar

Which page segmentation mode (psm) did you try? On 12 Oct 2016 11:21 p.m., "fuzzy7k" wrote: > I have scanned some index pages that I would like to ocr for rapid > searching. I am using tesseract from the command line. The problem is that > tesseract ignores the whitespace

[tesseract-ocr] Failure to recognize columns

2016-10-12 Thread fuzzy7k

I have scanned some index pages that I would like to ocr for rapid searching. I am using tesseract from the command line. The problem is that tesseract ignores the whitespace between columns and merges everything together, essentially fragmenting the contents. Using some debug output I see